Raj Katta

(Raja Sekhar Katta)
New York, NY

Summary

Dynamic Data Architect with 9+ years of experience specializing in data modeling, ETL management, and data quality monitoring. Proven track record in optimizing data architectures and enhancing data integrity across complex systems. Adept at collaborating with cross-functional teams to drive improvements in data transfer and analysis. Passionate about implementing innovative solutions using advanced technologies such as Snowflake, Airflow, DBT, and Elementary to ensure robust and reliable data frameworks.

Overview

8 years of professional experience

Work History

Sr Data Architect

Thrive Market
06.2022 - Current

Key Achievements:

  • Optimized Data Processing Architecture: Enhanced UTM validations and improved data refresh frequency, resulting in an 80% reduction in processing time and significant cost savings.
  • Streamlined EDW Implementation: Developed standardized architecture and automated processes to facilitate efficient data integration and management across systems.
  • DMS Decommissioning Support: Established foundational architecture for transitioning to a new data environment, ensuring smooth migration and reduced operational risks.
  • Data Quality Improvement: Integrated a robust data quality system into the existing architecture, enhancing data integrity and reliability across teams.
  • Cost Reduction Initiatives: Implemented architectural strategies that resulted in a projected annual savings of $80-90K through optimized data retention and query execution schedules.
  • Enhanced Scalability and Efficiency: Designed an architecture that significantly reduces query processing times and increases scalability, enabling effective resource utilization across multiple projects.

Environment: DBT (Data Build Tool), Snowflake, SQL, Python, AWS, Redshift, DMS, Airflow, Elementary

Sr Data Architect

Sema4
03.2021 - 06.2022

Key Achievements:

  • Automated Data Transformation Architecture: Established a robust pipeline using Airflow to automate over 20 manual processes, streamlining data transformation and enhancing workflow efficiency.
  • Modular Design for Cancer Data: Developed new modules to integrate various cancer types and test results from diverse health systems, improving data accessibility for analysis.
  • Standardization with OMOP Model: Transformed cancer data into the OMOP common data model, ensuring compatibility and acceptance across the healthcare industry for better data utilization.
  • SQL Query Optimization: Increased efficiency by optimizing SQL queries, significantly reducing the time required to execute data processing modules and improving overall performance.
  • Enhanced Data Analysis Capabilities: Provided structured and standardized data to support doctors and researchers in analyzing cancer metrics, facilitating more informed decision-making.
  • Improved Collaboration Across Systems: Designed an architecture that enhances collaboration and data sharing between healthcare systems, promoting comprehensive cancer research and insights.

Environment: SQL, Python, Airflow, Redshift, AWS services

Lead Data Engineer

ClickUp
10.2020 - 03.2021

Key Achievements:

  • Built Comprehensive Data Architecture: Developed the data architecture from the ground up by understanding diverse business use cases across the organization, ensuring alignment with company objectives.
  • Automated Data Transformation: Utilized Apache Airflow to schedule daily jobs, streamlining data transformation processes and enhancing operational efficiency.
  • Implemented Data Vault Modeling: Created data vault schemas that enable easy analysis by teams, promoting data accessibility and collaboration across ClickUp.
  • Integrated Diverse Data Sources: Successfully migrated large volumes of data from Redshift and RDS to Snowflake, consolidating data from eight sources into a common data lake with optimized schemas.

Environment: SQL, Python, Snowflake, Redshift, Postgres, Fivetran, RDS, AWS services

Data Developer/Analyst Consultant

FreeWheel Media Inc
08.2018 - 06.2020

Key Achievements:

  • Developed Visual Analytics Dashboards: Created Looker and QuickSight dashboards to effectively visualize business use cases, enhancing client engagement and insights.
  • Delivered Tailored Data Solutions: Designed and implemented solutions to meet client data requests using AWS resources like Lambda and Databricks, with DataDog for monitoring.
  • Engineered Snowflake Data Architecture: Built a robust data architecture in Snowflake to support internal business intelligence analytics across FreeWheel.
  • Managed Client Relationships: Successfully managed two clients over eight months, providing customized visualizations and data solutions tailored to their specific requirements.

Environment: Databricks, SQL, Looker, QuickSight, Lambda, ECS, Spark, DataDog, Snowflake, Excel

Big Data Developer

FreeWheel Media Inc
09.2017 - 07.2018

Key Achievements:

  • Led Large-Scale Data Projects: Managed comprehensive data projects encompassing data modeling, ETL development, and data warehousing to support organizational goals.
  • Designed Cloud-Focused Data Architecture: Developed an efficient and scalable data architecture that is GDPR compliant, facilitating targeted customer analysis for analysts.
  • Implemented Robust Security Measures: Planned and executed security protocols to safeguard sensitive data, ensuring compliance with industry regulations.
  • Established Data Accuracy Verification: Developed processes to verify data accuracy, enhancing the reliability and integrity of analytics across the organization.

Environment: Go, Bash, SQL, Python, Airflow, AWS (EC2, Athena, S3, QuickSight), Azkaban

Graduate Research Assistant

Bradley University
10.2016 - 12.2016
  • Developed machine learning models to predict the riskiest node from the identified metrics.
  • Conducted research to predict and formulate the resilience of nodes in a supply chain based on various factors.
  • Designed and developed risk metrics and tested their efficiency.

Environment: C#, Python, Neural Network model, Confidence Factor model, Decision tree model

Education

Master of Science - Computer Science

Bradley University
Peoria

Bachelor of Science - Electronics and Communication

Sastra University
Tamil Nadu

Skills

  • Data Warehousing: Snowflake, Redshift, Athena, RDS, DynamoDB
  • Data Modeling & Transformation: DBT, Data Vault, Data Lake Architecture
  • Data Quality & Monitoring: Elementary, Metaplane
  • Cloud Services (AWS): EC2, IAM, Lambda, Glue, S3, QuickSight, DMS
  • ETL & Workflow Orchestration: Apache Airflow, Azkaban, Databricks
  • Data Visualization: Looker, Domo
  • Programming Languages: SQL, Python, Go, Bash, Scala, C, Presto, PostgreSQL, NoSQL

Projects

Ad Skipping Analytics

Objective: Analyzed ad receptivity metrics on traditional and addressable platforms.

Achievements:

  • Formed a team that won the FreeWheel Hackathon among 50 teams, earning a prize to attend AWS re:Invent.
  • Received visibility and input from various business leaders, resulting in a white paper and personal appreciation from the CEO.
  • Conducted comprehensive analysis of ad skippers using linear set-top box and digital data.
  • Established correlation between linear ad skipping habits and digital ad receptivity, and analyzed user ad viewing habits by vertical.

Technologies: Databricks, Scala, S3, Athena, Looker

Volunteering

Minds Matter
Mentor & Team Lead | 7 years

  • Mentorship: Served as a mentor for three years, attending Saturday sessions to support my mentee, Britney, in her college journey, leading to her acceptance at Boston University and securing four scholarships.
  • Personal Growth: Witnessed significant growth in Britney over the years, fostering her academic and personal development.
  • Team Leadership: Currently in my fourth year as a Team Lead, managing eight mentees and sixteen mentors.
  • Mentor-Mentee Pairing: Analyze interests and personalities to effectively pair mentors with mentees, enhancing the mentorship experience.
  • Team Building: Organize bonding activities to strengthen relationships among mentors and mentees, promoting a supportive community.
