Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Abhishek Regalla

Abhishek Regalla

Sr. Data Engineer
Dallas,TX

Summary

Results-driven Data Engineer with 6 years of experience designing and optimizing scalable data pipelines, cloud infrastructure, and big data solutions. Proven track record of improving operational efficiency and delivering actionable insights for clients across industries. Skilled in Python, PySpark, Snowflake, AWS,Azure and GCP, with a strong focus on data ingestion, transformation, and analytics.

Overview

8
8
years of professional experience
5
5
years of post-secondary education
3
3
Certifications
4
4
Languages

Work History

Volunteer

Habitat for Humanity
01.2023 - Current
  • Assisted in organizing and promoting fundraising events and community engagement activities

Data Engineer

Horizon Media, Inc.(Via Cyma Systems,Inc)
08.2022 - Current
  • Developing Python API's to extract data from clients and ingesting into cloud warehouse (Snowflake)
  • Designed and implemented scalable ETL pipelines using PySpark and AWS EMR, reducing data processing time by 40%.
  • Leveraged AWS Lambda and S3 to automate data ingestion and staging, improving data availability by 30%.
  • Leveraged Databricks to process and analyze terabytes of data, reducing processing time by 20% and enabling efficient data storage and querying in Snowflake
  • Worked with over 40 clients to understand and cleanse data, align it with transactions, and provide actionable insights for targeting customers, contributing to one of the most profitable teams in the company
  • Developed robust matching algorithms using PII elements (Email, Phone, Maid, Address), achieving a 90% match accuracy rate for client data.
  • Analyzed client data to identify high-value customer segments, resulting in a 20% increase in sales through targeted campaigns.
  • Deployed Apache Airflow to automate and monitor data workflows, reducing manual intervention by 50%.
  • Contact : Nathan Keane
  • Email nkeane@horizonmedia.com

Data Analytics Engineer

Bayer (Via Cyma Systems,Inc)
05.2021 - 08.2022
  • Identified high-value customer segments through Amazon Marketing Cloud, saving $1M in ad spend.
  • Led a team of 3 offshore developers to maintain data ingestion pipelines, ensuring 99.9% uptime.
  • Processed and analyzed large datasets using Databricks, optimizing query performance by 35%.
  • Analyzed sales data to identify trends and optimize distribution, increasing sales by 15%.
  • Led the analysis of Kantar customer data to evaluate drug reach, integrating findings with Bayer sales data to identify trends and drive data-driven decision-making for marketing and sales teams
  • Involved in gathering business requirements, analyzing the project, and creating Use Cases and Design and document for new requirements


  • Contact : Pitchi R +14709858478

Data Engineer

AT&T(Via Cyma Systems)
02.2020 - 05.2021
  • Developing Python API's to extract data from clients and ingesting into cloud warehouse (Snowflake)
  • Collaborated with the data science team to build a model generating engagement scores for 3rd-party sources, improving campaign targeting by 20%.
  • Migrated 10TB of data from AWS to Azure, reducing storage costs by 15%.
  • Migrated Data from Teradata & Sql Server to Azure Using Synapse Analytics.
  • Built data pipelines using Azure Synapse Analytics, improving data processing efficiency by 25%.
  • Applied post-processing scripts to differentiate prospect and live customers, improving campaign accuracy by 30%.

Software Developer

Telearc Technologies
05.2017 - 07.2018
  • Built and optimized Hadoop MapReduce programs to transform text files into AVRO format and load them into Hive tables for scalable data processing.
  • Utilized Hive for data analytics and Sqoop for exporting metrics to Oracle, enabling cross-platform data integration.
  • Engineered data pipelines using Flume to ingest log files into HDFS for streamlined processing and analysis.
  • Developed advanced MySQL queries to filter and analyze data, supporting business intelligence and reporting.
  • Designed ETL workflows in Talend for data profiling, migration, and transformation, ensuring high data quality and reliability.

Education

Master of Science -

University of Houston Clear - Lake
08.2018 - 12.2019

Bachelor of Technology - undefined

Andhra University
08.2013 - 04.2017

Skills

PySpark

undefined

Accomplishments

Automated data workflows and reduced manual intervention by 50% and improving data processing efficiency.

Supervised a team of 3 offshore developers to maintain data ingestion pipelines, ensuring 99.9% uptime and timely delivery of analytics insights.

Successfully migrated 5TB of legacy data to a modern cloud-based data warehouse (Snowflake), reducing query times by 30% and improving data accessibility.

Certification

AWS Certified Developer Associate

Timeline

Google Cloud Certified - Professional Data Engineer

11-2024

Google Analytics (GA4)

09-2024
Volunteer - Habitat for Humanity
01.2023 - Current
Data Engineer - Horizon Media, Inc.(Via Cyma Systems,Inc)
08.2022 - Current
Data Analytics Engineer - Bayer (Via Cyma Systems,Inc)
05.2021 - 08.2022

AWS Certified Developer Associate

08-2020
Data Engineer - AT&T(Via Cyma Systems)
02.2020 - 05.2021
University of Houston Clear - Lake - Master of Science,
08.2018 - 12.2019
Software Developer - Telearc Technologies
05.2017 - 07.2018
Andhra University - Bachelor of Technology,
08.2013 - 04.2017
Abhishek RegallaSr. Data Engineer