Ahalya Reddy Choda

Leawood

Summary

Results-driven Data Engineer with 3+ years of experience in cloud-based data solutions, ETL pipelines, and real-time streaming. Proficient in AWS, Azure, GCP, Spark, Kafka, Snowflake, and Airflow, with expertise in Python, SQL, and Scala. Skilled in big data processing, database optimization, and CI/CD automation using Terraform, Jenkins, and Git. Passionate about building scalable, high-performance data architectures to drive business insights.

Overview

4 years of professional experience
1 Certification

Work History

Azure Data Engineer

Invista
08.2023 - 12.2024
  • Designed and optimized ETL pipelines using Azure Data Factory (ADF), Databricks, and Spark for efficient data migration and transformation.
  • Integrated Kafka streaming to enable real-time data ingestion and processing for event-driven architectures.
  • Developed PySpark jobs in Databricks to transform structured and semi-structured data, improving query performance.
  • Automated CI/CD pipelines using Jenkins, Terraform, and Git, reducing deployment time by 40%.
  • Designed Azure Log Analytics dashboards for proactive monitoring of pipeline execution and failures.
  • Conducted Snowflake performance tuning, reducing query execution times by 50% through query optimizations and indexing.
  • Implemented Infrastructure as Code (IaC) using Terraform to provision cloud resources efficiently.

GCP Data Engineer

Thyrocare Technologies Limited
01.2022 - 04.2023
  • Developed BigQuery-based ETL pipelines and automated workflows using Cloud Composer (Airflow).
  • Migrated on-prem databases to Google BigQuery, achieving a 30% reduction in operational costs.
  • Designed and implemented real-time data pipelines with Google Pub/Sub and Dataflow for event-driven processing.
  • Created cost-optimization reports in Google Data Studio, monitoring billing and service usage analytics.
  • Built Dataproc Spark jobs for large-scale data processing and analytics across distributed datasets.
  • Automated Airflow-based cron jobs for seamless scheduling and execution of critical ETL tasks.

Data Engineer

Ingredion Incorporated
01.2021 - 12.2021
  • Developed ETL jobs in Amazon Redshift and Snowflake, integrating structured and semi-structured data sources.
  • Built PySpark-based data transformation jobs in Databricks, enhancing analytics and reporting efficiency.
  • Implemented Kafka-based streaming data ingestion, enabling real-time insights for business operations.
  • Designed and optimized T-SQL queries, stored procedures, and triggers to improve database performance.
  • Developed containerized applications with Docker and Kubernetes, streamlining data processing workflows.

Education

Master of Science - Computer Science

University of Central Missouri
Warrensburg, MO
12.2024

Skills

  • Cloud: AWS (Redshift, S3, EMR), Azure (ADF, Databricks, Synapse), GCP (BigQuery, Dataflow)
  • Big Data: Spark, Hadoop, Kafka, Hive, Sqoop
  • Databases: Snowflake, SQL Server, Oracle, MySQL, MongoDB
  • ETL & Data Pipelines: Airflow, ADF, Talend, Informatica
  • Programming: Python, SQL, Scala, Java, PowerShell
  • DevOps & Automation: Terraform, Jenkins, Git, Docker, Kubernetes

Certification

  • Microsoft Certified: Azure Data Fundamentals
  • Google Cloud Certified Data Engineer
  • AWS Certified Data Engineer
