Leela Nimmagadda

Dallas, Texas

Summary

Data Engineer with 5+ years of experience in Azure Cloud (Data Factory, Databricks, Synapse), Snowflake, and PySpark. Specializes in building scalable data pipelines and carrying out cost-effective cloud migrations. Demonstrated success in ETL/ELT optimization, multi-cloud integrations (Azure/GCP), and real-time streaming (Kafka, Spark). Certified in Azure Data Engineering, Databricks, and Apache Spark. Track record includes a 25% cost reduction and 40% faster data processing through automation and architecture improvements. Uses advanced SQL and Python to design, optimize, and maintain robust data architectures that improve data integrity and support informed decision-making.

Overview

6 years of professional experience
1 Certification

Work History

Data Engineer

Quad Light Corp
04.2022 - Current
  • Migrated 2 TB of legacy logistics data from SQL Server to Google Cloud Platform (GCP) using Apache Airflow DAGs.
  • Used cost-optimized cold storage tiers to reduce annual infrastructure spend by 25%.
  • Automated high-volume ETL/ELT pipelines (PySpark) processing 500K+ daily records, improving data delivery speed by 40%.
  • Created incremental load automation (change data capture) to slash batch processing time from 8 hours to 45 minutes.
  • Built and deployed real-time data pipelines for 15+ APIs and external sources (including IoT sensors).
  • Managed credentials for 20+ Airflow DAG pipelines with Key Vault, improving security and meeting compliance requirements.
  • Developed an incremental data ingestion process for CSAT survey data to support real-time reporting.
  • Collaborated with Power BI teams to optimize SQL queries, indexing strategies, and stored procedures, reducing dashboard load times by 25% and enabling real-time financial trend analysis for executive stakeholders.

Data Engineer

EPT IT SOLUTIONS
09.2019 - 08.2021
  • Automated the ingestion of Avro, JSON, and Delta data into Azure Data Lake using Azure Data Factory and Databricks, saving over 30 hours of manual work per month.
  • Improved query performance by 35% and reduced data redundancy by 25%.
  • Cut Azure Synapse and Azure Data Lake Storage costs by 15% through partitioning and compute optimization.
  • Wrote reusable Python scripts to automate data extraction and transformation, reducing data load times by 20% and compute costs by 15% annually.
  • Contributed to the migration of 3M+ sensitive customer records to Azure, deploying data masking in Databricks to reduce potential data breach exposure by 90% and increase data trust.

Education

Master’s - Computer Science

University of Dayton
Dayton, OH
01.2023

Skills

  • Cloud Platforms: Azure (Data Factory, Databricks, Synapse, ADLS), GCP (BigQuery, GCS), AWS
  • Databases: Snowflake, PostgreSQL, Oracle, DB2, Delta Lake, Hadoop, Redshift
  • Languages: Python (Pandas, PySpark), SQL, Spark SQL
  • Data Tools: Apache Kafka, Airflow, Azure DevOps, Docker, CI/CD (GitHub Actions), Jira, Unity Catalog, Databricks
  • Data Modeling: Medallion Architecture, SCD2, CDC, OLAP/OLTP, Data Lake/Warehouse
  • Analytics: Power BI, Spark Structured Streaming, IoT Data Processing, KPIs, ETL Development, Data Warehousing

Certification

  • Databricks Certified Data Engineer Professional | 2024
  • Microsoft Certified Azure | 2024
  • Apache PySpark certification, Udemy | 2024
  • PCEP Certified Python Programmer | 2023
  • Data Engineering Foundations certificate, LinkedIn | 2023

Accomplishments

  • Improved data retrieval speeds by 40% by optimizing PySpark jobs (parallelism tuning, broadcast joins) and SQL queries (indexing, query plan analysis) for enterprise logistics datasets, supporting faster reporting for 500+ daily users.
  • Increased logistics ETA accuracy by 15% by integrating weather and traffic API data.
  • Enhanced query performance by 25% by implementing clustering strategies.
