
Suryatej Jakka

Naperville, IL

Summary

Data Engineer with over 5 years of experience building scalable ETL/ELT pipelines, real-time streaming solutions, and cloud-native data platforms across AWS, Azure, and GCP. Proficient in Spark, PySpark, SQL, and Snowflake, with expertise in data modeling, pipeline automation, and cost optimization. Proven track record of delivering secure, analytics-ready datasets that drive business outcomes.

Overview

6 years of professional experience
1 Certification

Work History

Data Engineer

Wayfair
07.2024 - Current
  • Built pipelines processing over 50 million daily events using Dataflow and Pub/Sub, enabling analytics with less than 5 minutes of latency.
  • Optimized BigQuery queries (partitioning, clustering), improving performance by 40% and cutting costs by 35%.
  • Integrated Great Expectations with Airflow, preventing 95% of bad data from propagating.
  • Enhanced data quality by performing thorough cleaning, validation, and transformation tasks.

Data Engineer

HDFC
05.2021 - 07.2023
  • Designed ETL pipelines in Azure Data Factory, processing over 10 TB of monthly financial data.
  • Implemented Delta Lake for ACID compliance, cutting manual corrections by 70%.
  • Built Power BI dashboards from Synapse datasets, enabling real-time AML monitoring.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.

Data Engineer

Verizon
06.2019 - 05.2021
  • Developed ETL pipelines with AWS Glue and PySpark, reducing storage costs by 25%.
  • Designed Kinesis streaming pipelines, cutting outage detection time by 60%.
  • Migrated on-premises Oracle jobs to AWS Glue, saving $150K per year.
  • Documented and communicated database schemas using accepted notations.

Education

Master of Science - Technology Management

Lindsey Wilson College
Bowling Green, KY

Skills

  • Data pipeline development
  • BigQuery optimization
  • Data quality integration
  • Cloud data engineering
  • SQL and Python
  • Cloud data platforms (AWS, Azure, GCP)
  • Data modeling (star, snowflake, Delta Lake)
  • ETL/ELT processes and workflow automation
  • Data warehousing
  • NoSQL databases
  • Teamwork and collaboration
  • Relational databases

Certification

AWS Certified Cloud Practitioner
