Summary
Overview
Work History
Education
Skills
Websites
Timeline
Generic

Siva Krishna Kandula

Irvine,CA

Summary

Senior Data Engineer with 8+ years of experience designing and implementing scalable data pipelines, real-time streaming systems, and data lake architectures. Expert in cloud-native solutions using AWS (S3, Glue, Lake Formation, Lambda, Kinesis), Apache Spark, and PostgreSQL. Strong programming background in Python, with working knowledge of Golang and Terraform. Adept at optimizing data flows, supporting ML training pipelines, and enforcing data quality, security, and governance. Highly collaborative and agile-focused professional with a track record of delivering robust, high-performance data systems.

Overview

10
10
years of professional experience

Work History

Senior Data Engineer

Edward Jones
12.2023 - Current
  • Built scalable ETL pipelines using AWS Glue, Lambda, and S3 to handle high-volume batch and real-time data.
  • Designed and implemented real-time streaming pipelines with Kafka and AWS Kinesis for mission-critical data ingestion.
  • Developed and managed AWS Lake Formation-based data lakes, ensuring secure and efficient data access.
  • Tuned PostgreSQL queries and applied partitioning to improve query performance and support analytics scale.
  • Partnered with data scientists to version datasets and support ML model training and evaluation pipelines.
  • Automated infrastructure provisioning using Terraform, improving deployment consistency and reducing setup time.
  • Implemented data quality validation and monitoring frameworks using Python and AWS-native tools.
  • Led Agile sprint planning and code reviews, mentoring junior engineers on best practices and cloud data architecture.

Data Engineer

Infotrack
04.2015 - 04.2023
  • Designed and deployed ETL pipelines in Python and SQL for processing structured and semi-structured datasets.
  • Led the migration of legacy data platforms to AWS S3 and Redshift, increasing scalability and reducing operational overhead.
  • Built Kafka-driven ingestion pipelines to support real-time data acquisition across internal applications.
  • Optimized PostgreSQL schemas and SQL queries, reducing report generation time by 60%.
  • Supported Tableau dashboards by delivering reusable, documented datasets with business logic embedded.
  • Ensured compliance with data governance standards, developing catalogs, lineage diagrams, and access policies.
  • Enabled machine learning initiatives by preparing clean, validated datasets aligned with model requirements.
  • Collaborated across teams to define data workflows, deployment standards, and documentation practices.

Education

Master of Science - Data Analytics

McDaniel College
Westminster, MD
05.2025

Skills

  • Python
  • SQL
  • Git
  • Terraform
  • AWS (S3, Glue, Lake Formation, Lambda, Kinesis, Redshift)
  • Apache Spark
  • Kafka
  • ETL Development
  • Data Lake Architecture
  • Real-Time Data Streaming
  • Data Modeling
  • PostgreSQL
  • DynamoDB
  • Linux Administration
  • AWS Glue Catalog
  • Data Validation
  • Tableau
  • Agile

Timeline

Senior Data Engineer

Edward Jones
12.2023 - Current

Data Engineer

Infotrack
04.2015 - 04.2023

Master of Science - Data Analytics

McDaniel College