Summary
Overview
Work History
Education
Skills
Languages
Timeline
Generic

Jaya Teja Challagundla

Overland Park

Summary

Master’s graduate in Computer Science with a strong academic foundation in data engineering, big data technologies, and cloud platforms. Skilled in designing and implementing ETL pipelines, building real-time and batch data workflows, and managing data lakes using AWS (Glue, Lambda, EMR, S3, Athena) and Apache Spark. Proficient in Python, SQL, and Azure Data Factory, with experience developing data models, ensuring data quality, and applying data governance best practices. Completed multiple academic and internship projects involving real-time streaming data pipelines, AWS-based data lake architecture, and large-scale batch processing. Adept at creating interactive dashboards with Power BI and Tableau to drive data-driven decisions. Seeking an entry-level Data Engineer role to leverage technical expertise, problem-solving skills, and a passion for building scalable, high-performance data solutions.

Overview

1
1
year of professional experience

Work History

Data Engineering & Analytics Intern

ExcelR
01.2023 - 04.2023
  • Designed and implemented data ingestion pipelines from multiple sources including APIs, CSV/Parquet files, and AWS S3, leveraging AWS Glue for transformation workflows.
  • Built data models in AWS Athena and Azure Data Factory for integration with downstream BI tools.
  • Developed Python validation scripts to automate data integrity checks, schema matching, and anomaly detection prior to loading into analytics environments.
  • Configured Role-Based Access Control (RBAC) in Tableau Server and IAM policies in AWS to ensure secure, compliant access to datasets.
  • Created Power BI dashboards with DAX measures and Row-Level Security (RLS) to deliver personalized insights for different business units.
  • Implemented incremental ETL processes in Airflow, reducing daily processing time by over 35%.
  • Assisted in setting up CI/CD workflows using GitHub Actions to automate deployment of data pipeline code.
  • Conducted cross-table reconciliation using SQL to ensure consistency between staging and production datasets.

Key Achievements:

  • Reduced dashboard refresh latency from 30 minutes to under 5 minutes by optimizing ETL and data modeling.
  • Improved data governance by standardizing naming conventions and access policies across multiple AWS services.

Software Engineer

ValueLabs
12.2021 - 03.2022
  • Assisted in designing and implementing ETL pipelines to extract, transform, and load structured and semi-structured data from multiple sources into a centralized database.
  • Worked with Apache Spark and PySpark for large-scale batch data processing and optimization of transformation jobs.
  • Contributed to data quality checks by implementing validation scripts in Python and SQL, ensuring accuracy before loading into analytics systems.
  • Automated data ingestion workflows using Apache Airflow, improving pipeline reliability and reducing manual intervention.
  • Created data profiling reports to assist senior engineers in identifying anomalies and inconsistencies.
  • Participated in building proof-of-concept solutions for real-time data streaming using Kafka and AWS Kinesis.
  • Collaborated with cross-functional teams, including business analysts and QA engineers, to translate business requirements into technical data solutions.
  • Documented workflows, transformation logic, and deployment steps, enabling smooth handover to production teams.

Key Achievements:

  • Reduced ETL job runtime by 20% through query optimization and data partitioning strategies.
  • Successfully contributed to the deployment of a streaming data proof-of-concept that was later adopted in a production use case.

Education

Computer Science

University of Central Missouri
Warrensburg, MO
05-2025

Bachelor of Technology - Electronics And Communication Engineering

KL University
04-2022

Skills

    Programming Languages

    Python, SQL

    Clouds and Data Platform

    AWS(Glue, Lambda, EMR, S3, IAM, Athena, Step Functions, API Gateway), Azure

    Big Data Technologies

    Apache Spark (Core, SQL, DataFrame, MLlib), Hadoop (HDFS, MapReduce, Pig, Hive, HBase, YARN)

    ETL & Data Integration

    AWS Glue, AWS Step Functions, API Gateway, Airflow, Azure Data Factory, SQL

    Business Intelligence & Reporting

    Power BI (DAX, RLS, Calculated Columns, KPIs), Tableau (LOD, Row-Level Security, Calculated Fields, Parameters)

    Data Modeling & Governance

    IAM Policies, Role-Based Access Control (RBAC), AWS Lake Formation, Tableau Server Security

    CI/CD & Version Control

    Git, GitHub Actions, Jenkins, Git Tags

    Data Quality & Validation

    SQL Data Integrity Checks, Python Validation Scripts, Cross-table Reconciliation

Languages

English
Full Professional
Hindi
Native or Bilingual
Telugu
Native or Bilingual

Timeline

Data Engineering & Analytics Intern

ExcelR
01.2023 - 04.2023

Software Engineer

ValueLabs
12.2021 - 03.2022

Computer Science

University of Central Missouri

Bachelor of Technology - Electronics And Communication Engineering

KL University