Master’s graduate in Computer Science with a strong academic foundation in data engineering, big data technologies, and cloud platforms. Skilled in designing and implementing ETL pipelines, building real-time and batch data workflows, and managing data lakes using AWS (Glue, Lambda, EMR, S3, Athena) and Apache Spark. Proficient in Python, SQL, and Azure Data Factory, with experience developing data models, ensuring data quality, and applying data governance best practices. Completed multiple academic and internship projects involving real-time streaming pipelines, AWS-based data lake architecture, and large-scale batch processing. Adept at creating interactive dashboards in Power BI and Tableau to support data-driven decision-making. Seeking an entry-level Data Engineer role to apply technical expertise, problem-solving skills, and a passion for building scalable, high-performance data solutions.
Technical Skills:
Programming Languages: Python, SQL
Cloud & Data Platforms: AWS (Glue, Lambda, EMR, S3, IAM, Athena, Step Functions, API Gateway), Azure
Big Data Technologies: Apache Spark (Core, SQL, DataFrame, MLlib), Hadoop (HDFS, MapReduce, Pig, Hive, HBase, YARN)
ETL & Data Integration: AWS Glue, AWS Step Functions, API Gateway, Airflow, Azure Data Factory, SQL
Business Intelligence & Reporting: Power BI (DAX, RLS, Calculated Columns, KPIs), Tableau (LOD, Row-Level Security, Calculated Fields, Parameters)
Data Modeling & Governance: IAM Policies, Role-Based Access Control (RBAC), AWS Lake Formation, Tableau Server Security
CI/CD & Version Control: Git, GitHub Actions, Jenkins, Git Tags
Data Quality & Validation: SQL Data Integrity Checks, Python Validation Scripts, Cross-table Reconciliation