
SAI BOTLA

AZ

Summary

Detail-oriented and results-driven Data Engineer with hands-on experience in building scalable ETL pipelines, real-time data workflows, and cloud-native analytics solutions using AWS, Snowflake, and PySpark. Completed a 7-month internship at American Express, contributing to credit risk and payment data processing projects in Agile teams. Strong foundation in data modeling, data quality frameworks, and stakeholder collaboration. Currently pursuing a second master’s in Engineering Management to enhance cross-functional leadership and project execution skills. Adept at combining technical expertise with strategic thinking to drive data-driven business outcomes.

Overview

1 year of professional experience
1 Certification

Work History

Data Engineering Intern

American Express
Phoenix, AZ
12.2024 - Current
  • Developed scalable ETL pipelines to process payments and credit risk data using AWS Glue and Snowflake.
  • Created PySpark jobs and Airflow DAGs for real-time analytics and fraud detection pipelines.
  • Built and optimized SQL procedures for generating credit risk summaries and financial reports.
  • Implemented data quality checks using Great Expectations to ensure high data accuracy.
  • Worked with Agile teams and product stakeholders to deliver solutions for credit decisioning workflows.

Education

Master of Science - Data Science

University of Maryland, Baltimore County
Baltimore, MD
05.2024

Master of Science - Engineering Management

Trine University
Phoenix, AZ

Skills

Programming Languages: Python, Java, SQL, Shell Scripting

Cloud Platforms: AWS (Glue, Redshift, S3, Lambda, EMR), Snowflake

Data Engineering: Apache Spark, PySpark, Airflow, dbt, ETL/ELT Pipelines

Databases: PostgreSQL, MySQL, MongoDB, DynamoDB, Cassandra

Tools & Visualization: Power BI, Tableau, AWS QuickSight, Git, Docker, Jenkins

Concepts: Data Modeling, Credit Data Processing, Payments Pipelines, Data Quality Frameworks

Management Skills: Agile project management, cross-functional collaboration, sprint planning, stakeholder communication, risk resolution, and process improvement

Certifications

  • AWS Certified Data Engineer – Associate (DEA-C01)
  • Snowflake SnowPro Core Certified (COF-C02)

Projects

Data-Driven TED Talk Recommendation Platform
UMBC Capstone Project
Jan 2024 – May 2024

  • Developed a personalized TED Talk recommendation system leveraging NLP and user behavior analytics.
  • Ingested and transformed metadata (titles, tags, transcripts, speaker info) from 3,000+ TED Talks using AWS Glue and PySpark.
  • Built a content-based filtering engine using cosine similarity on TF-IDF vectors to generate tailored talk suggestions.
  • Deployed the backend pipeline on AWS S3, Redshift, and Lambda; built dashboards in AWS QuickSight for engagement analysis.
  • Enabled real-time search and recommendations with 95% accuracy based on user interests and past activity.
