Summary
Overview
Work History
Education
Skills
Certification
Projects
Timeline
Generic

SWEETY RACHUPALLECHINNABOREDDY

Summary

Results-driven Cloud Data Engineer with extensive experience at Wells Fargo, specializing in designing ETL pipelines and optimizing SQL queries, achieving a 40% improvement in analytics performance. Proficient in Python and AWS, I excel in mentoring teams and automating CI/CD processes, enhancing deployment efficiency and fostering collaboration.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Cloud Data Engineer

Wells Fargo
12.2023 - Current
  • Designed and deployed real-time and batch ETL pipelines on Databricks (PySpark, SparkSQL) and AWS Glue, processing 5+ TB/day of financial data for analytics and compliance.
  • Built REST APIs with Flask/FastAPI integrated with AWS Lambda and API Gateway, enabling 10+ microservices to access standardized datasets.
  • Developed Generative AI / LLM pipelines using AWS Bedrock, Pinecone, OpenSearch, reducing compliance reporting time by 30%.
  • Automated CI/CD pipelines with GitHub Actions, Terraform, Docker, Kubernetes, reducing deployment time from 3 days to under 6 hours.
  • Optimized Redshift schemas and SQL queries, improving analytics query runtime by 40%.
  • Implemented event-driven pipelines using SNS/SQS and Lambda triggers, achieving <5 min latency for critical monitoring workflows.

Data Engineer

Accenture Pvt Ltd
09.2020 - 07.2023
  • Built ETL pipelines on Databricks and AWS Glue, processing 5+ TB/month of structured/unstructured data (Parquet, JSON, Avro, CSV) for analytics dashboards.
  • Implemented real-time streaming pipelines using Kafka and Spark Streaming to monitor 100k+ transactions/hour, reducing anomaly detection time by 25%.
  • Designed and optimized SQL/PLSQL and HiveQL queries, improving dashboard load times by 35%.
  • Developed Airflow DAGs for pipeline orchestration with retries, SLA monitoring, and dynamic scheduling, achieving 99.8% uptime.
  • Automated infrastructure provisioning with Terraform and containerized workloads with Docker, cutting environment setup time by 40%.
  • Integrated third-party APIs for data enrichment, improving analytics accuracy by 20%.
  • Partnered with BI teams to deliver dashboards in Tableau and Power BI, increasing executive visibility into KPIs by 30%.
  • Conducted code reviews, PySpark performance tuning, and mentored junior engineers, improving team productivity by 50%.

Education

Master's - Data Science

University At Albany, SUNY Albany
Albany, NY

Skills

  • Python and Java
  • SQL and SparkSQL
  • Unix and Bash
  • Databricks and PySpark
  • Hadoop and Hive
  • Kafka and AWS Glue
  • Airflow and Step Functions
  • Lambda and Cloud Functions
  • AWS and GCP
  • REST APIs and Flask
  • FastAPI and JSON
  • XML and Git
  • GitHub Actions and Jenkins
  • Terraform and Docker
  • Kubernetes and AWS Bedrock
  • LangChain and RAG pipelines
  • Pinecone and TensorFlow
  • Scikit-learn and Vertex AI
  • Tableau, Power BI, and AWS QuickSight
  • Matplotlib and Seaborn

Certification

  • Google Data Analytics Certificate from Coursera
  • AWS Certified Data Engineer - Associate

Projects

Scalable Data Pipeline using AWS (AWS S3, AWS Glue, AWS Lambda, AWS Redshift, Python, SQL)

  • Built an ETL pipeline using AWS Glue and Lambda to process streaming sensor data and stored raw data in Amazon S3 and used AWS Glue for data transformation.
  • Queried the data for insights using Athena and SQL and loaded transformed data into Amazon Redshift for further business intelligence reporting.

Data Analysis on E-commerce Sales (Pandas, NumPy, Matplotlib, Seaborn, SQL, AWS Athena)

  • Processed and cleaned 500,000+ transaction records from an e-commerce dataset.
  • Used AWS Athena to run SQL queries on structured data stored in S3, identified top-selling products, seasonal trends, and customer purchasing behavior using Pandas and SQL queries.
  • Built interactive visualizations in Matplotlib and Seaborn to present sales trends and customer segmentation and recommended business strategies based on customer retention analysis and product demand forecasting.

Timeline

Cloud Data Engineer

Wells Fargo
12.2023 - Current

Data Engineer

Accenture Pvt Ltd
09.2020 - 07.2023

Master's - Data Science

University At Albany, SUNY Albany
SWEETY RACHUPALLECHINNABOREDDY