SAI CHARAN REDDY PEDDIREDDY

Summary

Skilled Data Engineer with over 3 years of experience in designing, developing, and optimizing data warehousing, ETL/ELT pipelines, and business intelligence solutions. Proficient in Azure Data Factory, Snowflake, SQL Server, and Power BI, with a strong ability to streamline data workflows and enhance performance. Passionate about leveraging cloud technologies, automation, and advanced analytics to drive data-driven decision-making and operational excellence.

Overview

4 years of professional experience
1 Certification

Work History

Graduate Assistant

Southern Arkansas University
01.2024 - 04.2024
  • Assisted in teaching database-related courses (e.g., SQL, NoSQL, database design)
  • Conducted lab sessions on MySQL, MongoDB, or Oracle
  • Graded assignments, quizzes, and projects on database queries, normalization, and indexing
  • Held office hours to assist students with database concepts
  • Assisted faculty in research projects involving big data, database optimization, and cloud databases
  • Worked on database security, indexing strategies, and performance tuning
  • Wrote scripts for ETL (Extract, Transform, Load) processes

Data Engineer

Yantra Tech Innovation Lab Pvt. Ltd.
07.2022 - 08.2023
  • Client: Maravai Life Sciences
  • Worked extensively on ETL processes in Azure Data Factory, including data transformation, mapping, conversion, and loading, ensuring data integrity and accuracy
  • Created robust Fivetran pipelines to efficiently load data from various sources into Snowflake stage tables, enhancing data accessibility and usability
  • Migrated complex R scripts to optimized Snowflake stored procedures using SnowSQL, improving performance and maintainability
  • Migrated data from on-premises databases to Snowflake with minimal downtime and data loss
  • Assisted in the deployment of critical objects to QA and Production environments, ensuring smooth transitions and minimal disruptions
  • Provided timely support for production issues, resolving data-related problems and ensuring system reliability and stability
  • Collaborated with cross-functional teams to understand data requirements and deliver customized solutions
  • Implemented data quality checks and validation processes to maintain high standards of data accuracy
  • Conducted performance tuning and optimization of Snowflake queries and procedures to enhance system efficiency
  • Performed thorough data validation to ensure the accuracy, consistency, and completeness of data throughout the ETL process
  • Collaborated with teams to develop automated Power BI Workforce Planning reports, streamlining processes, reducing manual effort, and utilizing advanced DAX formulas to deliver precise and insightful analytics

SQL ADF Developer

Terex
07.2021 - 06.2022
  • Utilized Databricks to develop and execute data extraction and transformation workflows using Python, optimizing data processing pipelines
  • Orchestrated Databricks notebook execution within Azure Data Factory to automate and manage data workflows, enhancing data integration and ETL processes
  • Designed and maintained end-to-end data pipelines in Azure Data Factory, ensuring seamless data ingestion from APIs and transformation in Databricks
  • Developed and executed complex data transformation workflows using Python within the Databricks environment, enhancing data quality and processing efficiency
  • Designed and implemented ETL pipelines in Databricks, which automated data ingestion and transformation processes, significantly reducing manual effort and improving data accuracy
  • Implemented monitoring and error-handling mechanisms to ensure reliable data processing and timely issue resolution across Databricks and Azure Data Factory environments

SQL ADF Developer

Infosys
07.2021 - 06.2022
  • Client: Schlumberger
  • Developed and maintained end-to-end ETL data pipelines using Azure Data Factory
  • Researched and implemented various components, including pipelines, activities, mapping data flows, datasets, linked services, triggers, and control flows
  • Automated the pipelines using schedule triggers to run once daily
  • Built Data Flows to transform data according to business needs
  • Constructed fact and dimension tables, joins, views, and complex stored procedures on dedicated SQL pools (ADW) using SQL Server and T-SQL
  • Created complex stored procedures in T-SQL using joins, common table expressions (CTEs), and temporary tables
  • Implemented an email notification feature using Azure Logic Apps to report pipeline execution details
  • Built a CI/CD pipeline on Azure DevOps for code migration across environments (DEV, UAT, and PROD)
  • Provided support for production Power BI dashboards and troubleshot issues
  • Traced and tested data issues from Power BI reports back to ADW tables (fact, dimension) and stage tables (source data)

Trainee-Machine Learning

Indian Servers
04.2020 - 07.2020
  • Gained exposure to the basics of Python, statistical machine learning algorithms (supervised and unsupervised), and OpenCV during the internship

Education

Master's in Computer and Information Science

Southern Arkansas University
Arkansas City, AR
12.2024

Skills

Cloud Services: Snowflake, Azure (Azure Data Factory, Databricks, Storage accounts, Azure Data Warehouse, Azure Data Lake Gen2, Azure SQL Server, Azure Analysis Services, Azure Logic Apps), SSIS, SSRS, NetSuite ERP

Databases: MS SQL Server, Oracle, MySQL

ETL Tools: Azure Data Factory, Databricks, Fivetran

Business Intelligence and Analytics Tools: Advanced Excel, Power BI, Tableau, IBM Cognos

Languages and IDE: Python, SQL, R, C, Java, Google Colab, Spyder, PyCharm, Jupyter Lab, Visual Studio Code, SSMS, RStudio

Version Control: Git, GitHub, GitLab, Azure DevOps

Certification

  • Microsoft Certified: Azure Fundamentals
  • Python for Beginners – Udemy
  • Certificate of participation in the Building Chat-Bot contest by Indian Servers

Projects

Heart Disease Prediction, This project predicts heart disease at its earliest stages and measures the accuracy of those predictions. It is built using machine learning algorithms such as decision trees and support vector machines (SVM).

Twitter Sentiment Analysis, This project determines the sentiment of a text, comment, or tweet, i.e., whether the written text is positive, negative, or neutral, using machine learning algorithms.

Face Detection, This project detects the presence of people's faces in digital images. I created it in Python using the OpenCV library.

Internship

Trainee-Machine Learning, Indian Servers, 04/01/20 - 07/31/20

During my internship, I was introduced to the basics of Python, statistical machine learning algorithms (supervised and unsupervised), and OpenCV.
