Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

SHIREESHA GUJJA

Madison,USA

Summary

  • I am a results-oriented Data Engineer and Analyst with over 4 years of experience in building and optimizing ETL workflows, data pipelines, and integration frameworks across healthcare, finance, and business domains. My technical expertise includes SQL, Python, and R, combined with strong skills in data modeling, schema design, and relational databases such as MySQL, PostgreSQL, and SQL Server. I also bring hands-on exposure to cloud data platforms including AWS (S3, Glue, Redshift), Azure (Data Factory, Synapse), and Google Cloud (BigQuery, Dataflow), along with familiarity in big data tools like Apache Spark, Hadoop, and Kafka for large-scale data processing.

    In addition to engineering workflows, I have extensive experience in data analysis, exploratory data analysis (EDA), predictive modeling, and statistical techniques such as regression analysis, hypothesis testing, and A/B testing. I have successfully delivered projects in fraud detection, forecasting, and workflow optimization, leveraging machine learning and advanced analytics to provide actionable business insights. My proficiency in Power BI, Tableau, and Excel enables me to transform raw datasets into interactive dashboards and clear visualizations for stakeholders.

    I am also skilled in data validation frameworks, governance policies, automation, and error-handling mechanisms, ensuring the accuracy, compliance, and reliability of enterprise data assets. With a collaborative background in Agile environments, I excel at translating complex business requirements into scalable technical solutions that enhance pipeline performance and support advanced reporting and analytics. Holding a Master’s in Business Analytics from the University of Alabama, Huntsville, I bring a unique blend of engineering rigor, analytical depth, and business acumen to deliver data-driven solutions that bridge the gap between raw data and strategic decision-making.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Analyst

R1RCM India
12.2021 - 12.2022
  • Designed and implemented ETL workflows using SQL and Python to extract, transform, and load healthcare datasets from multiple source systems.
  • Automated data ingestion pipelines for structured and unstructured data, improving consistency and reducing manual intervention.
  • Developed and maintained data validation frameworks in Python to detect anomalies, missing values, and duplicates.
  • Partnered with cross-functional teams to improve pipeline performance and ensure timely availability of data for business reporting.
  • Built SQL queries, stored procedures, and reusable scripts to support recurring reporting processes and ad-hoc requests.
  • Implemented logging, monitoring, and error-handling mechanisms for pipeline stability and easier troubleshooting.
  • Collaborated with analysts to design data models and schemas, enhancing data accessibility and supporting self-service analytics.

Data Engineer Intern

Tata Consultancy Services India
07.2019 - 10.2021
  • Supported the design and deployment of data integration workflows for financial and healthcare datasets.
  • Created SQL scripts to extract, transform, and validate large datasets across multiple systems.
  • Assisted in enhancing ETL pipelines, improving data flow efficiency and reliability.
  • Conducted root-cause analysis on data discrepancies and pipeline errors, implementing corrective actions for long-term stability.
  • Collaborated with engineering teams to design ER models and data dictionaries, ensuring data consistency and compliance with business rules.
  • Documented pipeline processes, transformation logic, and validation rules, supporting knowledge transfer and onboarding.
  • Delivered training sessions for team members on data validation automation using SQL and Python.

Education

Master's - Business Analytics

University of Alabama Huntsville
05.2024

Bachelor's - Commerce, Computer Science

Shiva Sivani Degree College, Osmania University
India
05.2019

Skills

Programming & Scripting: Python (Pandas, NumPy, Scikit-learn), R, SQL Data Engineering & Pipelines: ETL Processes, Data Warehousing, Data Integration, Data Modeling, Workflow Automation, API Data Extraction, Airflow (familiar)

Databases & Storage: MySQL, PostgreSQL, SQL Server, NoSQL (MongoDB – exposure), Data Lakes & Warehouses (Snowflake, Redshift, BigQuery – exposure)

Big Data & Processing (exposure): Apache Spark, Hadoop, Kafka

Cloud Platforms: AWS (S3, Glue, Redshift), Azure (Data Factory, Synapse), Google Cloud (BigQuery, Dataflow)

Data Analysis & Statistics: Exploratory Data Analysis (EDA), Regression Analysis, Hypothesis Testing, A/B Testing, Trend & Gap Analysis

Visualization & Reporting: Power BI, Tableau, Excel (PivotTables, VLOOKUP, Macros)

Data Governance & Quality: Data Validation, Data Cleaning, Error Handling, Compliance Standards, Metadata Documentation

Methodologies & Tools: Agile, Git, Jira, Microsoft Visio

Certification

  • Data Privacy Certification – Tata Consultancy Services | 2021
  • Information Security Awareness Certification – Tata Consultancy Services | 2021
  • Agile Way of Methodology Certification – Tata Consultancy Services | 2021

Timeline

Data Analyst

R1RCM India
12.2021 - 12.2022

Data Engineer Intern

Tata Consultancy Services India
07.2019 - 10.2021

Master's - Business Analytics

University of Alabama Huntsville

Bachelor's - Commerce, Computer Science

Shiva Sivani Degree College, Osmania University
SHIREESHA GUJJA