HRITIK RAKESH SINGH

Boston, MA

Summary

Data Engineer with 3+ years of experience building scalable ETL pipelines across AWS and Azure environments. Proven track record of optimizing large-scale datasets (50M+ records/month) and improving pipeline performance, data reliability, and query efficiency. Experienced in enterprise and fast-paced startup environments, collaborating cross-functionally to deliver analytics-ready data solutions.

Overview

7 years of professional experience
1 Certification

Work History

Data Engineer Intern

GymIn Inc
09.2025 - 12.2025
  • Architected real-time streaming pipelines using AWS Lambda, Amazon Kinesis, and Amazon S3, processing 1M+ high-frequency events per week at under 5 seconds of latency, enabling operational analytics and scalable cloud-based telemetry processing.
  • Developed ETL workflows using AWS Glue and Python to transform raw event data into analytics-ready datasets, increasing data accuracy by 25% and enhancing reliability while minimizing manual intervention across reporting environments.
  • Optimized storage architecture and query performance in Amazon Redshift through schema redesign and indexing strategies, reducing dashboard execution time by 30% and enhancing visibility into equipment usage trends for partner gym owners.
  • Established monitoring and validation frameworks with AWS CloudWatch and SQL-based data quality checks, reducing inconsistencies by 20% and ensuring reliable insights for business and external stakeholders.
  • Delaware, USA

Data Engineer

LTIMindtree
06.2021 - 12.2023
  • Engineered enterprise-scale ETL pipelines in Azure Data Factory, processing 50M+ records monthly while reducing runtime by 40%, supporting analytics workloads across 3 business units within a cloud-modernized architecture.
  • Led migration of on-premise SQL Server workloads to Azure SQL and Azure Data Lake, improving query performance by 30% and enabling scalable, cost-efficient cloud-native data storage and transformation frameworks supporting enterprise reporting and analytics.
  • Delivered 10+ executive dashboards in Power BI using advanced SQL and DAX, driving KPI visibility, operational monitoring, and cross-functional performance insights for senior leadership teams across global business operations.
  • Optimized ServiceNow incident workflows and delivered L2/L3 support for .NET and Java systems, achieving 99.9% uptime and accelerating issue resolution by 30% while collaborating with users across 7 global business units.
  • Client: Chevron (Oil & Gas, USA)

Data Analyst Intern

Bhaktivedanta Hospital & Research Institute
12.2018 - 01.2019
  • Analyzed structured healthcare datasets (clinical, operational reporting) using advanced SQL and Tableau, enhancing stakeholder satisfaction by 20% through performance-optimized queries and improved dashboards for inpatient and outpatient cancer care monitoring.
  • Transformed structured healthcare datasets into standardized reporting models using SQL, enhancing data integrity and supporting reliable analytics for inpatient and outpatient clinical operations.
  • Streamlined internal reporting workflows by redesigning database logic and automating analytics processes, reducing manual data preparation time by 15% and improving data accuracy and cross-departmental accessibility.
  • Mumbai, India

Education

Master of Science - Computer Software Engineering

Northeastern University
Boston, MA
04-2026

Bachelor of Engineering - Information Technology

University of Mumbai
Mumbai, India
05-2021

Skills

  • Python
  • SQL
  • T-SQL
  • MSSQL
  • MySQL
  • ETL/ELT Pipelines
  • Data Ingestion
  • Data Transformation
  • Data Cleansing
  • Data Validation
  • Data Quality Monitoring
  • Pandas
  • NumPy
  • PySpark
  • Apache Airflow
  • Azure Data Factory
  • AWS
  • Lambda
  • Kinesis
  • S3
  • Glue
  • Redshift
  • Snowflake
  • Databricks
  • Lakehouse Architecture
  • Dimensional Modeling
  • SCD Type 2
  • Parquet
  • Avro
  • dbt
  • Data Warehousing
  • Data Reporting
  • Data Standardization
  • Python Scripting
  • CloudWatch
  • Azure
  • Azure Functions
  • Blob Storage
  • Azure Monitor
  • AWS Step Functions
  • Hadoop
  • RDBMS
  • CI/CD Pipelines
  • Azure DevOps
  • Git
  • Docker
  • Power BI
  • Tableau
  • Alteryx
  • Talend
  • MS Excel
  • Database Management and Design
  • Data Science
  • Machine Learning
  • Predictive Modeling
  • Feature Engineering
  • Model Evaluation & Tuning

Certification

  • SnowPro Associate - Snowflake Platform Certification
  • Google Data Analytics Professional Certificate
  • Databricks Fundamentals Accreditation - Databricks Academy

Projects

  • Vehicle Collision Analysis (GitHub; Alteryx, ADF, Parquet, Talend): Integrated 3M+ collision records into a centralized warehouse using Azure Data Factory and Talend, implementing Parquet optimization to enhance processing efficiency, reduce storage footprint, and support scalable downstream analytics workloads. Designed scalable dimensional models with SCD Type 2 and delivered KPI-driven dashboards in Power BI and Tableau, enabling historical trend analysis, interactive reporting, and data-driven decision support for stakeholders.
  • Fraud Detection Using ML (GitHub; Regression, SVC, Python): Developed classification models using Logistic Regression, Random Forest, and XGBoost, analyzing imbalanced financial datasets to strengthen fraud detection performance and improve rare-event prediction accuracy. Achieved 99.60% accuracy, 1.00 recall (fraud class), and 0.86 F1-score with XGBoost, minimizing false positives to 3 cases through precision-driven model evaluation and comparative algorithm benchmarking.
