Summary
Overview
Work History
Education
Skills
Certification
Timeline
background-images

Mokesh Balakrishnan

Massachusetts,MA

Summary

Data Science Engineer with 2+ years of experience in data engineering, analytics, and machine learning, delivering solutions across healthcare and manufacturing domains. Proficient in Python, SQL, PySpark, TensorFlow, and PyTorch, with expertise in predictive modeling, anomaly detection, and real-time analytics. Skilled in building scalable ETL pipelines, developing interactive dashboards in Tableau/Power BI, and deploying ML models on cloud platforms (AWS, Azure, GCP). Adept at optimizing data workflows, ensuring reliability, and driving measurable business outcomes through data-driven innovation.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

UnitedHealth Group
08.2024 - Current
  • Designed and built robust ETL pipelines using PySpark to ingest, process, and stage healthcare records per month from multiple sources (claims, patient data, EHRs), improving data freshness for analytics by ~40%.
  • Implemented data quality monitoring (missing values, schema drift) and alerting, reducing upstream data errors by ~30%.
  • Collaborated with data science teams to productionize ML models (e.g. patient risk scoring, cost prediction) using feature pipelines; managed feature engineering, deployment, and ongoing monitoring.
  • Migrated parts of the legacy on-prem data warehouse to Snowflake / cloud-based warehousing, optimizing storage and query performance; reduced query run times on key dashboards by ~50-60%.
  • Partnered with stakeholders (clinical operations, finance) to define KPIs, dashboard requirements; built dashboards in Tableau / Power BI that enabled non-technical teams to monitor trends and make strategic decisions.

Junior Data Analyst

Meril
01.2022 - 06.2023
  • Analyzed device manufacturing, supply chain, and quality control data by developing ETL workflows and preparing datasets for reporting, reducing reporting lead time from days to hours.
  • Wrote and optimized SQL queries and stored procedures to extract, clean, and transform large datasets, ensuring consistency and accuracy across multiple product lines.
  • Designed and delivered dashboards and reports in Tableau and Power BI, providing operations teams with insights on manufacturing yield, downtime, and defect rates; findings supported process improvements that increased throughput by ~15%.
  • Conducted data validation and cleansing to ensure accuracy of reports delivered to stakeholders.

Research Assistant

SRM Institute of Science and Technology
05.2020 - 12.2021
  • Led data preprocessing, feature selection, and model tuning, achieving a 92.99% prediction accuracy for a financial forecasting model.
  • Contributed to an IEEE-published research project involving the Hybrid Cat Boost Correlation (HCBC) algorithm for stock prediction.
  • Collaborated with faculty on algorithm validation, result interpretation, and publication efforts.

Education

Masters of Science - Data Science

University of Massachusetts Dartmouth
North Dartmouth, MA
08.2023

Bachelor of Technology - Computer Science and Engineering

SRM Institute Of Science And Technology
CHENNAI INDIA
05.2023

Skills

  • Database management: MySQL, PostgreSQL, Oracle, MongoDB, Firebase
  • ML and AI Frameworks: PyTorch, TensorFlow, Keras, Scikit-Learn, XGBoost, LightGBM, CatBoost, Hugging Face Transformers, OpenCV, NLTK, SpaCy, LangChain, Haystack, Ray, FastAPI, Flask, PySpark, Dask, ONNX, TorchServe, TensorFlow Serving
  • Data Engineering and MLOps: Apache Spark, Apache Airflow, Apache Kafka, Snowflake, Hadoop, MLflow, Docker, Kubernetes, AWS EKS, Jenkins, Git, GitHub, ETL Pipelines, Redshift
  • Visualization and Monitoring: Power BI, Tableau, Excel, Grafana, Prometheus, ELK Stack, Google Analytics, Hive, Alteryx, Bias & Fairness Monitoring
  • Cloud platform expertise: AWS, Azure, GCP
  • Proficient in Python, R, Go, C/C, and MATLAB

Certification

  • AWS Certified Cloud Practitioner – Amazon Web Services (AWS)
  • Certified Data Analyst: Coursera

Timeline

Data Engineer

UnitedHealth Group
08.2024 - Current

Junior Data Analyst

Meril
01.2022 - 06.2023

Research Assistant

SRM Institute of Science and Technology
05.2020 - 12.2021

Masters of Science - Data Science

University of Massachusetts Dartmouth

Bachelor of Technology - Computer Science and Engineering

SRM Institute Of Science And Technology
Mokesh Balakrishnan