Summary
Overview
Work History
Education
Skills
Interests
Timeline

Hari Varma Nagaraju

Data Scientist
Columbus,OH

Summary

Results-driven Data Scientist with 6 years of experience in building scalable machine learning solutions across insurance, healthcare, finance, and consumer industries. Proven expertise in predictive modeling, real-time data pipelines, and cloud-based ML platforms like Azure and AWS. Skilled in deploying end-to-end solutions using Python, Spark, and Kafka, and translating insights into impactful business outcomes through dynamic dashboards and stakeholder collaboration. Adept at solving complex problems with a blend of data engineering, machine learning, and domain knowledge.

Overview

6
6
years of professional experience

Work History

Data Scientist

Nationwide Insurance Company
09.2024 - Current
  • Built predictive models using Logistic Regression and Gradient-Boosted Trees in Azure ML to assess claims risk and prevent fraud, reducing false positives by 22%.
  • Engineered real-time data pipelines with Apache Kafka for ingesting policyholder data, enabling instant underwriting decisions.
  • Leveraged Azure Databricks and Spark for large-scale model training and batch scoring over 100M+ records.
  • Created dynamic Power BI dashboards to visualize fraud patterns, loss ratios, and customer risk profiles for business teams.
  • Conducted customer segmentation using clustering (K-means, Hierarchical) to support retention campaigns and product targeting.
  • Evaluated models using K-fold cross-validation, ROC curves, and AUC to ensure robustness in production environments.

Data Scientist

Bon Secours Mercy Health
11.2023 - 08.2024
  • Developed patient readmission prediction models using Azure ML, helping reduce unnecessary hospital stays and improve care planning.
  • Built distributed Spark pipelines to process clinical data from multiple hospitals, enhancing data quality and reporting efficiency.
  • Used NLP on clinical notes to detect critical indicators (e.g., pain scores, comorbidities), assisting in triage and care escalation.
  • Applied K-means clustering to identify patient cohorts with similar risk patterns, guiding population health initiatives.

Machine Learning Engineer

Goldman Sachs
02.2021 - 07.2023
  • Designed ML pipelines using Spark and AWS Lambda for credit risk scoring, improving inference efficiency by 30%.
  • Deployed Dockerized models via Kubernetes and managed versioning through Jenkins and Git-based CI/CD workflows.
  • Built fraud detection models (Random Forest, SVM) to flag anomalous transactions, improving detection precision by 12%.
  • Created Tableau Server dashboards for real-time model monitoring and stakeholder reporting.
  • Partnered with data engineers to optimize ETL jobs, reducing model retraining failures and improving pipeline uptime.
  • Standardized model deployment using CDC to automate refresh cycles without manual intervention.

Data Analyst

Havells, India
06.2019 - 01.2021
  • Automated ETL processes and built SQL/Python pipelines, reducing manual reporting tasks by 50%.
  • Developed Tableau dashboards to track KPIs across sales, inventory, and marketing.
  • Performed EDA to uncover seasonal trends and slow-moving SKUs, aiding inventory optimization.
  • Predicted churn using XGBoost, supporting targeted retention initiatives.
  • Used NLP to extract sentiment from customer reviews, guiding product improvements.

Education

Master of Computing And Information Systems

Youngstown State University, Youngstown, OH
05-2025

Skills

    Machine Learning: Logistic Regression, Random Forest, SVM, GBT, XGBoost, K-means, Hierarchical Clustering, NLP, Fraud Detection, Churn Prediction, AUC, ROC, K-fold

    Data Engineering: Spark, Kafka, ETL, CDC, Real-time Processing, CI/CD (Jenkins, Git)

    Cloud & Deployment: Azure ML, Databricks, AWS Lambda, Docker, Kubernetes

    Visualization: Power BI, Tableau, Tableau Server, Excel

    Programming: Python, SQL, Bash

    Tools & Platforms: Git, Jenkins, Azure AutoML, EHR Systems

Interests

Data-Driven Product Design, Volleyball, Traveling, Hiking

Timeline

Data Scientist - Nationwide Insurance Company
09.2024 - Current
Data Scientist - Bon Secours Mercy Health
11.2023 - 08.2024
Machine Learning Engineer - Goldman Sachs
02.2021 - 07.2023
Data Analyst - Havells, India
06.2019 - 01.2021
Youngstown State University - , Master of Computing And Information Systems
Hari Varma NagarajuData Scientist