
ANURAG BANERJEE

Columbus, OH

Summary

Data-driven MS Statistics student with 2 years of experience in data science and a passion for problem-solving. Proficient in developing SQL queries, dashboards, Python and R scripts, AI/ML and statistical models, and end-to-end pipelines for model deployment. Strong background in statistics and programming, with a commitment to continuous improvement.

Overview

3 years of professional experience
1 Certification

Work History

Graduate Teaching Associate

Ohio State University
2024.08 - Current
  • Recitation leader and grader for STAT 1430: Statistics for the Business Sciences.
  • Tutor for STAT 1350, 1430, 1450, 2450, and 2480.

Associate Data Science Engineer

OEConnection
2022.10 - 2024.06
  • Forecasted auto parts sales for US dealers to aid in inventory planning, utilizing statistical, machine learning, and deep learning techniques. Constructed a Power BI dashboard.
  • Implemented k-means clustering from scratch as a T-SQL stored procedure, as part of a statistical solution.
  • Predicted low-confidence repair procedures on insurance estimates using NLP, statistics, and machine learning. Improved accuracy for some procedures by 50%. Deployed the solution to Azure.
  • Led the development of a POC for an auto-parts recommendation engine, successfully demonstrating the functionality through an intuitive UI.
  • Designed and implemented a churn prediction model utilizing statistical methods and machine learning algorithms to accurately anticipate customer churn in our e-commerce products. Developed an end-to-end automated machine learning pipeline on Azure for seamless integration into production.
  • Skills used: Data Science, Exploratory Analysis, Ad-hoc Analysis, Statistical Analysis, Hypothesis Testing, Simulations, Machine Learning, Deep Learning, AI, Recommendation Engine, Data Wrangling, SQL, Data Visualization, Storytelling with data, Bayesian Statistics, Agile.
  • Technologies used: Python, R, SQL, Azure, Azure Machine Learning, HTML, CSS, Power BI, Excel, MS Office, MS PowerPoint, MS Word.

Data Science Intern

GlobalLogic
2021.09 - 2022.01
  • Analyzed resignation trends at the organization and built a machine learning model to predict employee churn.
  • Presented findings to senior management.
  • Supported chatbot development with Dialogflow on GCP.
  • Skills and technologies used: Python, Statistics, Exploratory Data Analysis, Data Visualization, Machine Learning, Statistical Models, MS PowerPoint, MS Excel, GCP.

Education

Master of Science - Statistics

Ohio State University
2026.05

Post Graduate Diploma in Data Analytics

Guru Gobind Singh Indraprastha University
Delhi, Delhi, India
2022.08

Bachelor of Science (Honors) - Statistics

University of Delhi
Delhi, Delhi, India
2021.06

Skills

  • Data Science
  • Exploratory Analysis
  • Ad-hoc Analysis
  • Statistical Analysis
  • Hypothesis Testing
  • Simulations
  • Machine Learning
  • Deep Learning
  • AI
  • Recommendation Engine
  • Data Wrangling
  • SQL
  • Data Visualization
  • Storytelling with data
  • Bayesian Statistics
  • Agile
  • Cloud Technologies
  • Azure
  • Azure Machine Learning/AI

Leadership roles

  • Mentored a new hire and interns at OEConnection
  • Led the housing price prediction project at Guru Gobind Singh Indraprastha University, Delhi
  • Led the smartphone buying trends project at the University of Delhi, Delhi

Achievements/ Extracurricular

  • Dare for Better (OEC): Annual award for 2023-2024, given by OEConnection for innovation, problem-solving abilities, and overall output.
  • Dare for Better (OEC): Monthly award, given by OEConnection in March 2023 for innovation during the Labor Rates project.
  • Elite Silver Badge & Topper in Data Science for Engineers: Given for exceptional performance in the NPTEL course Data Science for Engineers.
  • Played on the high school basketball team.

Academic Projects

THYROID DETECTION, 05/2022 - 07/2022

  • Identified factors associated with thyroid disease using statistics and tools such as Tableau, Excel, and Python.
  • Performed statistical tests for feature selection.
  • Used machine learning (classical models and neural networks) to predict whether a patient has thyroid disease.
  • Performed feature engineering to improve the model performance.
  • Used Python, R, Excel and Tableau.

MARKET BASKET ANALYSIS, 02/2022 - 04/2022

  • Analyzed the trends of products that are bought together from a supermarket under study.
  • Used the FPGrowth algorithm to recommend a product if a certain product was bought.
  • Used R and Excel.

MARKET ANALYSIS OF SUPERSTORE, 06/2021 - 07/2021

  • Used Python (pandas, NumPy, seaborn, matplotlib) and Excel for analysis.
  • Provided insights on sales based on age, country, income, sales campaigns and other key demographics.
  • Segmented customers based on buying trends. Used Python, MS Excel and MS PowerPoint.

SMARTPHONE BUYING TRENDS, 03/2020 - 05/2020

  • Collected data from consumers on the basis of a self-created questionnaire.
  • Derived market trends with respect to brand, age, income, and features.
  • Used Google Forms, Excel and R.

Certification

  • Data Science for Engineers (NPTEL), 07/2022 - 09/2022
  • Machine Learning Specialisation (Coursera), 07/2022 - 08/2022
  • SQL Intermediate Assessment (HackerRank), 07/2022 - 07/2022
  • Introduction to Stochastic Processes (NPTEL), 01/2022 - 04/2022
  • R Programming A-Z (Udemy), 05/2020 - 07/2020
  • Data Analysis with Python (Coursera), 07/2020 - 07/2020
  • Python Programming & Data Exploration (NIIT Ltd), 06/2019 - 08/2019

Languages

English
Native/ Bilingual
Hindi
Native/ Bilingual
Bengali
Professional
