Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic
Megha Banerjee

Megha Banerjee

Dallas,TX

Summary

Meticulous Data Scientist accomplished in compiling, transforming and analyzing complex information through software. Expert in machine learning, large language models, and large dataset management. Demonstrated success in identifying relationships and building solutions to business problems.

Overview

3
3
years of professional experience

Work History

Senior Data Scientist

Walmart, Inc
03.2024 - Current
  • Currently leading/active part of several high-valued data science/generative AI initiatives including but not limited to W+ Membership cancellations defects, Ecom Spark Delivery escalations, Walmart legal pain points identfication, and workforce planning defects identfication.
  • Collaborating with Product Managers, Data Analysts, and Operation partners to understand business needs, translating into advanced analytics and data science solutions.
  • Fostered seamless onboarding by closely collaborating with new team members, providing comprehensive understanding of existing use cases.
  • Collaborated with external agencies/service providers guiding comprehensive use cases, technical strategies, data accessibility, project roadmaps, and envisioned outcomes.
  • Effectively communicating findings with senior leadership and non-technical audience.

Senior Data Analyst

Walmart, Inc
06.2023 - 03.2024
  • Implemented automated Customer Advocacy Program, using advanced NLP and GPT 3.5, and introduced daily executive email system, resulting in 93% reduction in assignment time, 41% in commitment time, and 71% in resolution time.
  • Automated quality control process for defect hub generative AI summaries and KPIs, ensuring smoother filtration process for higher and low quality retail defects.
  • Leveraged Google BigQuery and Tableau to extract and aggregate data from multiple sources and compile into digestible format; produced and presented weekly reports to stakeholders and executives with actionable error correction and operational improvement plans.

Research Assistant

Syracuse University
10.2021 - 05.2023
  • Performed sentiment analysis over 70k rows of user comments, extracted using Python API Wrapper and Pushshift API from subreddits.
  • Identfied pain points of fulfillment center workers’ work-life balance, created questionnaire that was later used to interview over 30+ candidates for research validation.
  • Identfied relevant research articles and crafted literature survey section of various research articles by scrutinizing research articles and journals.

Data Science Intern

Walmart Global Tech
05.2022 - 08.2022
  • Conducted through analysis of transaction database with over 50 million rows of data, customer reviews, and other textual data sources to identify effective drivers for returns and areas of improvement, aligning with business requirements and goals.
  • Build out data reporting infrastructure from scratch using SQL, Python, and PowerBI to Provide real-time insights into product, marketing funnels, and business KPIs.
  • Utilized NLP techniques to extract return reasons from customer reviews to build auto-tagging system associated with every dot com return.
  • Project highlighted top returned items at micro level and reduced return rates across Walmart’s E-commerce, eventually leading to better customer experience retention.

Education

Master of Science - Applied Data Science

Syracuse University
Syracuse, New York

Skills

  • Languages: Python, R, SQL, Latex
  • Technologies: PyCharm, Rstudio, Jupyter, Microsoft Office, MS Access, PowerBI, Tableau, Data Studio, Amazon Web
  • Services, GitHub, Vertex AI, Confluence, JIRA
  • Databases: MySQL, MongoDB, CosmosDB, Oracle, NoSQL, Azure, Google Cloud Platform, Data Discovery
  • Concepts: Data Cleaning, Data Visualization, Predictive Modeling, Forecasting, Supervised/Unsupervised Machine Learning, Classfication, Clustering, Natural Language Processing (Sentiment Analysis, Text Mining, Text Classification, Topic Modeling, Generative AI), Big Data, Anomaly Detection

Accomplishments

  • Hazra, R., Banerjee, M. and Badia, L., 2020, November. Machine learning for breast cancer classification with ANN and Decision Tree. In 2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) (pp. 0522-0527). IEEE.
  • M. Banerjee and S. Majhi, ”Multi-class Heart Sounds Classification Using 2D-Convolutional Neural Network,” 2020 5th International Conference on Computing, Communication and Security (ICCCS), Patna, India, 2020, pp. 1-6. IEEE.

Timeline

Senior Data Scientist

Walmart, Inc
03.2024 - Current

Senior Data Analyst

Walmart, Inc
06.2023 - 03.2024

Data Science Intern

Walmart Global Tech
05.2022 - 08.2022

Research Assistant

Syracuse University
10.2021 - 05.2023

Master of Science - Applied Data Science

Syracuse University
Megha Banerjee