Summary
Overview
Work History
Education
Skills
Timeline
Generic
BHAWANA AGARWAL

BHAWANA AGARWAL

Data Scientist
Boston,MA

Summary

Results-driven data analyst with close to 5 years of experience in financial services and IT. Expertise in Python, and SQL, specializing in data analysis and automation tool with a proven track record in using Power Automate, PowerApps, and AWS to improve collaboration and productivity of teams. Adept at data visualization with tools like Power BI and Tableau and experienced in database management. Holds a Master’s in Information Systems from Northeastern University

Overview

5
5
years of professional experience

Work History

Data Scientist (Advanced Analyst)

Ernst & Young | Financial Services
10.2019 - 12.2021
  • Generated $2.4M in cost savings by developing a Python automation tool to pre-process and extract key information from
  • JSON and XML datasets, detecting data patterns and enhancing data analysis efficiency
  • Automated the reconciliation of over 10,000+ entries by developing and implementing a Name Entity Recognition (NER) extraction model, saving 10 hours of manual work and streamlining data validation process
  • Utilized data extraction and manipulation techniques to reduce tax form completion time by 32%, leading to process improvements and improved customer satisfaction
  • Enhanced product marketing strategies by applying unsupervised learning to segment 70k customer data, leading to a improvement in targeting accuracy
  • Spearheaded client engagements, analyzed business requirements, and facilitated internal promotion for an automation tool designed to reconcile VAT return files
  • Increased operational efficiency by 96% through the development of Python based tool creation for standardizing000+ rows of Excel and text data, leading to comprehensive and accurate reconciliation reports

Data Analyst

Tech Mahindra Ltd, IT Services and Consulting
02.2017 - 08.2019
  • Improved data accessibility by 18% through Oracle database design and implementation, enhancing client engagement data management with streamlined SQL queries
  • Developed Tableau dashboards through visualizing and communicating data to stakeholders, leading to a 15% increase in identifying support ticket trends
  • Engineered intranet site using SharePoint and web technologies, achieving 40% engagement in the number of users
  • Accelerated SQL data query efficiency by 20% through the implementation of subqueries and CTEs
  • Achieved an 80% increase in productivity by streamlining a multi-stage approval system through the integration of
  • Power Automate and PowerApps, automating workflows and reducing approval times
  • Delivered a 20% reduction in development cycle time by optimizing the complete Software Development Life Cycle (SDLC) through JIRA, improving workflow management and team collaboration
  • PROJECTS
  • Precipitation Prediction with Geospatial Data | Python, TensorFlow, ConvLSTM, LSTM
  • Brainstormed challenges related to imbalanced temporal data for precipitation prediction model
  • Successfully designed ensemble machine learning model, achieving 80% accuracy in rain prediction
  • STEDI Human Balance Analytics | AWS Glue, S3 Data Lakes, Spark
  • Processed sensor data to train machine learning models by loading JSON data from an S3 data lake into Athena tables using Spark and AWS Glue

Education

Master of Science - Information Systems

Northeastern University
Dec 2023

Bachelors - Electronics Engineering

Rajiv Gandhi Proudyogiki Vishwavidyalaya
May 2015

Data Science, Database Management, Data Structure & Algorithms, Machine Learning in -

Object Oriented Programming, Probability and Statistics, Linear Algebra -

Skills

  • TECHNICAL SKILLS
  • Programming Language:
  • Python (Pandas, NumPy, Matplotlib, Scikit-learn, Seaborn, OpenCV, TensorFlow, PySpark), R
  • Databases: Oracle SQL, MySQL, SQL Server, PostgreSQL, MongoDB (NoSQL), Apache Cassandra
  • Big Data Technologies: Spark and Datalakes, Kafka, Airflow
  • Business Intelligence Tools: Tableau, Power BI, Qlik Sense
  • Machine Learning Algorithms: Linear Regression, Logistic Regression, Random Forest, Decision Trees, K-means Clustering
  • Neural Networks, Deep Learning (CNN, RNN, ConvLSTM, LSTM), NLP, Generative AI
  • Hard Skills:
  • Data Modeling and Data Warehousing, Exploratory Data Analysis, Model Evaluation
  • Cloud Platform: AWS (IAM, EC2, AWS Sagemaker, S3, DyanamoDB), Snowflake
  • Other Tools:
  • Git, Slack, JIRA, Microsoft Teams, GitHub, Tableau, PowerAutomate, Power Apps

Timeline

Data Scientist (Advanced Analyst)

Ernst & Young | Financial Services
10.2019 - 12.2021

Data Analyst

Tech Mahindra Ltd, IT Services and Consulting
02.2017 - 08.2019

Master of Science - Information Systems

Northeastern University

Bachelors - Electronics Engineering

Rajiv Gandhi Proudyogiki Vishwavidyalaya

Data Science, Database Management, Data Structure & Algorithms, Machine Learning in -

Object Oriented Programming, Probability and Statistics, Linear Algebra -

BHAWANA AGARWALData Scientist