Khushboo Saxena

New York, United States

Summary

FOCUSED, DATA-DRIVEN, ANALYTICAL MINDSET, CONSISTENT, PASSIONATE, PROCESS-DRIVEN (AGILE), TEAM PLAYER

Recent M.S. graduate with hands-on experience, eager to apply a strong command of Python, R, SQL, and DAX to analyze and interpret complex data. Skilled in analytics tools such as Power BI and Tableau, with a specialization in statistical modeling, predictive analytics, and big data work using cloud solutions and programming libraries including Pandas and TensorFlow. With a 5-year track record in business intelligence, I am poised to deliver innovative solutions and strategic insights in a dynamic, data-driven landscape.

Overview

6 years of professional experience

Work History

Data Analyst

iConsult Collaborative
06.2022 - 08.2022
  • Revolutionized racial text detection in county deeds data by engineering advanced Python scripts, achieving an impressive 30% boost in efficiency; expertly managed version control using GitHub.
  • Pioneered integration of the Google Cloud Vision API for high-volume (600,000+ documents) image analysis, accelerating data retrieval by a remarkable 50%. Conducted insightful ad-hoc analysis, significantly cutting time-based expenses by 40%.

Business Intelligence Engineer

IBM
08.2016 - 07.2021
  • Spearheaded the credit risk and exposure movement analysis project for Lloyds Bank, accurately predicting 85% of high-risk credit defaulters.
  • Developed prediction and classification models in Python using machine learning algorithms such as regression, KNN, random forest, and gradient boosting to identify credit defaulters, reducing financial losses by 10%.
  • Used SQL and Python to develop a statistical model and interim report on the impact of KPIs on credit risk, analyzing customer spending habits, customer segmentation, payment patterns, and overdraft frequency to forecast potential defaults, improving early risk detection by 15%.
  • Integrated heterogeneous data sources in Tableau Prep using blends and joins to create interactive reports and dashboards with charts, calculations, dual-axis views, and filters, identifying patterns and anomalies to reduce credit risk.
  • Created ETL data pipelines in SSIS to extract and load data from diverse sources into an on-premises SQL Server, ensuring privacy of historical data.
  • Developed data pipelines in Databricks and Snowflake to extract, transform, and load terabytes of data for analytics and machine learning, reducing data processing time by 14%.
  • Built a relational database design and data model in SQL Server, enhancing data integration and decreasing data redundancy by 23%.
  • Implemented Azure DevOps to optimize project development lifecycle using Agile methodologies, promoting efficient team collaboration.
  • Executed end-to-end A/B testing pipelines to evaluate the impact of various strategies on key business metrics, providing valuable insights for decision-making.

Education

Master of Science in Applied Data Science
Coursework: Introduction to Data Science, Business Analytics, Data Admin Concepts & DBMS, Machine Learning, Data Analysis, Quantitative Reasoning for Data Science, Big Data Analytics, Data Warehousing, Managing Data Science Projects

Syracuse University
Syracuse, NY
05.2023

Bachelor of Technology in Computer Science & Engineering
Coursework: Business Intelligence, Database Management, Computer Programming and Problem-Solving, Object-Oriented Programming

Galgotias University
Greater Noida, India
05.2016

Skills

Technical Skills


Programming Languages: Python, R, SQL, DAX

Tools: Power BI, Tableau, SSIS, Alteryx, RStudio, Jupyter Notebook, MS Excel (Pivot Tables, Power Query, VLOOKUP), PowerPoint

Data Science & Machine Learning: Statistical Modeling, Quantitative Analysis, Predictive Analysis, Time Series Forecasting, Regression Analysis, Classification, Clustering, Bayesian Methods, Decision Trees, SVM, Random Forest, Naïve Bayes, GLM, K-means, Gradient Boosting

Libraries & Packages: NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, TensorFlow, PySpark, NLTK, dplyr, tidyverse, ggplot2, e1071, rpart

Databases/Cloud Technologies: MSSQL, MySQL, Looker, Google Data Studio, S3, AWS Data Pipeline, AWS Glue, Redshift, Snowflake, Databricks

Big Data: Hadoop, MapReduce, Apache Spark

Strategy Methodologies: SWOT Analysis, Agile methodology, Scrum

Projects

Hotel Cancellation Prediction Project (R, Tableau, Machine Learning), 10/01/21 - 12/01/21

Used R for data cleaning and analysis, and Tableau for sensitivity analysis, leading to a 30% reduction in hotel cancellations by identifying key cancellation drivers and customer demand trends. Devised machine learning models including association rule mining, SVM, and decision trees, achieving 87% accuracy in predicting future hotel cancellations. 


Heart Disease Prediction (Python, Supervised Learning Models, Feature Scaling), 03/01/22 - 05/01/22

Performed comprehensive data cleansing and exploratory analysis in Python, combined with SMOTE sampling and machine learning techniques such as K-Nearest Neighbors and Random Forest, to develop a heart disease prediction model with a 90% accuracy rate.


NBA Player Stats (PySpark, Regression/Clustering, Grid Search, Feature Engineering), 10/01/22 - 12/01/22

Performed linear and random forest regression with hyperparameter tuning in PySpark and applied feature engineering and logistic regression techniques to predict a player's net rating and offensive capabilities, achieving an AUC of 71.21%.
