Summary
Overview
Work History
Education
Skills
Kaggle Competitions
Selected Academic Projects
Timeline
Generic

Supreetha Naik

San Jose,CA

Summary

  • Total 5+ years of experience with 3+ years as a Data analyst. Strong skills in SQL, building reporting/KPI dashboards using Tableau and power BI.
  • Experience in building data models, ETL processing on AWS, with proficiency in SQL, NoSQL databases; experience working with Databricks, Hadoop and Spark.
  • Immense knowledge and experience in computer vision and NLP, and GPT3 models.
  • Built Machine Learning models using Python libraries, Pandas, Skicit-learn, Tensorflow, and PyTorch.
  • Agile project lead, technical project management, and cognizance in Agile requirement design.

Overview

7
7
years of professional experience

Work History

Data Science Intern - NLP

Hindsight Technology Solutions
06.2023 - Current
  • Working on building text recommendation system by extracting entities using Spacy for Named Entity Recognition.
  • Entails text scraping using Beautiful Soup and building NLP data pipeline.
  • Researching and leveraging parallel processing to accelerate NER on 17 million records.

Graduate Teaching Assistant

San Jose State University
01.2023 - Current
  • As teaching assistant for spring and fall 2023, Instructed 5 in-person lectures on supervised learning models and ensemble models for 40 students.
  • Mentored students, graded assignments, exams evaluating student progress.
  • Built ML models, wide and deep neural networks, image recognition and masking, hybrid recommender systems, and collaborative and content-based filtering.

Senior Analyst

Accenture
04.2021 - 01.2022
  • Led and managed operations within supply chain and inventory domain, ensuring end-to-end business delivery.
  • Implemented robust data quality checks on upstream data achieving 90% accuracy rate.
  • Developed insightful Power BI dashboards, enabling real-time project tracking and significant 25% improvement in defect detection
  • Successfully spearheaded database migration from Toad DB to MongoDB, resulting in remarkable 40% enhancement in system performance.
  • Developed advanced statistical models for accurate demand forecasting, effectively mitigating excess inventory, leading to significant 15% reduction in inventory holding costs.

IT Data Analyst

TATA Consultancy Services
01.2019 - 03.2021
  • Subject matter expert in data analysis/processing, optimizing SQL queries in PostgreSQL that improved query performance by 30%
  • Designed and developed data models to support business processes, resulting in a 40% reduction in data processing time
  • Maintained data warehouses for data storage, and over 500+ autosys jobs batch processing in Unix
  • Experience in Microsoft Azure cloud for project management, improved team efficiency by 25% through creating custom dashboards on JIRA and automating project workflows
  • Mentored a team of 13, resulting in a 20% increase in team productivity and individual growth.

Software Engineer

Tata Consultancy Services
05.2016 - 12.2018
  • Created over 800 test cases, test reports, worked on APIs and (CI/CD) pipelines in git and Jenkins
  • Implemented A/B testing on a new feature and identified about 10 critical bugs that were fixed before the release, resulting in a 30% decrease in customer tickets
  • Performed root cause analysis, bug fixing, and knowledge in SDLC, STLC, and agile requirement design.

Education

MASTER OF SCIENCE - DATA ANALYTICS

JOSE STATE UNIVERSITY
San Jose, CA
12.2023

BACHELOR OF ENGINEERING - ECE

VISVESWARAYYA TECHNOLOGICAL UNIVERSITY
2015

Skills

  • PROGRAMMING LANGUAGES: SQL, Python
  • LIBRARIES: Numpy, Pandas, Scikit-Learn, MatplotLib, Seaborn, Tensorflow, Pytorch, Keras, Scipy, OpenCV
  • DATABASES: MongoDB, Rapid SQL, Toad DB, MySQL Workbench, AWS (RDS, Redshift, S3, Glue, Brew)
  • REPORTING TOOLS: Tableau, Power BI, Microsoft Excel
  • PROJECT MANAGEMENT: Microsoft Azure, JIRA, Agile, HP ALM
  • MACHINE LEARNING: Regressions, SVM, Naïve bayes, KNN, Decision Tree, PCA, Semi-supervised,
  • DEEP LEARNING: Computer vision, NLP, CNN, RNN, LSTM, Transformer, BERT, eLMO, Deep QNN, GPT3, NER

Kaggle Competitions

  • Ship detection and masking using custom CNN (Unet) and pre-trained models (Resnet, Mask R CNN, Mobile Net).
  • Google competition of American Sign Language (ASL) recognition and conversion to text using LSTM and Transformer.

Selected Academic Projects

 STACK OVERFLOW RANKING SYSTEM USING NLP 

  • Built stack overflow search algorithm using BERT transformer model with LSTM, ELMO.
  • Conducting research on Facebook AI similarity search for building ranking model.

 MUSIC RECOMMENDATION SYSTEM FOR MENTAL HEALTH 

  • Built music recommendations using ML models – XGBoost, Random Forest, KNN, SVM. Utilized SMOTE for data balancing and developed a user-interactive chatbot for the recommendation system.
  • Implemented collaborative filtering using matrix factorization using SVD to enhance the system.

 ENERGY CONSUMPTION ANALYSIS USING BIG DATA TECHNOLOGIES 

  • Developed ETL pipelines to extract, transform, load and integrate 10GB of data from various sources into data warehouse Redshift from S3; utilized glue crawlers for processing

 DECENTRALIZED SMART GRID LOAD PREDICTION USING ML 

  • Smart grid stability prediction using machine learning models leveraging SVM, Random Forest,XGBoost to classify load performance.
  • Implemented hyperparameter tuning to optimize machine learning model performance, increasing accuracy by up to 5%.


Timeline

Data Science Intern - NLP

Hindsight Technology Solutions
06.2023 - Current

Graduate Teaching Assistant

San Jose State University
01.2023 - Current

Senior Analyst

Accenture
04.2021 - 01.2022

IT Data Analyst

TATA Consultancy Services
01.2019 - 03.2021

Software Engineer

Tata Consultancy Services
05.2016 - 12.2018

MASTER OF SCIENCE - DATA ANALYTICS

JOSE STATE UNIVERSITY

BACHELOR OF ENGINEERING - ECE

VISVESWARAYYA TECHNOLOGICAL UNIVERSITY
Supreetha Naik