Total 5+ years of experience with 3+ years as a Data analyst. Strong skills in SQL, building reporting/KPI dashboards using Tableau and power BI.
Experience in building data models, ETL processing on AWS, with proficiency in SQL, NoSQL databases; experience working with Databricks, Hadoop and Spark.
Immense knowledge and experience in computer vision and NLP, and GPT3 models.
Built Machine Learning models using Python libraries, Pandas, Skicit-learn, Tensorflow, and PyTorch.
Agile project lead, technical project management, and cognizance in Agile requirement design.
Overview
7
7
years of professional experience
Work History
Data Science Intern - NLP
Hindsight Technology Solutions
06.2023 - Current
Working on building text recommendation system by extracting entities using Spacy for Named Entity Recognition.
Entails text scraping using Beautiful Soup and building NLP data pipeline.
Researching and leveraging parallel processing to accelerate NER on 17 million records.
Graduate Teaching Assistant
San Jose State University
01.2023 - Current
As teaching assistant for spring and fall 2023, Instructed 5 in-person lectures on supervised learning models and ensemble models for 40 students.
Built ML models, wide and deep neural networks, image recognition and masking, hybrid recommender systems, and collaborative and content-based filtering.
Senior Analyst
Accenture
04.2021 - 01.2022
Led and managed operations within supply chain and inventory domain, ensuring end-to-end business delivery.
Implemented robust data quality checks on upstream data achieving 90% accuracy rate.
Developed insightful Power BI dashboards, enabling real-time project tracking and significant 25% improvement in defect detection
Successfully spearheaded database migration from Toad DB to MongoDB, resulting in remarkable 40% enhancement in system performance.
Developed advanced statistical models for accurate demand forecasting, effectively mitigating excess inventory, leading to significant 15% reduction in inventory holding costs.
IT Data Analyst
TATA Consultancy Services
01.2019 - 03.2021
Subject matter expert in data analysis/processing, optimizing SQL queries in PostgreSQL that improved query performance by 30%
Designed and developed data models to support business processes, resulting in a 40% reduction in data processing time
Maintained data warehouses for data storage, and over 500+ autosys jobs batch processing in Unix
Experience in Microsoft Azure cloud for project management, improved team efficiency by 25% through creating custom dashboards on JIRA and automating project workflows
Mentored a team of 13, resulting in a 20% increase in team productivity and individual growth.
Software Engineer
Tata Consultancy Services
05.2016 - 12.2018
Created over 800 test cases, test reports, worked on APIs and (CI/CD) pipelines in git and Jenkins
Implemented A/B testing on a new feature and identified about 10 critical bugs that were fixed before the release, resulting in a 30% decrease in customer tickets
Performed root cause analysis, bug fixing, and knowledge in SDLC, STLC, and agile requirement design.
DEEP LEARNING: Computer vision, NLP, CNN, RNN, LSTM, Transformer, BERT, eLMO, Deep QNN, GPT3, NER
Kaggle Competitions
Ship detection and masking using custom CNN (Unet) and pre-trained models (Resnet, Mask R CNN, Mobile Net).
Google competition of American Sign Language (ASL) recognition and conversion to text using LSTM and Transformer.
Selected Academic Projects
STACK OVERFLOW RANKING SYSTEM USING NLP
Built stack overflow search algorithm using BERT transformer model with LSTM, ELMO.
Conducting research on Facebook AI similarity search for building ranking model.
MUSIC RECOMMENDATION SYSTEM FOR MENTAL HEALTH
Built music recommendations using ML models – XGBoost, Random Forest, KNN, SVM. Utilized SMOTE for data balancing and developed a user-interactive chatbot for the recommendation system.
Implemented collaborative filtering using matrix factorization using SVD to enhance the system.
ENERGY CONSUMPTION ANALYSIS USING BIG DATA TECHNOLOGIES
Developed ETL pipelines to extract, transform, load and integrate 10GB of data from various sources into data warehouse Redshift from S3; utilized glue crawlers for processing
DECENTRALIZED SMART GRID LOAD PREDICTION USING ML
Smart grid stability prediction using machine learning models leveraging SVM, Random Forest,XGBoost to classify load performance.
Implemented hyperparameter tuning to optimize machine learning model performance, increasing accuracy by up to 5%.
Sr. Executive Support at Cognizant technology Solutions, Cognizant Technology SolutionsSr. Executive Support at Cognizant technology Solutions, Cognizant Technology Solutions