Seasoned Data Scientist experienced in working with large datasets, breaking down information, and applying interpretations to complex business concerns. Proficient in distribution, predictive, and hypothetical modeling. Bringing 4+ years of related experience strengthening company operations.
Overview
5 years of professional experience
Work History
Data Scientist
TecArtists INC
Boston, USA
05.2023 - Current
Performed thorough data analysis on large datasets using SQL, Python, and Excel, identifying patterns and trends that informed important business decisions
Implemented end-to-end natural language processing (NLP) and machine learning solutions, along with result evaluation metrics, on web-based platforms such as Heroku, using data-driven insights to improve business capabilities
Developed and maintained automated data pipelines and ETL procedures for an application, ensuring data consistency and correctness while cutting processing time by 30%
Used PL/SQL to create, test, and execute triggers, stored procedures, and other database-level querying routines that improved database efficiency
Created dynamic, visually appealing dashboards and reports in Tableau and Power BI, giving stakeholders access to real-time data so they could make well-informed decisions.
Collaborated with other scientists, ML engineers, and program managers to execute the data science roadmap.
Data Scientist
Cognizant Technology Solutions
Pune, India
12.2018 - 11.2021
Designed and built predictive models for automatic comment classification based on specified business factors, using clustering, SVM, Bayes, and Elastic Net for analytical and optimization tasks
Conducted A/B tests to validate changes in user interface designs that could improve user experience.
Used outlier detection methods to find and fix anomalies in medical data, resulting in a 30% decrease in data errors
Oversaw a one-terabyte database of finance data, ensuring that data processing and R&D analysis procedures ran smoothly
Evaluated models using a variety of metrics (such as the confusion matrix, AUC/ROC, and F-score), raising model accuracy by 20%
Used the maximum variance and Pearson's correlation methods to find important regression model predictors
Applied integrated Test-Driven Development (TDD) in a fast-paced Agile environment, improving code reliability and consumer and business development time by 20%
Collaborated with cross-functional teams to assess project outcomes and features.
Marketing Data Analyst Intern
Integration IT Solutions
India
01.2018 - 12.2018
Used Python and SQL to analyze data, including debugging, troubleshooting, and monitoring, to identify the top 10% of growth-oriented regions and locations with strong distribution potential
Analyzed supply chain data in Excel and Power BI to find bottlenecks and areas needing improvement, optimizing inventory control and delivery efficiency and improving the user experience
Reduced software delivery times by 7% through feedback-driven measures to optimize supply chain operations.
Education
Master's in Artificial Intelligence (AI) and Machine Learning -
University of North Texas
Denton, TX
05.2023
Bachelor's in Electronics Communication and Computer Science Engineering -