Experienced Data Science professional who interprets and extract intelligence from data and solves complex business problems with Data Visualization, Data Modeling, Statistical and Machine Learning. Proficient in furnishing executive leadership team with insights, analytics , reports and recommendations enabling effective strategic planning across all business units, distribution channels and product lines.
Data Quality Team has tiers of audit reports to alert any Data Integrity Issues. The scope of this project is to define operational strategy using Data Science tools and methodologies to identify outliers or anomalies to build smarter audits. The purpose of the Anomaly Detection team was to optimize those reports leveraging data science tools and methodologies.
Tools Used : Pandas, Scikit – learn , Seaborne, Tableau, Terradata SQL , Numpy, Jira , MSTR , Docker
Thinkful program offers bootcamps to adult learners across the nation in Data Science. Mentors get the opportunity to work 1:1 with learners to guide them and gauge their understanding on the field of Data Science.
· Rectify any topics/doubts learners might have following the week’s content and go in depth to explain the concepts
The mentored learning sessions are two-hour sessions held every weekend to complement the learning material released to the learners at the beginning of each week. These sessions are run by industry professionals who are Data Scientists themselves with experience and expertise in the field. The purpose of these sessions is to give a business perspective to the theoretical material that the learners have gone over, in order to also make them ready for business world challenges.
As a part of System Machine Health Analytics Initiative, this team monitors the health of various tiers of engines and other machineries. The team is currently undergoing BRU(Bulk rule updates) to analyze the telematics data and/or Warranty data from John Deere engines and predict the fault in parts of these Engines for every Customer/Dealer.
Working alongside with the Clinical Researcher to analyze the statistical significance of various protocols assigned to Veterans to improve their health conditions such as Hypertension, Blood Pressure, Overweight and Covid-19 effects.
Tools and Environment: Azure Databricks, Azure Synapse, Spark ML, PowerBI, RDD , MS Sql
To address the management's increasing concern of continuous bus failures obtained data for different bus categories of buses from multiple tables that were distributed in three different warehouses: Teradata, Microsoft SQL Server, and Oracle Database. Ingested these data in one single platform in order to perform exploratory analysis.
Agglomerative and Divisive, DBSCAN)
Data Science Career Track Certification