Highly competent Data Science/AI Senior Manager with over 16 years of cross-functional and technical experience in designing and implementing business engineering, intelligence, and machine learning solutions. Possessing 5+ years of Agile project management experience in the advertisement, banking, and insurance domains within the data and analytics technology space. Expertise in Python, Spark, Scala, and ML/DL algorithms to solve complex business problems. Seeking an enthusiastic, creative, and competitive environment that offers opportunities to enhance and apply skills while contributing to organizational growth. With 15 years of experience, including 6+ years providing business solutions on various data analysis platforms, data science, and machine learning. Also, have 5+ years of experience in AI/ML engineering with a strong focus on LLMs and NLP. Extensive experience in data testing for banking and insurance products.
ChatDnB provides capabilities of enhancing the search mechanism which helps to ask questions and get meaningful insights with relevant to the questions.,
Python, LLM, ChatGPT, Big Query, GCP, 4,
Individual Contributor, Worked on integrating test suite into github actions for automated execution on git commits, checkout., Implemented caching and memory usages., Performance tuning at the application level., Developed integration test suite using pytest in python for the ChatDnB framework.
This is an end-to-end data and analytics platform, with new advanced analytics tools, and a suite of modular front-end solutions that would help clients derive key insights, build new strategies, and take customer-level actions that can be easily integrated into their workflows.,
Azure DW(Redshift), AWS S3, AWS SageMaker, Matrix Profiler, Bayesian Networks, AWS DynamoDB, T5, NLP, ThoughtSpot, PowerBI
Data Engineering Manager, Leading 60+ engineers to implement, develop, design and test the Azure cloud based Data Modernize applications and worked with data science team to formalize the AI based decision-making products on the Financial Data., Re-engineer and re-developed the fraud detection algorithm, introducing new attributes resulting in decreased false positive by 90%., Used Amazon Sage Maker for Data Pre-processing such as Data Merging, Cleaning and doing missing value treatment., Leverage the Matrix Profile Algorithm to scan the Multi Variable Time Series Data to identify the Anomalies in the accelerated dataset., Extract Industry Events using NLP, Classify the Text based on the Window Based Outlier., Analyze the KPI Data Store to discover any inconsistent data characteristics, and pattern due to unavailability of key driving field values., Manage and work closely with User experience team Causal Graphs in D3.js & Python Instances., Owned the project planning, delivery, work breakdowns, resource planning, and day-to-day project management using Jira, Confluence and Scrums., Partner with engineering, and Product teams to identify product and technical requirements, and lead our team through dependencies and delivery milestones., Worked as an Individual contributor on developing ML models using python as well as building data pipeline using spark and scala.
The main aim of the is AI enabled solution is minimize the Order fallout and predict the root cause of the fall out incidents based on the Address Product. An efficient Order fallout management system ensures that order failures are detected and corrected early for prompt provisioning of customer service. This can be achieved by developing robust AI driven Process Control, a process that evaluates and monitors performance using data collected over time. Fallout results in Customer churn, Degradation of Service Offerings, and a diminished Customer experience. Here we have considered two AIML based use cases to predict, and resolve fallouts with minimum turnaround time
Root cause Analysis based on Historical Data:
Goal: When the Order fallout happens, AI can help in looking at the Symptoms, and predicting the Root Causes. The time taken to narrow down the root cause is a significant factor in determining resolution times. It requires expert help, usually the developers of the software or the product vendor. AI can help here by looking at the symptoms and predicting the root causes. This helps ITOps to get down to fixing the root cause quickly. Classification is a machine learning problem of identifying a set of categories to which a new observation belongs to classification. Those include simple decision trees, naive Bayes, random forests, support vector machines, and deep learning.
Predicting Root Causes with Keras:
When a new incident happens, we typically identify the symptoms of the incident first, and populate the related feature variables here, like error codes. We then pass these as an array to the model's predict class’s function. This function will return a numeric value for the root cause. We then translate the numeric value into a label using the inverse transform function on the encoder. We can use this model to predict root causes for a batch of incidents.
Tech Stack – GCP, BIGQUERY, Informatica, CoreLogic Data APIs for Address Products, Google Address Verifier APIs, Classifications Models (Random Forest, Survival Analysis, Multinomial logistic regression), Keras, PowerBI
Role – Data Science Manager.
Leading 30+ engineers to implement , Develop, Design and test the GCP cloud based MDM applications and worked with data science team to reduce address fallout risk using ML based Solution .Build an AI Driven Anomaly Detection to predict Errors in Service Order Management. Wherein, the ML model can gauge the Service Order Trends to determine if any item is an Error to Alert using Time Series Data with Service Order Requests, Used Python packages like SciKit-Learn for Data pre-processing activities like transforming values into an appropriate format.Analyze the KPI Data Store to discover any inconsistent data characteristics, and pattern due to unavailability of key driving field values, Manage and Worked closely with User experience team Causal Graphs in D3.jS & Python Instances,Owned the project planning, delivery, work breakdowns, resource planning, and day-to-day project management using Jira, Confluence and Scrums.Partner with engineering, and Product teams to identify product and technical requirements, and lead our team through dependencies and delivery milestones. Worked Architecting the entire solution and build a design of a single 360 polygon views of their MDM -Address products.Worked closely with the testing team to find out the automation strategy to test the Data , API, Reports and AI/ML models and their accuracy .
A privacy-friendly implementation of the item-item recommender system (an algorithm that learns “similarity” scores for two apps). The production model is trained on the Apple Media Prodcut server and transmits only the app-to-app similarity scores, which are user-agnostic, to Ad Platform servers. These app-to-app similarity scores would be calculated and transmitted for ALL ad-enabled storefronts on a daily/weekly basis. The main To have an user agnostic(unpersonalised) similarity score that tells us how common it is for the user to interact with a specific app and then another specific app, in the case where those two interactions happen .
Tech Stack – AWS, Spark , Scala , Statistical Model, Tableau
Role – Data Science Manager.
Responsibilities :
Worked and built the App recommendation model based on the app to app similarity using mathematical models .
Worked on EDA for user clicks data using python data analysis and pre-processing packages .
Build an MarkOv chain Model as well as mathematical logarithms models to find out the probabilistic ration( P value).
Worked with customer to understand the required and convert the requirement to model where we can get the similarity source between the two apps.
LLMs and NLP, Python, PyTorch, TensorFlow, LangChain, LlamaIndex, Text Analytics, Ad Analytics, Anomaly Detection, Recommendation Engine, Statistical Modelling, Gen AI and agnostic Ai