
Priyadarshini Selvam

Herndon, USA

Summary

With nearly 8 years of experience as a data scientist, I have worked closely with businesses to solve real-world, data-driven problems, delivering predictions, statistical recommendations, and insights that support business decisions. I am experienced in handling data on cloud platforms such as Snowflake, AWS, and Google Cloud, with strong SQL skills and programming expertise in Python, R, and SAS. I am proficient in building workflows and data pipelines using Airflow, SnapLogic, AWS, PySpark, Hadoop, Snowflake, and Kafka. As an individual contributor, I have used Databricks, AWS SageMaker, AWS Data Pipeline, Informatica, AWS Lambda, SnapLogic, Snowpipe, PySpark, and Airflow, along with Power BI and Tableau for reporting solutions. I have also mentored junior associates while leading POC and research implementations of emerging technologies such as Gen AI with Microsoft Copilot and Power Virtual Agents. My business domain knowledge spans the manufacturing, marketing, and finance verticals.

Overview

10 years of professional experience
1 Certification

Work History

Data Scientist

Caterpillar Inc.
02.2025 - Current
  • As a data scientist for the condition monitoring platform, analyzed customer fleet data to help troubleshoot platform issues.
  • Analyzed data from customer telematics devices in Snowflake and SQL to identify the root cause of reporting-status issues, and presented the findings in Power BI.
  • Conducted thorough exploratory data analyses for robust model building.
  • Streamlined data collection methods to reduce analysis errors by 50%.
  • Automated repetitive tasks using scripting languages such as Python and R, significantly reducing time spent on analysis.

Data Scientist

Kaytics Inc.
11.2024 - 02.2025
  • Prototyped a SaaS product that delivers marketing analytics solutions for a prospective marketing agency client, supporting budget decisions and digital and media marketing allocations. Built in Python and Azure, the product optimizes marketing budget spend and simulates the projected ROI % for upcoming quarters.
  • Built a digital media marketing attribution model for a CPG client using Python.
  • Revamped enterprise visualizations in Power BI Report Server for faster insights.
  • Customer segmentation: identified customer groups from customer survey comments using topic modelling (text analytics, NLP).
  • Contributed to building a marketing mix model that identifies which category or channel is most effective in driving conversions, and on which budget allocations are based.

Data Scientist

Caterpillar Inc.
07.2017 - 09.2024


  • Implemented the Marketing to Sales attribution model, which attributes a sale to an online marketing campaign, email, or website click, thereby justifying the ROI on marketing spend. Enhanced the model through a mix of data and multi-attribution algorithms, improving attribution from 10% to 12%.
  • Developed a Market Mix Model for optimizing spend across channels, with a simulation tool to visualize the impact on sales. Participated in workshops in North Carolina and Illinois with the business team to help improve the model. Built a validation matrix to assess the accuracy of the model and improved accuracy by 5%.
  • Developed Ready to Buy, a list of customers with high potential to buy their next product, based on extensive customer scoring and targeting. It resulted in the highest conversion rates and won buy-in from many business teams within the organization (improved recall by 2%; assessed and added two high-impact variables).
  • Built a chatbot integrated with Power BI dashboards to support interactive user search and Q&A over dashboard data, using Gen AI, Microsoft Power Virtual Agents, and Microsoft Power Automate.
  • Developed a weighted fuzzy match algorithm for customer data matching, which improved the match rate from 30% to 45% and formed the basis of a white paper titled ‘Data Match algorithm using Weighted Averages’.
  • Analyzed customer survey comments to identify areas of improvement suggested by customers, highlight key topics of appreciation, and run sentiment analysis on customer perception.
  • Implemented a Part Distribution dashboard in Power BI that extracts part-related information from dealer service network complaint text and correlates new complaints with the existing list of potential issues, providing visibility to the business.
  • Handled unstructured data from social media platforms, using topic modelling to identify the areas of concern customers discuss most frequently.
  • Performed ROI (Return on Investment) analysis and campaign effectiveness analysis for special media campaigns run as part of online marketing.

College Intern

Caterpillar Inc.
12.2016 - 06.2017
  • Developed a proof of concept and implemented an end-to-end product for Complex Event Processing (CEP) using Apache Spark Streaming on near-real-time unstructured data, as part of the university thesis.

Engineering Intern

Red Black tree technologies
06.2015 - 11.2015
  • Developed Android applications for an event management firm.
  • Worked on the architecture and development of a complete web application using Node.js.

Education

Master's degree - Computer Science

College of Engineering Guindy, Anna University
05.2017

Skills

  • Tools & Languages: AWS (SageMaker, Lambda, Glue, EC2), Python, R, SAS (Base and Advanced), SQL, C, Databricks, Apache Spark, Streamlit, Apache Kafka, Hadoop, Informatica, MongoDB, Microsoft Copilot (PVA), PyTorch, Airflow
  • Visualization: Power BI, Tableau, HTML, CSS, JavaScript, Google Looker Studio
  • Domain: NLP (Natural Language Processing), Image Processing
  • GenAI: LLM, LangChain
  • Cloud Platform: Snowflake, Google Cloud Platform (GCP), Google BigQuery, AWS Cloud Practitioner for Data Science

Certification

  • Blockchain Use Case & Architecture – NPTEL, Apr 2019
  • Introduction to Data Science (IIT Madras) – Caterpillar Continued Learning Program, Dec 2019
  • Marketing Analytics – NPTEL, certified by IIT Roorkee, Dec 2020
  • Scrum Master Certified (SMC) – Scrum Alliance, Oct 2021

Other responsibilities undertaken

  • Took part in the recruitment process to hire top talent into the Marketing Analytics group.
  • Mentored associates within the team on SAS and Python skills.
  • Organized the “Marketing & Brand- Getting to know” networking session with leaders to build business understanding within the marketing and brand team.
  • Completed the 18-month Analytics Professional Development Program as part of the Information Analytics Group, Caterpillar India.
