Summary
Overview
Work History
Education
Skills
Affiliations
Timeline
Generic

HARI VASANTAPU

Cedar Park,TX

Summary

Highly skilled data scientist with [6+ years] of experience in analyzing complex datasets, developing machine learning models, and driving data-driven decision-making processes.5-year intense hands-on experience with data manipulation/ wrangling using SQL.

  • Three-year hands-on experience with multiple machine learning models (linear models, decision tree, SVM, EM, neural network and different ensemble methods) in Python and R (Scikit-Learn and Caret).
  • Experienced with data visualization and app creation using R; Service and model production in Python and Docker.
  • Able to navigate high-stress situations and achieve goals on time. Specialized in Big Data, Spark, Scala, Python, SQL, Elastic Search, Machine Learning, Deep Learning and Data Analysis.

Overview

7
7
years of professional experience

Work History

Sr Data Scientist And Cloud Engineering

SHARP LINK SOFTWARE
08.2023 - Current
  • Set up SQL database on cloud servers to store client data for query analysis.
  • Conducted exploratory data analysis to uncover insights into customer behavior and preferences, informing product development and marketing strategies.
  • Designed and deployed real-time analytics solutions to monitor key performance metrics and detect anomalies, improving operational efficiency and reducing downtime.
  • Collaborated with software engineers to integrate predictive models into production systems, ensuring seamless deployment and scalability.
  • Provided technical guidance and support to cross-functional teams on data science methodologies and best practices.
  • Developed intricate algorithms based on deep-dive statistical analysis and predictive data modeling.
  • esigning, maintenance and management of tools for automation of different operational processes.
  • Designed and implemented an automated CI/CD pipeline for a high-tra c e - commerce website, reducing deployment time by 75% and increasing uptime by 20%.
  • Developed and maintained automation scripts for managing and deploying applications on AWS, resulting in a 30% reduction in manual effort and a 25% increase in team productivity.
  • Implemented automated solutions for monitoring and logging of AWS services, improving system reliability by 40% and reducing mean time to resolution
  • MTTR by 50%.
  • Implemented efficient data ingestion process using REST API, AWS S3 and Hadoop for 400M profiles, increasing data accessibility and reducing manual effort.
  • Improved data accessibility and efficiency by creating a near real-time and batch data lake for cross-functional teams.
  • Ingested data from disparate sources using SQL and Google Analytics API to construct data views for BI tools like Tableau
  • Communicated with investors to understand needs, and translated their feedback into actionable reports in Tableau, saving 46 hours of manual work each month
  • Deployed a recommendation engine to production to conditionally recommend other menu items based on past order history, increasing average order size by 7%

Data Scientist

AMEL IT SOLUTIONS
01.2020 - 04.2023
  • Develop machine learning models (e.g. provider fraud, inter-departmental record linkage, 30 day hospital readmission etc)through cloud-based infrastructure to facilitate continuous integration/continuous deployment
  • Led data analysis projects, collaborating with cross-functional teams to extract actionable insights and drive business decisions.
  • Developed predictive models using machine learning algorithms to forecast customer behavior and optimize marketing strategies, resulting in a 95% increase in campaign effectiveness.
  • Implemented data pipelines and ETL processes to ingest, clean, and preprocess large-scale datasets, ensuring data quality and integrity.
  • Utilized advanced statistical techniques to uncover hidden patterns and trends in data, contributing to the development of new product features and enhancements.
  • Presented findings and recommendations to senior leadership, translating technical insights into actionable recommendations for business stakeholders.
  • Designed and implemented cloud architectures on AWS, Azure, and Google Cloud Platform (GCP), ensuring scalability, reliability, and security of cloud-based applications.
  • Orchestrated containerized applications using Docker and Kubernetes, optimizing resource utilization and facilitating seamless deployment and scaling.
  • Automated infrastructure provisioning and configuration management using Infrastructure as Code (IaC) tools such as Terraform and CloudFormation, reducing deployment time
  • mplemented continuous integration and continuous deployment (CI/CD) pipelines to streamline software delivery processes, enabling faster time to market.
  • Conducted performance monitoring, troubleshooting, and optimization of cloud-based systems to maximize efficiency and minimize downtime.

Jr. Data Scientist

Capgemini
07.2019 - 10.2020
  • Worked with stakeholders to develop quarterly roadmaps based on impact, effort and test coordinations.
  • Utilized advanced querying, visualization and analytics tools to analyze and process complex data sets.
  • Applied statistical and algebraic techniques to interpret key points from gathered data.
  • Developed intricate algorithms based on deep-dive statistical analysis and predictive data modeling.
  • Set up SQL database on cloud servers to store client data for query analysis.
  • Built and maintained dashboards and reports to track key performance metrics, providing stakeholders with actionable insights into business performance.
  • Conducted exploratory data analysis to identify opportunities for process optimization and revenue growth.
  • Collaborated with IT teams to design and implement data pipelines and ETL processes, improving data accessibility and efficiency.
  • Provided training and support to team members on data analysis tools and techniques, fostering a culture of data-driven decision-making within the organization.

Cloud Engineer

Eco Sleek Tech
07.2017 - 07.2019
  • Developed and enhanced existing software applications, optimizing performance and usability for end-users.
  • Implemented best practices for code reviews, testing, and debugging to ensure high-quality deliverables.
  • Extensive experience in designing and developing scalable cloud-based applications utilizing AWS services.
  • Pro cient in developing and deploying serverless applications using AWS
  • Lambda, API Gateway, and S3.
  • Skilled in containerization technologies such as Docker and Kubernetes for deploying applications in cloud environments.
  • Experienced in implementing CI/CD pipelines using tools like AWS
  • CodePipeline and Jenkins for continuous integration and delivery in cloud development.
  • Created visually appealing and easy to understand dashboards to communicate complex data insights.
  • Collaborated with cross-functional teams to gather requirements and design data visualization solutions.
  • Implemented best practices for data visualization, including selecting appropriate chart types and color palettes.
  • Collaborated with a cross-functional team to develop and deploy web applications using agile methodologies.

Education

Master of Science - Data Science

UNIVERSITY OF HERTFORDSHIRE
Hatfield, United Kingdom
12.2022

Bachelor of Technology - Computer Science And Programming

JNT UNIVERSITY
HYDERABAD
06.2017

Skills

Programming Languages: Python, R, SQL
Data Analysis and Manipulation: pandas, NumPy, dplyr
Machine Learning Libraries: scikit-learn, TensorFlow, Keras
Statistical Analysis: scipy, statsmodels
Data Visualization: Matplotlib, Seaborn, Plotly, ggplot2
Big Data Technologies: Hadoop, Spark, Hive, Pig
Database Management Systems: MySQL, PostgreSQL, MongoDB
Version Control: Git, GitHub
Data Preprocessing and Cleaning: pandas, tidyverse
Natural Language Processing: NLTK, spaCy
Deep Learning: Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN)
Cloud Platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP), Microsoft Azure
Dashboarding and Reporting: Tableau, Power BI, Dash

Affiliations

  • Founder of a small startup Dr Agri in india

Timeline

Sr Data Scientist And Cloud Engineering

SHARP LINK SOFTWARE
08.2023 - Current

Data Scientist

AMEL IT SOLUTIONS
01.2020 - 04.2023

Jr. Data Scientist

Capgemini
07.2019 - 10.2020

Cloud Engineer

Eco Sleek Tech
07.2017 - 07.2019

Master of Science - Data Science

UNIVERSITY OF HERTFORDSHIRE

Bachelor of Technology - Computer Science And Programming

JNT UNIVERSITY
HARI VASANTAPU