Summary
Overview
Education
Skills
Work History
Certification
Projects
Timeline
Teacher
Yashaswini Y

Yashaswini Y

Broadlands,VA

Summary

Highly competent Data Engineer with background in designing, testing, and maintaining data management systems. Possess strong skills in database design and data mining, coupled with adeptness at using machine learning to improve business decision making. Previous work resulted in optimizing data retrieval processes and improving system efficiency.

Highly competent Data Engineer with 4 years of experience in IT, specializing in data modeling, analytics, and software development for Finance and Healthcare sectors. Skilled in Agile (SCRUM), Lean and Six Sigma methodologies with expertise in big data frameworks. Known for high productivity, problem-solving, and collaboration, delivering innovative data solutions across complex environments.

Overview

2
2
Certification
1
1

Machine Learning course by Andrew NG

5
5
years of professional experience

Education

Masters - Computer Science

Texas A&M University
USA
08.2021 - 01.2023

Bachelor's - Computer Science

JNTUH University
INDIA
05.2019

Skills

IDE and BI tools: Eclipse, Android Studio, Notepad, Microsoft Visual Studio, Data Grip, PowerBI, Tableau

Programming languages: Python, C, C, R, Scala

Database: SQL Server, DynamoDB, MongoDB, Cassandra, Oracle, PostgreSQL, MySQL, NoSQL

Operating Systems: Windows, UNIX, Linux, Ubuntu, Mac

Work History

Data Engineer

ThermoFisher Scientific Company
San Diego, California
03.2023 - Current
  • Leveraged Python libraries likIe Matplotlib and NumPy to produce 50+ impactful data visualizations, enhancing decision-making and data interpretation for business units.
  • Automated data validation and cleaning with over 50 Python scripts, reducing data processing time by 30%. Ensured dataset consistency by deduplicating and standardizing data with Pandas.
  • Developed and optimized stored procedures in SQL Server Management Studio to ensure precise data validation from source to target.
  • Demonstrated advanced SQL expertise, including Common Table Expressions (CTE), for querying, manipulating, and analyzing large datasets to optimize performance and maintain data integrity.
  • Managed data governance within HDFS and performed data analysis in Spark using Scala and PySpark. Experienced in AWS Cloud services, including EC2, EMR, VPC, S3, IAM, CloudFront, CloudWatch, Redshift, CloudFormation, and Direct Connect.
  • Created 70+ Step Functions, 55+ Lambda functions, and rules, specializing in serverless architecture and ETL processes via AWS Step Functions and Glue.
  • Configured AWS Data Pipeline to load data from S3 into Redshift, utilizing AWS Glue Catalog with a crawler to retrieve data from S3 and execute SQL query operations.
  • Designed and developed Security Framework to provide fine grained access to objects in AWS S3 using AWS Lambda, DynamoDB.
  • Implemented AWS Step Functions to automate and orchestrate the Amazon SageMaker related tasks such as publishing data to S3, training ML model and deploying it for prediction.

Graduate Student Teaching Assistant

Texas A&M Corpus Christi
, TX
01.2022 - 12.2022
  • Skilled Assistant with hands-on experience in independently managing over 10 projects in dynamic business environments.Proficient in developing Python scripts to automate data parsing and loading processes, successfully handling over 50k records.
  • Strong background in database management, including SQL and MongoDB, with a proven ability to retrieve and process over 1TB of data in real-time. Adept at working in fast-paced settings and delivering data solutions that drive operational efficiency

AI Research Assitantship

Texas A&M Corpus Christi
, TX
08.2021 - 12.2021
  • Al Research Assistant with a strong foundation in data engineering, focusing on developing scalable data pipelines, real-time data processing, and automation for AI/ML models
  • Skilled in data preparation, ETL processes, and feature engineering for machine learning models, enabling efficient Al model training and evaluation.
  • Experienced in deploying and optimizing data workflows, ensuring the accuracy and integrity of research data for Al applications.

Associate Engineer

Virtusa
05.2019 - 07.2021
  • Experienced Engineer with a strong background in designing and optimizing data solutions across SQL, NoSQL, and cloud platforms
  • Skilled in building and automating CI/CD pipelines, managing large-scale datasets, and developing real-time data ingestion pipelines using tools like Spark, Kafka, and AWS services (Redshift, EMR, Kinesis).
  • Created complex SQL queries to pull data from multiple tables across remote databases using joins, database links, and bulk collects, improving execution time by 60%.
  • Proficient in SQL, Python, and data visualization tools like Tableau and Matplotlib, driving improved data processing efficiency and decision-making.
  • Designed scalable data pipelines for data ingestion, processing, and transformation with Spark, Flink, and Kafka.
  • Expertise in database management (PostgreSQL, SQL Server) and streamlining workflows to enhance project delivery timelines by up to 30%.

Certification

  • AWS Certified Cloud Practitioner
  • AWS Certified Solutions Architect Associate.

Projects

  • NOVEL METHOD FOR CAR PRICE PREDICTION WITH MACHINE LEARNING TECHNIQUES: [https://github.com/yachamaneniy/Car_Price_Prediction]
  • SMART FOOD MANAGEMENT:[ https://docs.google.com/document/d/1PvyYz7NLjofZo97dCsyc4_YV5yxRWVlr/edit]
  • DOG BREED CLASSIFICATION: [https://drive.google.com/file/d/1C8eiPuAAtDwDPFaHLbjQwB_fGQh338zJ/view]
  • IMPROVING USER SEARCH EXPERIENCE BY CALCULATING USER INTENSION: [https://docs.google.com/document/d/1shB_DgBlM4ZJNuv1U3mikt56bjpGRRzo/edit]
  • CLASSIFICATION OF IRIS SPEIES USING DATA MINING ALGORITHMS.[https://drive.google.com/file/d/1X0S3biqFs1yhdmVZfc5fV8YTU9i87CYv/view]

Timeline

Data Engineer

ThermoFisher Scientific Company
03.2023 - Current

Graduate Student Teaching Assistant

Texas A&M Corpus Christi
01.2022 - 12.2022

Masters - Computer Science

Texas A&M University
08.2021 - 01.2023

AI Research Assitantship

Texas A&M Corpus Christi
08.2021 - 12.2021

Associate Engineer

Virtusa
05.2019 - 07.2021

Bachelor's - Computer Science

JNTUH University
Yashaswini Y