Summary
Overview
Work History
Education
Skills
Projects
Certification
Timeline
Generic

Rahul Paruchuri

Overland Park,KS

Summary

Dedicated and results-driven Senior Data Engineer with over 3 years of experience in designing, developing, and maintaining complex data solutions. Adept at working within Agile environments and leveraging modern engineering practices to deliver high-quality solutions. Seeking to contribute expertise to a forward-thinking organization that values innovation and excellence.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Data Engineer

Morgan Stanley
10.2022 - 05.2023
  • Designing, developing, and implementing data processing and analytical solutions that meet business requirements of organization
  • Actively participated in design reviews with peers and stakeholders to assess various technologies and make informed decisions for optimal implementation
  • Creating technical design for full report that featured examples of report layouts .Developing ETL mapping sheets that detailed source and target tables, join requirements between source and destination, and transformation algorithms
  • Leveraged AWS Snowflake cloud Data warehouse and AWS S3bucketto streamline data integration pipelines, resulting in improved efficiency and enhanced data quality
  • Leveraging experience with relational databases and SQL to develop and maintain data models and schemas that facilitate efficient data management and reporting
  • Designed and developed complex procedures to handle errors and exceptions at both application and database level using SQL and shell scripts
  • Developed Python Script to load CSV files into Staging Tables from NAS drive
  • Developed backend SQL packages, building UNIX shell scripts for data migration & Batch Processing
  • Developing Python scripts to support large data loading solutions, ensuring data quality and integrity throughout process
  • Worked with SQL Server Integration Services(SSIS) to optimize large data movements and loads, minimizing data processing times.
  • Managed multiple tasks and competing priorities in fast-paced environment, while maintaining high level of efficiency and quality in work.

Data Engineer

S2S SOFTSOL
09.2019 - 06.2021
  • Designed, developed, and tested ETL data integration solutions for product, ensuring data quality and reliability
  • Mentored and influenced peers to meet project commitments on time and with exceptional quality
  • Collaborated with cross-functional teams to understand data requirements and develop data models
  • Implemented Agile software development methodologies and utilized Agile management tools, including Jira, to enhance project efficiency and collaboration
  • Worked extensively with relational databases and Cloud-based technologies, optimizing data storage and retrieval processes
  • Led design and architecture of large-scale ETL solutions, improving data processing speed and efficiency.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability
  • Utilized Enterprise Data Warehouse data models and dimensional modeling concepts for source-to-target mapping and data integration architecture
  • Collaborated with data scientists and analysts to provide data support and ensure data accuracy and consistency.
  • Conducted performance tuning and optimization of data pipelines for optimal system performance
  • Documented data integration processes and best practices for knowledge sharing within team

Education

Masterof Science - Computer Science

University of Central Missouri
Warrensburg
12.2022

Bachelor of Technology - Computer Science

Lovely Professional University
India
08.2020

Skills

  • TECHNICAL SKILLS
  • Programming Languages:
  • Python, C/C, Java, JavaScript, SQL, HTML/CSS, XML, , C#NET, VMWare, Bash
  • Frameworks: Struts, Spring, Hibernate, Angular, Nodejs
  • Database: MYSQL, Mongo DB, Oracle, PostgreSQL
  • Technologies: Hadoop-MapReduce, Spark, CI/CD Pipelines, Pyspark, Airflow
  • Cloud: AWS, Azure, AWS EC2, AWS Lambda
  • Tools: Git, SVN, JIRA, Vim, Visual Studio, Docker, Lucid Chart, Visio, MS office, Excel, PPT, Office 365

Projects

 Discord chatbots with Integrated Google Calendar API.

 Designed and developed a discord bot in Spring framework to suggest user with most optimal way to finish the pending tasks. 

 Integrated bot with Google calendar to Login and directly fetch any prior scheduled tasks. The design flexibility allows to easily integrate with   any other calendars or chat-based applications.


 Invasive Ductal Carcinoma (IDC) Detection – Cancer detection from breast histology images 

  

 Created a convolutional neural network with TensorFlow and Kera’s that identified the presence of IDC with 87% accuracy.

 Compared the ability of traditional supervised learning techniques such as Linear Regression, K-Nearest Neighbors,

 Support Vector Machines, and Random Forests to detect IDC.

 Investigated the effects of dimensionality reduction on the ability of supervised techniques to classify images with IDC. 


 



Certification

  • AWS Certified Solutions Architect Associate

Timeline

Data Engineer

Morgan Stanley
10.2022 - 05.2023

Data Engineer

S2S SOFTSOL
09.2019 - 06.2021

Masterof Science - Computer Science

University of Central Missouri

Bachelor of Technology - Computer Science

Lovely Professional University
Rahul Paruchuri