Summary
Overview
Work History
Education
Skills
Timeline
Generic

Sravanthi Joshi

Summary

Highly motivated and skilled Data Engineer with a passion for designing and implementing efficient data pipelines and solutions. Seeking to leverage my expertise in AWS infrastructure, data modeling, and ETL processes to contribute to the success of a forward-thinking organization.

Overview

11
11
years of professional experience

Work History

Data Engineer

Amazon.com Services
03.2019 - Current


  • Designed and implemented a high-throughput data pipeline to load near real-time log data from over 5 million AWS infrastructure hosts.
  • Collaborated with the AWS Storage team to provide critical health reports on drive components.
  • Architected a near real-time data pipeline using AWS Kinesis Firehose, Lambda, SQS, and Glue to process log data from 25+ AWS regions into Redshift.
  • Implemented a scalable Data Lake solution using S3, Athena, Glue catalog, etc., allowing customers to bring their own compute and manage constraints for data retrieval.
  • Optimized the teams S3 Cost to 50% by implementing the file storage from .json to parquet .
  • Optimized QuickSight dashboard refresh rates by 80% through the implementation of efficient Slowly Changing Dimensions and Facts.
  • Spearhead the optimization of ETL workflows, resulting in a 40% reduction in data processing time and enhanced overall system performance.
  • Mentor junior data engineers, fostering a culture of continuous learning and knowledge sharing within the team.
  • Collaborate with data scientists and BI Engineers to integrate machine learning models into data pipelines and facilitate data-driven decision-making.
  • Streamlined data processing and reporting for services like S3 and EBS by closely collaborating with the Hardware Engineering Storage team.

Data Engineer

Cisco Systems, Inc.
10.2012 - 02.2019
  • Designed and developed a robust data infrastructure by retrieving and aggregating data from multiple sources. This infrastructure provided key insightful business metrics to support senior management decisions.
  • Conducted data modeling and optimized designs to unify various data sets residing in different databases and systems into a single platform on SAP HANA.
  • Implemented data pipelines using Python to efficiently fetch data from diverse sources, such as oracle, and Teradata, and loaded it into SAP HANA.
  • Employed fine-tuning mechanisms like indexing, partitioning, query optimization, and calculation views to enhance the performance of SAP HANA, Teradata, and Hadoop Hive databases. These optimizations significantly reduced Tableau dashboard refresh time from 30 minutes to mere seconds.
  • Coordinated with cross-functional teams, sellers, and managed post-production activities to ensure seamless project delivery.

Education

Bachelors in Technology - Electrical, Electronics And Communications Engineering

Jawaharlal Nehru Technological University
Hyderabad
05.2008

Skills

  • Data Pipeline Design and Implementation
  • Cloud Computing (AWS services such as S3, Glue, Lambda, SNS, SQS, Redshift, Athena)
  • Data Modeling and Warehousing (Slowly Changing Dimensions, Facts, aggregates)
  • Python
  • Big Data Technologies ( Spark)
  • ETL Processes and Data Transformation
  • Data Lake Architecture
  • SQL and NoSQL Databases
  • Machine Learning Integration
  • Data Quality and validation, Data Lineage
  • Agile Development Methodology

Timeline

Data Engineer

Amazon.com Services
03.2019 - Current

Data Engineer

Cisco Systems, Inc.
10.2012 - 02.2019

Bachelors in Technology - Electrical, Electronics And Communications Engineering

Jawaharlal Nehru Technological University
Sravanthi Joshi