Summary
Overview
Work History
Education
Skills
Websites
certifications
Timeline
Generic

DEVASHISH SARVADE

Cleveland,OH

Summary

Highly skilled Lead Data Engineer with over 5 years of experience in designing, building, and optimizing secure, scalable, and high-performance data architectures. Proficient in Python, SQL, and Java, with hands-on experience in Big Data technologies such as Apache Spark, Kafka, Hadoop, and Snowflake. Demonstrated expertise in cloud platforms (AWS, Azure, GCP) and a strong understanding of Identity & Access Management (IAM) principles to ensure data security, governance, and compliance. Extensive background in ETL/ELT processes, data modeling, and real-time data streaming solutions to support analytics and business intelligence initiatives. Committed to mentoring others, driving innovation in data engineering, and embracing agile development methodologies.

Overview

6
6
years of professional experience

Work History

Sr. Data Engineer

United Airlines
Houston, TX
07.2024 - Current
  • Designed and optimized real-time data pipelines using Apache Spark, Kafka, and PySpark to enhance decision-making for airline operations.
  • Implemented IAM security policies for access control, encryption, and compliance in cloud data storage solutions (AWS S3, Snowflake).
  • Developed ETL pipelines with Apache Airflow to process structured and unstructured data, ensuring efficient data ingestion.
  • Integrated Amazon Kinesis and Apache Kafka for real-time event streaming, improving system performance by 50%.
  • Collaborated with business intelligence teams to build interactive dashboards using Tableau and Python, enabling data-driven insights.

Data Engineer

CapitalOne
09.2023 - 06.2024
  • Built and maintained scalable data models to support risk assessment and fraud detection using AWS Redshift, Snowflake, and SQL.
  • Developed secure ETL pipelines using Python and Apache Spark, ensuring data quality and consistency.
  • Implemented role-based access controls (RBAC) and data encryption, strengthening security for sensitive customer data.
  • Automated data workflows with Apache Airflow, reducing manual intervention and enhancing operational efficiency.

Data Engineer

Tata Consultancy Services
08.2019 - 04.2023
  • Architected Big Data solutions leveraging Apache Hadoop and Databricks, reducing batch processing times by 40%.
  • Developed scalable data storage solutions using Azure Data Lake to enable seamless data access.
  • Designed and implemented ETL pipelines for Azure Synapse Analytics, ensuring efficient data transformation.
  • Integrated IAM security protocols for access management, compliance, and data protection.

Education

Master of Science - Information System

Trine University
USA
08.2024

Skills

  • Apache Spark, Hadoop, MapReduce, Databricks, Kafka, Airflow
  • Data Modeling (Conceptual, Logical, Physical), Data Warehousing (Snowflake, Redshift, Teradata)
  • ETL/ELT Pipelines (Informatica, Apache NiFi, Talend)
  • Database Management (SQL Server, Oracle, MySQL, PostgreSQL, MongoDB, HBase, Cassandra)
  • Python, Java, Scala, SQL
  • Python Libraries: Pandas, NumPy, SciPy, Scikit-learn
  • AWS: S3, Lambda, Glue, Kinesis, EMR, Lake Formation
  • Azure: Data Factory, Data Lake Storage, Event Hubs, Synapse Analytics
  • GCP: BigQuery, Google Cloud Storage
  • Role-based access control (RBAC), Data Encryption, Data Governance
  • IAM principles, compliance frameworks, security audits
  • CI/CD Pipeline Management

certifications


  • AWS Certified Data Analytics – Specialty
  • Google Cloud Certified – Professional Data Engineer (In Progress)


Timeline

Sr. Data Engineer

United Airlines
07.2024 - Current

Data Engineer

CapitalOne
09.2023 - 06.2024

Data Engineer

Tata Consultancy Services
08.2019 - 04.2023

Master of Science - Information System

Trine University
DEVASHISH SARVADE