Summary
Overview
Work History
Education
Skills
Websites
Certifications Trainings
Hobbies and Interests
Awards Recognitions
Languages
Timeline
Generic

Sumedh Sathe

Remote,India

Summary

Highly skilled Data Engineering Manager with over 13 years of experience, including 3 years of onsite experience in the USA. Successfully drives cost-effective data platform modernization and builds high-performance, scalable data solutions. Proven expertise in migrating legacy systems to modern cloud platforms such as Hadoop, AWS, Microsoft Fabric, and Databricks. Adept at optimizing the full data engineering lifecycle, from data ingestion to advanced analytics. Passionate about leveraging emerging technologies like AI, machine learning, and real-time data streaming to drive efficiency and enable data-driven business decisions.

Overview

13
13
years of professional experience

Work History

Data Engineering Manager

Tredence Inc.
10.2021 - Current
  • Designed and implemented a data warehouse for a leading US-based retailer to support backend warehouse management use cases
  • Leveraged Snowflake Data Platform, DBT Cloud, and Astro Cloud for the solution, with responsibilities including data modeling and workflow design using DBT and Astro Cloud
  • Spearheaded a team of 11-12 data engineers in migrating an Oracle Exadata on-premises data warehouse to Microsoft Fabric for a leading Saudi-based retailer
  • Leveraged Microsoft Fabric Data Factory, PySpark Notebooks, Warehouses, and Lakehouses, achieving a modernized, data architecture enhancing data processing performance by 40%
  • Designed a configurable data ingestion pipeline on Microsoft Fabric with load assurance and data quality checks, reducing manual efforts by 40% and improving scalability and reliability
  • Supported client RFPs by building technology architecture patterns, blueprints along with estimations related to Big Data Platform Modernization and Cloud Migrations

Specialist Senior – Data Engineering

Deloitte Consulting LLP
05.2015 - 10.2021
  • Designed and implemented data lake for Advanced Metering Infrastructure for one of largest power and utility client in USA using Oracle Big Data Appliance, Apache Spark, Apache NiFi and Confluent Kafka
  • Led the delivery of data engineering track from onshore (Chicago location) by coordinating with client architects and vendors to onboard structured, semi-structured data onto Hadoop based data platform using Apache NiFi, Confluent Kafka and Oracle Goldengate technologies
  • Built dynamic reconciliation framework on Hadoop based data lake using Apache Spark, Hive, HDFS and Control-M to measure/track critical data elements of the Insurance product for an US based Insurance giant
  • Led and built configurable Data Lake Ingestion framework using AWS EMR, AWS S3, AWS RDS, AWS Redshift, Python, PySpark, Apache NiFi and Confluent Kafka to reduce the initial cost and manual efforts
  • Framework is being used in multiple client engagements

Senior Software Engineer – Data Engineering

Capgemini India Pvt. Ltd.
02.2014 - 05.2015
  • Designed and developed generalized application for predicting the approval/rejection of mortgage load depending upon the demographic characteristics of the applicants using Hadoop, Sqoop, Avro, Pig, Hive, R (packages used: Rhive, Shiny), D3.js
  • Part of Big Data Center of Excellence team, evaluated tools and technologies under Big Data umbrella for and continuously provided/guided team members and other department teams on Big Data/Hadoop training and related resources

Software Engineer

FusionCharts (An Idera, Inc. Company)
12.2011 - 02.2014
  • Custom dashboarding using FusionChart.js (Company Product) and integration of FusionCharts.js with various BI tools in other technology frameworks such as Angular.js, Knockout.js

Education

Bachelor of Engineering - Computer Science

G. H. Raisoni College of Engineering
Nagpur
06.2011

Skills

  • Cloud Platforms: Microsoft Fabric, Snowflake, Databricks, AWS
  • On-Premise Data Platforms: Cloudera, Oracle BigData Appliance (BDA)
  • Ingestion Tools : Apache NiFi, Confluent Kafka
  • ETL Tools/Processing : dbt (Data Build Tool), Apache Spark
  • Data Languages : Python, SQL
  • Databases : PostgreSQL, Oracle, Redshift
  • Orchestration Tools : Astronomer Airflow, Control-M

Certifications Trainings

  • Databricks Certified Data Engineer Associate
  • Apache Spark Essentials
  • AWS Data Engineer Essentials

Hobbies and Interests

  • Travelling
  • Swimming
  • Singing

Awards Recognitions

Recognized for optimizing critical UAT pipelines to meet tight deadlines, resulting in successful project delivery. Due to the impact of my work, the client requested my onsite presence to collaborate closely with their business team.

Languages

English
Full Professional

Timeline

Data Engineering Manager

Tredence Inc.
10.2021 - Current

Specialist Senior – Data Engineering

Deloitte Consulting LLP
05.2015 - 10.2021

Senior Software Engineer – Data Engineering

Capgemini India Pvt. Ltd.
02.2014 - 05.2015

Software Engineer

FusionCharts (An Idera, Inc. Company)
12.2011 - 02.2014

Bachelor of Engineering - Computer Science

G. H. Raisoni College of Engineering
Sumedh Sathe