Summary
Overview
Work History
Education
Skills
Accomplishments
Soft Skills and Domain Knowledge
Languages
Timeline
Generic
SWAPNIL BHOOTWALA

SWAPNIL BHOOTWALA

Seattle,WA

Summary

Senior Data Engineering Leader with over 13 years of experience across advertising, marketing, and finance domains. Expertise in architecting both batch and real-time data pipelines, developing data lakehouse solutions, and building enterprise data warehouses for analytics and machine learning applications. Proven track record in designing scalable data platforms supporting MLOps and GenAI initiatives, implementing automated data quality frameworks and monitoring systems, and optimizing data ingestion workflows. Demonstrated success in leading technical teams, driving data-driven product development, and delivering end-to-end data solutions that enable advanced analytics, business intelligence, and AI applications

Overview

14
14
years of professional experience

Work History

Sr. Data Engineer

Amazon
02.2017 - Current
  • Building real-time data pipelines which consumes streaming data and publish to API end point for application consumption. This pipeline servers 100M+ advertisement recommendations to the Partners across 10+ marketplace. These recommendations are driving $900M+ revenue uplift for Amazon Ads.
  • Building data infrastructure for Analytical and Application use-case using Apache iceberg. These platform serves 100+ analytical use-case along with real-time front end application which extracts Partners performance data and helps partners with setting-up the account planning. Due to this ~300+ Partners are able to set up the account plan automatically without any manual touch points. Overall saving of 400+ hours every year during account planning cycle.
  • Build data pipelines which consumes data from 10+ sources to calculate partner incentives based on their performance metrics. Extend it to data solution where sending these incentives data to UI Application in form of SQS messages and integrate it with ML application for forecasting purpose.
  • Build BYOD (Bring your own data) platform to automate processing the data from different stakeholders (Internal and external). The BYOD platform is configuration based which creates data pipelines dynamically and reduces the data on-boarding time by 80% and hence saved 400+ hours for Data Engineering team in a year.
  • Build high scalable data catalog syncer service to make data available cross teams and share the data-lake without copying the underlying datasets
  • Build data dependency checker to enable software applications to check for the available partitions under data lake before consumption. Additionally build Rest API on top of dependency checker and make it available cross teams
  • Designed hourly data pipelines which reads data from SNS topic and store into S3 based data lake using Kinesis Firehose and EMR based platform. This application processes 10 billion messages in a day.
  • Build self-service and triggered based system for science team to retrieve the data based on their requirements from external team's API interface and store into data lake for ML use-cases. The Pipeline launches the resources when it triggers, processes the data and loads into Data Lake built on S3.
  • Efficient collaboration with PM (Product Manager), BIE (Business Intelligence Engineer, DS (Data Science) teams and address their requirements/problems, and ensure working backward from Customers ‘needs.

Information Technology Analyst

Tata Consultancy Services
07.2011 - 02.2017
  • Data Modeling & Data Warehouse designing using Oracle DB to deliver ETL-BI Solutions
  • Installation of SAP BODS & BO product suites (V 4.2) on Microsoft Server
  • Deliver complex & enhanced Visualization\Reports\Semantic layer using BI tools like SAP BO, Lumira, Universe Designer, Tableau
  • Deliver highly optimized and complex ETL framework to populate Data Warehouse data using ETL tools like SAP BODS (Business Objects Data Services), SAP ABAP Data flow & SSIS
  • Data Migration from SAP R3\ECC to Oracle platform & vice versa using ETL framework and delivering Data Warehouse on top of Oracle Database
  • Understanding of various financial modules like SAP Balance sheet, FICO (Arrears)
  • SAP HANA Modeling (Analytical view, Attribute View & Calculation View) for BI reporting purpose
  • Requirements gathering by communicating with clients and analysis using developing complex & highly optimized SQL, PL-SQL queries on Database platform

Education

Bachelor of Engineering - Information Technology

Sardar Patel University
India
05.2011

Skills

  • ETL
  • Data Pipeline
  • Cloud Technologies (AWS)
  • Database
  • Data warehouse
  • SQL
  • MLOps
  • GenAI application integrations
  • Big Data Technologies like Hadoop, Scala, Spark, Hive
  • Scripting Languages (Python, Pyspark, Scala)
  • Automation
  • NoSQL
  • OLTP & OLAP
  • Streaming Data
  • Kinesis
  • Kafka
  • SQS
  • Real time Data Processing
  • AWS technologies like S3, Lambda, EMR, Glue, Athena, SNS, Redshift, DynamoDB
  • Unix/Linux
  • Rest APIs
  • Leadership
  • Integrity
  • Problem Solving
  • Communication
  • Data warehousing
  • Data modeling
  • Performance tuning
  • Machine learning
  • NoSQL databases

Accomplishments

  • Spearheaded the development of a comprehensive data platform for Partner Advertisement recommendation program, driving $450M in annual revenue with projected growth to $1B by 2025.
  • Architected a multi-platform data infrastructure leveraging Redshift, Spark, and AWS Glue, supporting 200+ daily active users and managing 1000+ daily data pipelines.
  • Engineered a sophisticated Partner incentives calculation framework, integrating ML-based forecasting capabilities with Amazon's ledger system for automated performance tracking.
  • Implemented an Apache Iceberg-based data lake solution delivering sub-second latency for Partner performance metrics through UI applications.
  • Successfully mentored two interns through their professional development, leading to their conversion into full-time employees.

Soft Skills and Domain Knowledge

Soft Skills:

Leadership, Integrity, Problem Solving, Communication, Result Driven, Mentor


Domains worked on: 

Marketing, Finance, Banking and Advertising

Languages

English
Full Professional
Gujarati
Native or Bilingual
Hindi
Native or Bilingual

Timeline

Sr. Data Engineer

Amazon
02.2017 - Current

Information Technology Analyst

Tata Consultancy Services
07.2011 - 02.2017

Bachelor of Engineering - Information Technology

Sardar Patel University
SWAPNIL BHOOTWALA