Summary
Overview
Work History
Education
Skills
Interests
Certification
Languages
Timeline
Generic

ASHISH GUPTA

Tracy

Summary

Senior Software Engineer - Data Infrastructure at Robert Bosch with deep expertise in developing robust, scalable data pipelines and warehouses. Strong focus on team collaboration, driving projects to successful completion, and adapting to evolving requirements. Proficient in spark using scala programming language, multiple big data tools and frameworks. Values delivering high-quality results and fostering productive work environment.

Overview

14
14
years of professional experience
1
1
Certification

Work History

Senior Software Engineer - Data Infrastructure

Robert Bosch
12.2022 - Current
  • Created new data pipelines in Databricks by incorporating new data sources using Spark
  • Enhanced the performance and architecture of existing data pipelines
  • Developed scalable application monitoring infrastructure
  • Managed multiple projects simultaneously while maintaining strict deadlines and high-quality standards.
  • Maintained comprehensive documentation of development work, facilitating knowledge sharing among team members.

Senior Data Engineer II

Together Labs
04.2019 - 12.2022
  • Currently heading project to implement a change data capture (CDC) pipeline from mysql DBs to cloud data warehouse using debezium and apache HUDI
  • Created new Hive scripts for new ETL jobs
  • Created Spark jobs using scala to generate daily/weekly/monthly aggregated data
  • Refactored and fine tuned many spark and hive jobs for better performance and extensibility
  • Created Spark streaming job that ingests and aggregated data continuously from kafka and stores the aggregated data on s3 for tableau ingestion
  • Headed the data migration project from on-premises to aws cloud
  • Created a data reconciliation system using spark which compared the data generated by the migrated jobs in cloud against the old jobs at on-prem, running in parallel
  • Migrated many complex multistage jobs that had mysql-hive intermediate steps to only hive
  • Migrated the on-prem hive/spark jobs to cloud aws using docker and airflow as scheduler
  • Created new job in Data pipeline to send data to external vendors via REST API after OAuth2 authentication using Akka framework + scala

Data Engineer

Apple
02.2016 - 04.2019
  • Created MapReduce jobs and pig scripts to filter and aggregate data
  • Developed Spark Jobs to perform business logic on aggregated data to generate KPIs
  • Enhanced and maintained existing MapReduce and Pig jobs in production
  • Scheduled the MapReduce/Pig jobs using an in-house scheduler
  • Performed AdHoc analysis to identify and mitigate faulty scenarios

Data Developer

Infogain Corporation
10.2015 - 02.2016
  • Developed Spark Application to read the wav files, convert them to text in parallel, using Stanford NLP
  • Performed sentiment analysis on the converted text to generate sentiment score for the conversation
  • Trained a Spark Mllib Classification Model to detect the appropriate conversation sentiment
  • Stored the sentiment output as Parquet files and loaded on apache zeppelin UI, using spark Sql

Hadoop Developer

Elisa Oyj
04.2014 - 09.2015
  • Developed Map-Reduce programs for various business requirements (analysis of Web logs, DTH logs, CDR data)
  • Analyzed web logs to generate business KPI results
  • Written map reduce programs to identify patterns and KPI (most viewed bandwidths, most used customer portals, best campaigns according to the usage)
  • Generated KPI data monthly, weekly and annually according to the need

Scala/Java Developer

Elisa Oyj
02.2011 - 04.2014
  • Handled business critical application, interconnecting CRM with Billing Module
  • Enhanced existing Invoice generation and aggregation application with new functionalities
  • Developed new authentication functionality in the application UI to grant access to the e-letters and Invoices from a newly developed application, without creating a session
  • Maintained an application that was the frontend to the Billing Module, the application had complex business logic, written primarily in Scala and frontend in Apache Wicket
  • Design and development of new interface and features for MIBP application
  • Performed new application logic development in Scala for the backend of the wicket UI

Education

Bachelor of Science - Computer Science

Ajay Kumar Garg Engineering College
Ghaziabad, Uttar Pradesh
06.2010

Skills

  • Spark
  • Scala
  • Python
  • Hive
  • Azure
  • Databricks
  • Delta Lake
  • AWS
  • Kafka
  • Docker
  • Jsonnet
  • Kubernetes
  • Airflow
  • Jenkins
  • GitHub Actions
  • Apache HUDI
  • Debezium

Interests

Machine Learning

Certification

Functional Programming Principles in Scala, Coursera, 2021-03-01, KL425QK5TSCG

Languages

English
Hindi

Timeline

Senior Software Engineer - Data Infrastructure

Robert Bosch
12.2022 - Current

Senior Data Engineer II

Together Labs
04.2019 - 12.2022

Data Engineer

Apple
02.2016 - 04.2019

Data Developer

Infogain Corporation
10.2015 - 02.2016

Hadoop Developer

Elisa Oyj
04.2014 - 09.2015

Scala/Java Developer

Elisa Oyj
02.2011 - 04.2014

Bachelor of Science - Computer Science

Ajay Kumar Garg Engineering College