8 Years of experience in Data Engineering and Data Analysis. Hands on experience in Python, SQL, Spark, Snowflake, Kafka, Hive, Airflow and AWS. Excellent team lead and team player with good written and verbal communication skills.
Overview
8
8
years of professional experience
1
1
Certification
Work History
Senior Data Engineer
TCS
06.2022 - Current
Built end to end data pipeline in production using Spark from Kafka/S3 source systems to target system: Snowflake and JSON parsing done via Spark
Data pipeline maintained in AWS and Apple private cloud; scheduled via Apple internal tools and airflow, based on time and event
Spark resource optimization done based on data volumes and Job metadata maintained in Cassandra DB
Working on tech stack: S3, Spark, Python, Scala, Snowflake, Airflow, Kafka and GIT
Managing Team of Eight (one onsite/seven offshore)
Data Engineer
TCS
01.2019 - 05.2022
Built data pipeline using Spark to pull data from different source system like AWS S3, Kafka and Oracle to load in Snowflake, Teradata and Hive
Data pipeline maintained in AWS; scheduled via Apple internal tools and Airflow
Worked in centralized platform team to help onboard several applications to Spark
Python script created to publish event to trigger data pipeline.
Programmer Analyst/ Data Engineer
Reliable Software Resource Inc
06.2018 - 12.2018
Built Data pipeline through Python script to parse JSON file from Source Database and Python Pandas used for data wrangling/ Data cleansing
Worked on Spark/Hive Query to do data Transformation and sqoop for Data Movement and Cassandra to store
Supported Data Science Spark specific applications and involved in deployment process
Built Data modelling (Dimensional and Relational modelling).
SQL Analyst /Data scientist
Populus Group
08.2016 - 01.2018
Worked on Python( Scikit-learn) to Built prediction, classification model and Text Processing, Alteryx/ Hive/ Pig for Data Transformation and Javascript/ Qlikview for Dashboards
Conducted training session on how to analyze and process data using Pig and Hive (Hadoop Ecosystem) and to build custom object in QlikView
Solutions Consultant
Vitria Technology Inc
09.2015 - 08.2016
Worked on Spark(Scala)/ Hive for Data processing and Scala Code to fetch Data from REST API
Built predictive model using R(random forest algorithm)
Wrote JavaScript functions to parse Xml data from REST URL and to show sample sales data in Bipartite widget
Education
Master of Science - Business Analytics
The University of Michigan- Dearborn
Dearborn, MI
08.2015
MBA - Business Administration
VIT University
Vellore, INDIA
08.2014
Bachelor's Degree - Electronics and Communication Engineering