8 Years of experience in IT with extensive expertise in dealing with RDBMS and Big Data technologies leveraging SQL Server, Redshift, Azure/AWS Cloud services, Spark.
Overview
8
8
years of professional experience
Work History
Senior Data Engineer
Amazon.com
09.2022 - Current
Designed and implemented data pipelines and ETL processes to ingest, transform, and load supply chain data from diverse sources, enabling large-scale analytics and reporting.
Collaborated with cross-functional teams to design and implement scalable data architectures, using Redshift, AWS Glue, S3, and other relevant technologies to meet business requirements.
Implemented data ingestion pipelines using Spark, enabling seamless extraction of data from diverse sources using Spark-SQL and PySpark as programming languages.
Developed automated data quality monitoring solutions, including data profiling, data cleansing, and anomaly detections, reducing manual efforts, and improving efficiency.
Involved in on-call rotation process for the team to support the Data platform globally.
Served as the primary point of contact and subject-matter expert for all data-related inquiries, data quality issues, and troubleshooting within the team.
Created detailed technical documentation for ETL architectures, coding standards, user manuals, standard operating procedures, run books, etc.
Data Engineer 2
Microsoft Corporation
06.2019 - 08.2022
Implementation and delivery of Big Data solutions using Azure services and Sql Server
Extract Transform and Load data from Sources Systems to Azure Data Storage services using a combination of Azure Data Factory, Azure Data Bricks, T-SQL, Spark SQL, Scala
Developed Spark applications using Scala, PySpark and Spark-SQL for data extraction, transformation, and aggregation from multiple file formats for analyzing & transforming the data to uncover insights into the customer usage patterns
Data Analysis, Performance Tuning, Data cleansing, Data enrichment
Worked on SPARK using Azure Databricks service, to pull data from multiple sources using combination PySpark and Spark SQL to implement business logics.
Data Engineer
VanWagenen Financial Services Inc
11.2018 - 05.2019
Migrating on Prem SQL Databases to Azure Paas leveraging Azure SQL Databases
Refactoring the C#, Vb.net Applications to SSIS and SQL
Successfully migrated on-premises ETL workflows to the Azure cloud by leveraging Azure Databricks and Azure Data Factory.
BI Developer
Microsoft Corporation
06.2017 - 11.2018
Implementation and delivery of MSBI platform solutions to develop ETL, analytical, reporting
Developing Complex T-SQL Stored procedures
Actively monitored ETL jobs to ensure compliance with SLAs, effectively preventing any operational delays or oversights
Implemented ETL processes using SSIS to extract, clean, transform, and load data from diverse sources, employing complex transformations before storing it in SQL Server tables.
Involved in Dimensional modeling to Design and develop STAR/SNOW Flake Schema
Utilized Azure Databricks and Spark to extract data from various sources and implemented business logic using Spark SQL for data processing and analysis.
SQL Server Database Developer/Administration
Medtronic
01.2015 - 05.2017
Responsible for creating database objects like table, views, Stored Procedure, Triggers, etc
Collaborated within a production support team as a DBA to identify and resolve database and application issues, ensuring smooth operations and optimal performance
Involved in SQL Tuning by creation of indexes, rebuilding Indexes using Execution Plans
Worked with platform team on less intrusive database deployments
Designed and implemented TSQL queries for reporting and complex solution development.
Education
Master’s - computer science
Rivier University
Nashua, NH
05.2015
Skills
Big Data, ETL, Database Development/Administration
SQL Server 2019/2016/2014
AWS Glue, Amazon S3, Amazon EMR
Amazon Redshift, Dynamo DB
Azure Databricks, Azure Data Factory, Spark, Azure Synapse Analytics