
Data Engineer with over 2 years of experience in developing data pipelines using Informatica Powercenter in United Services Automobile Association(USAA Insurance company) with strong analytical skills along with proficiency in SQL & Python.I am eager to contribute to development projects that enable data to serve as a strategic asset. Equally confident working independently and collaboratively as needed and utilizing excellent communication skills.
Data Analysis of Health-care charges incurred by patients in the United StatesAug. 2020 – Dec. 2020
Credit Card Fraud DetectionJan.2021 – Mar.2021
Data analysis of NYC taxi ride duration
Aug.2021 – Oct.2021
• The goal of the project was to perform an explanatory data analysis (EDA) on NYC’s Yellow Taxi Trip Records from 2020 to find correlation among the various variables to improve ride time predictions.
• Applied deep neural network (NN) and MLP (Multi-layer perceptron) to perform regression modeling analysis on the NYC Yellow Cab dataset and have also used Azure Databricks, Azure Data Lake Gen2, Azure Data Factory and Spark core for the analysis of NYC taxi ride duration with spark SQL and PySpark where required and designed ETL pipeline.