Seasoned Senior Data Engineer at United Health Group (OPTUM Technologies) with expertise in the Hadoop ecosystem and SQL. Known for a proactive approach and strong problem-solving skills; spearheaded data analytics projects that improved data processing and insights. Proficient in Spark development and skilled in machine learning techniques.
Overview
8
years of professional experience
Work History
Senior Data Engineer
United Health Group (OPTUM Technologies)
01.2020 - Current
Developed and maintained data lakes and analytical platforms using Databricks on Azure, ensuring scalability, data security, and automation of infrastructure as code (IaC)
Developed and maintained continuous integration and continuous deployment (CI/CD) pipelines for schema migrations, workflows, and cluster pools using tools like Git, Jenkins, Azure Repos, and Azure Pipelines
Leveraged Apache Spark with Kafka integration to process large volumes of streaming data for batch and real-time analytics, achieving a 3x performance improvement compared to traditional batch processing methods
Developed scalable data processing pipelines using Apache Spark and Scala to handle large datasets.
Utilized Spark MLlib to build and deploy machine learning models, focusing on feature engineering and model evaluation.
Conducted hyperparameter tuning and model optimization using Spark’s distributed computing capabilities.
Strong knowledge of Hadoop architecture, including HDFS, JobTracker, TaskTracker, and MapReduce concepts.
Managed and scheduled jobs on a Hadoop cluster using Oozie and Airflow.
Created shell scripts and processes for data integration and maintenance
Notable Projects: StepWise, Contact Analytics, Medical Crossover
Software Engineer
United Health Group (OPTUM Technologies)
06.2016 - 01.2020
Used MuleSoft ESB to create APIs for O.I.L (Optum Integration team)
Created a Java X12 EDI parser that parsed 837I and 837P files into searchable key-value pairs
Provided ongoing maintenance, bug fixes, and patch sets for an existing application
Conducted extensive troubleshooting to identify root causes of issues and implement effective resolutions in a timely manner.
Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
Designed scalable ETL pipelines for improved data ingestion, processing, and storage.
Trained junior team members on best practices in big data engineering, fostering a culture of continuous improvement.
Contributed to establishing a continuous integration/continuous deployment (CI/CD) pipeline that automated the software release process.