Seasoned Senior Data Engineer at United Health Group (OPTUM Technologies) with expertise in the Hadoop Ecosystem and SQL. Known for a proactive approach and strong problem-solving skills, successfully spearheaded innovative data analytics projects that enhanced data processing and insights. Proficient in Spark development and skilled in machine learning techniques.
Overview
8
8
years of professional experience
Work History
Senior Data Engineer
United Health Group(OPTUM Technologies)
01.2020 - Current
Developed and maintained data lakes and analytical platforms using Databricks on Azure, ensuring scalability, data security, and automation of infrastructure as code (IaC)
Developed and maintained continuous integration and continuous deployment (CI/CD) pipelines for schema migrations, workflows, and cluster pools using tools like Git, Jenkins, Azure Repos, and Azure Pipelines
Leveraged Apache Spark with Kafka integration to process large volumes of streaming data for batch and real-time analytics, achieving a 3x performance improvement compared to traditional batch processing methods
Developed scalable data processing pipelines using Apache Spark and Scala to handle large datasets.
Utilized Spark MLlib to build and deploy machine learning models, focusing on feature engineering and model evaluation.
Conducted hyperparameter tuning and model optimization using Spark’s distributed computing capabilities.
Strong knowledge of Hadoop Architecture such as HDFS , JOB Tracker , Task Tracker and Map Reduce concepts.
Managing and scheduling Jobs on a Hadoop cluster using Oozie and Airflow.
Created shell scripts and processes for data integration and maintenance
Notable Projects: StepWise, Contact Analytics, Medical Crossover
Software Engineer
United Health Group(OPTUM Technologies)
06.2016 - 01.2020
Used Mulesoft ESB to create API's for O.I.L( optum Integration team)
Created java X12 EDI parser that parsed 837I and 837P files to produce key and values that is searchable
Provided continued maintenance and development of bug fixes and patch sets for existing application
Conducted extensive troubleshooting to identify root causes of issues and implement effective resolutions in a timely manner.
Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
Designed scalable ETL pipelines for improved data ingestion, processing, and storage.
Trained junior team members on best practices in big data engineering, fostering a culture of continuous improvement.
Contributed to establishment of continuous integration/continuous deployment (CI/CD) pipeline, automating software release process.
Education
Master of Science - Computing And Business
New Jersey Institute of Technology
Newark, NJ
Bachelors - Computer Science
New Jersey Institute of Technology
Newark, NJ
Skills
Hadoop Ecosystem
SQL Expertise
ETL development
Data Pipeline Design
Spark Development
Kafka Streaming
Scala Programming
Git Version Control
Jenkins Automation
Azure Databricks
Apache Airflow/Oozie
Machine Learning
Timeline
Senior Data Engineer
United Health Group(OPTUM Technologies)
01.2020 - Current
Software Engineer
United Health Group(OPTUM Technologies)
06.2016 - 01.2020
Master of Science - Computing And Business
New Jersey Institute of Technology
Bachelors - Computer Science
New Jersey Institute of Technology
Similar Profiles
Swetha ArunSwetha Arun
Sr Quality Engineer at United Health Group (Optum Technologies)Sr Quality Engineer at United Health Group (Optum Technologies)