Data Engineer with in-depth knowledge of ETL and SAS/Python programming, paired with expertise in integrating and implementing new data pipelines. Offering 5+ years of experience managing the design, development, and delivery of reliable data.
Accomplishments:
1. Developed a Final Action delta algorithm to replace pulling 100% of the data, which lowered memory and resource utilization and reduced query time by 8 hours and data processing time by 6 hours.
2. Automated a data pipeline in PySpark to replace SAS code. The pipeline validates Medicaid Statistical Information data and generates the National Summary report, reducing report generation time from 8 hours to 30 minutes per report.
Accomplishments: Developed a tool using Java and PowerShell scripts to automate moving data from the production database to the staging database, saving 12 hours of effort per week.
Accomplishments: Developed a tool using VBA and Excel macros to perform compliance checks for Aetna, which achieved CMMI Level 3 for the year 2015. The tool reduced manual effort and saved $150/year.