Over 16 years of experience in Data Engineering with a strong history of delivering projects on time in fast-paced environments. Proficient in designing and maintaining data pipelines using Ab Initio, Python, SQL, Hadoop, and Spark, alongside AWS cloud technologies. Achievements include significant cost reductions and enhanced operational efficiency. Led cross-functional teams to streamline data processing and ensure successful project outcomes.
Overview
15
15
years of professional experience
1
1
Certification
Work History
Lead Data Engineer
FannieMae
07.2023 - Current
Designed and implemented secure, scalable end-to-end data pipelines for diverse integrations into Cloud CDH DB.
Oversaw management of CDH application, ensuring compliance with security standards.
Created pipelines to load data from Microsoft SharePoint lists into CDH seamlessly.
Built API-driven pipeline for integrating PwC survey data into CDH repository.
Established encrypted pipelines for NPI data using external Voltage systems through API connections.
Tuned AWS Lambda functions to optimize cost efficiency and performance metrics.
Executed resilience testing procedures to validate application stability under pressure.
Senior Data Engineer
FEPOC Carefirst
07.2022 - 07.2023
Established letter distribution process streamlining real-time reading and transmission of letters to printing services.
Documented essential technical and application flow support documents for the support team.
Senior Data Engineer
Capital One
01.2021 - 07.2022
Architected and designed secure, scalable end-to-end data pipelines to integrate diverse data sources into Cloud OME DB.
Developed multiple AWS Lambda functions to orchestrate data processing and submit Spark scripts to EMR cluster.
Implemented over 130 data quality checks through Spark scripts to ensure data integrity.
Created complex Spark scripts for business transformation logic, publishing load-ready datasets.
Utilized Load from S3 utility for efficient data loading from AWS S3 to Aurora MySQL DB.
Modernized existing data pipeline for improved orchestration, reducing EMR costs by managing cluster lifecycle.
Managed AWS Aurora MySQL Database, implementing failover and fallback setups across regions.
Mentored new team members, providing essential business knowledge and supporting team deliverables.
Senior Data Engineer Consultant
FannieMae
01.2015 - 12.2020
Analyzed and rebuilt complex legacy application to enhance functionality without disrupting accounting processes.
Managed PL/SQL packages for data analysis, resolving monthly reporting issues.
Executed POC for migrating Ab Initio-based legacy application to AWS cloud.
Automated tax provision calculations and reconciliations for General Ledger using advanced cloud technologies.
Leveraged AWS S3, DynamoDB, and RDS for secure data storage solutions.
Orchestrated workflows using AWS Step Functions to ensure efficient Talend Package execution.
Transformed on-premises data to cloud storage utilizing in-house CLIPS.
Provided quantification of exceptions during month-end closing to business team.
Tech Lead
Capital One Financial Corporation
01.2011 - 04.2015
Developed complex Ab Initio graph for CapitalOne's PCI Compliance project to process narrative text and tokenize PAN information.
Conducted analysis to identify applications needing Tokenization/De-Tokenization within bio group.
Led integration of applications with Token Generation Environment as per PCI Data Security Standard.
Automated IRS report generation from external vendors, increasing efficiency in reporting processes.
Proposed ETL validations to reduce form rejection and re-processing from external vendors.
Created dynamic file processing system using PDL programming, decreasing development time for file transfers.
Designed validation approach for Customer Experience Metric Program, ensuring data quality prior to customer surveys.
Developed IT plans, PSETS, and reusable common graphs through PDL coding based on business requirements.