Experienced, result-oriented, resourceful and problem-solving Data Engineer with leadership skills. Over 6+ years of experience and a proven knowledge of database design, data warehousing and implementation of various applications in Big data on cloud.
Overview
6
6
years of professional experience
5
5
years of post-secondary education
1
1
Certifications
Work History
AWS Data Engineer
Fannie Mae
Reston, VA
02.2017 - Current
Performed data migration and designed Enterprise Data Lake on cloud from various source systems using AWS Glue.
Developed Spark applications using PySpark API on EMR to validate,process and transform XML payload to parquet.
Implementation of automated metadata and batch driven framework for ETL process in cloud using AWS services SQS,SNS and lambda.
Performing CDC and incremental loading using Spark, Hive and Apache Hudi.
Orchestrating ETL jobs using Cloud native services like Lambda and Step Functions to create snapshots on Redshift Data Warehouse.
Created DAGs using Aitrflow to schedule and monitor workflows with better control.
Designed and automated Redshift unload jobs replacing existing feed from on-prem Netezza database to S3 Data Lake.
Creating workflows and mappings for ETL process in on-prem using Informatica and Autosys.
Generated detailed studies on potential third-party data handling solutions, verifying compliance with internal needs and stakeholder requirements.
ETL Informatica Developer
Cognizant Technology Solutions Ltd
Bangalore, Karnataka,India
05.2013 - 07.2015
Design, implement, or operate comprehensive data warehouse systems to balance optimization of data access with batch loading and resource utilization factors, according to customer requirements.
Develop or maintain standards, such as organization, structure, or nomenclature, for design of data warehouse elements, such as data architectures, models, tools, and databases.
Map data between source systems, data warehouses, and data marts.
Perform system analysis, data analysis or programming, using variety of computer languages and procedures.
Provide or coordinate troubleshooting support for data warehouses.
Designed Mappings in Power center and moved data from SQL database, to oracle database which has modified structure using Flat files.
Used Workflow manager for running and monitoring & scheduling jobs.
Involved in rectifying source code for production failures.
Education
Master of Science - Computer Science
University Of North Carolina At Charlotte
Charlotte, NC
08.2015 - 12.2016
Bachelor of Technology Electrical Engineering - Computer Science
Jawaharlal Nehru Technology University
Andhra Pradesh,India
10.2008 - 04.2012
Skills
Apache Spark
undefined
Certification
AWS Certified Developer
Timeline
AWS Data Engineer
Fannie Mae
02.2017 - Current
Master of Science - Computer Science
University Of North Carolina At Charlotte
08.2015 - 12.2016
ETL Informatica Developer
Cognizant Technology Solutions Ltd
05.2013 - 07.2015
Bachelor of Technology Electrical Engineering - Computer Science