Summary
Overview
Work History
Education
Skills
Timeline
Generic

Praveen Yarragunta

Owings Mills,MD

Summary

I have 9+ years of expertise in ETL application development, encompassing design, testing, implementation, and support within OLTP & OLAP environments. Specializing in Talend Open Studio (5.x/6.x/7.x) and AWS Services (S3, Lambda, Step Functions, EMR, Redshift), I've crafted robust solutions for data integration and Big Data. Proficient in Data Warehousing concepts (OLAP, OLTP, Star Schema, Snowflake Schema) and adept at translating Talend logic into optimized SQL for Redshift. Skilled in ETL strategies, and full BI lifecycle, ensuring efficient data flow and modeling. Experienced in Hadoop ecosystem, Linux commands, and diverse data sources. Proven ability in SDLC, Agile methodologies, and collaborative team environments.

Overview

10
10
years of professional experience

Work History

ETL Data Engineer

Empower
01.2022 - Current
  • Design and implement effective data models for various applications and use cases
  • Utilize and configure AWS services, including S3, DynamoDB, EMR, Step Functions, Lambda and Redshift
  • Write and optimize complex SQL queries for data extraction, transformation, and loading (ETL) processes
  • Analyze the existing Talend jobs to comprehend the data flow, transformations, and business logic embedded within them and translate the logic implemented in Talend jobs into equivalent SQL queries or stored procedures suitable for Redshift
  • Break down complex Talend components into corresponding SQL statements, ensuring compatibility with Redshift's SQL dialect and capabilities
  • Develop and maintain Python scripts for data processing, transformation, and orchestration
  • Build and maintain ETL processes for moving and transforming data across various systems
  • Design, implement, and manage DynamoDB databases for efficient and scalable NoSQL data storage
  • Implement AWS Step Functions for orchestrating and coordinating complex workflows
  • Configure and manage EMR clusters for distributed data processing using tools like Apache Spark and Apache Hive.

Talend AWS Developer

MassMutual Life Insurance Company
05.2020 - Current
  • Design and implement ETL for data load from heterogeneous sources to SQL Server and Oracle as source databases
  • Create Talend jobs to retrieve data from legacy sources and flat files
  • Write Hive queries to fetch data from HBase and transfer to HDFS
  • Optimize performance of mappings by testing on sources, targets, and transformations
  • Develop ETL mappings for XML, CSV, and TXT sources and load data into relational tables
  • Migrate code and release documents from DEV to QA and production
  • Troubleshoot and debug Talend issues
  • Schedule Talend jobs with Talend Admin Console
  • Create Talend mappings to populate data into dimensions and fact tables
  • Implement Change Data Capture technology in Talend
  • Perform record count validation and schema validation
  • Use Confluence Page for source code control and project documentation
  • Develop, support, and maintain ETL processes using Talend Integration Suite
  • Work in all phases of full life cycle including analysis, design, development, testing, deployment, support, and maintenance.

Talend Developer

SIRVA Worldwide Moving & Relocation Services
10.2018 - Current
  • Collaborate with Data Integration Team to perform data and application integration
  • Perform technical analysis, ETL design, development, testing, and deployment of IT solutions
  • Participate in designing logical and physical data warehouse data model and architectures
  • Explore prebuilt ETL metadata and develop SQL code for SQL Server database
  • Create reusable Joblets and routines in Talend
  • Work on web services using Talend components
  • Experience in Big Data technologies like Hadoop, Hive, HBase, Sqoop, and Spark SQL
  • Hands-on experience with Snowflake utilities and Big Data model techniques
  • Troubleshoot data integration issues and bugs
  • Schedule Talend jobs with Talend Admin Console
  • Use Visual Studio Team Services for source code control and project collaboration.

Talend Developer

Seacoast National Bank
06.2018 - 10.2018
  • Work in Data Integration Team to perform data and application integration
  • Work closely with Business Analysts to review business specifications and gather ETL requirements
  • Create Talend jobs to copy files and utilize Talend FTP components
  • Manage source to target mapping documents
  • Analyze source data using Talend Data Quality
  • Migrate existing data center to AWS environment
  • Work with various Talend components and debug mode
  • Develop, support, and maintain ETL processes using Talend Integration Suite
  • Perform unit testing and integration testing
  • Use Confluence Page for source code control and project collaboration.

Talend Big Data Developer

T ROW PRICE
02.2016 - 05.2018
  • Work in Data Integration Team to perform data and application integration
  • Develop custom components and multi-threaded configurations with flat files using Java code in Talend
  • Interact with Solution Architects and Business Analysts to gather requirements and update Solution Architect Document
  • Deploy and schedule Talend jobs in Administration console
  • Create separate branches within Talend repository
  • Review requirements and implement DQ rules using Talend DI jobs
  • Create cross-platform Talend DI jobs to read data from multiple sources
  • Troubleshoot data integration issues and bugs
  • Tune ETL mappings, workflows, and data model for performance optimization
  • Configure Talend Administration Center for scheduling and deployment
  • Use Visual Studio Team Services for source code control and project collaboration.

ETL Developer

Agility E Services
02.2014 - 12.2015
  • Assist in gathering business requirements and develop Data Model and ETL procedures for Data Warehouse
  • Design and develop star schema model using ERWIN Data modeling
  • Use Informatica Power Center for data extraction, transformation, and loading
  • Implement Slowly Changing Dimension Type 1 and Type 2
  • Use Debugger to validate transformations and monitor data flow
  • Tune performance of Informatica Session and mappings
  • Work with QA Team and provide production support
  • Write UNIX Shell Scripts and use pmcmd command line utility
  • Develop, support, and maintain ETL processes using Informatica Power Center
  • Perform unit testing and participate in full life cycle development.

Education

Master of Science - Information Technology

Stratford University
Falls Church, VA
10.2017

Skills

  • UNIX, Linux, Windows 98/2000, 2003, Windows NT 40
  • XML, SQL, Unix/LINUX shell scripting
  • SQL developer, SQL Server Management Studio, DB2, SQL
  • Loader, Snowflake, dbeaver
  • Redshift, Oracle 11g/10g/9i/8i, DB2 1005, Mysql, Teradata V 140, Oracle, SQL Server 2008/2005/2000, Hive 013, Hadoop, Spark
  • SQL Navigator, WinSCP, Putty, MS-Office& Excel, VMWare Workstation
  • Talend Studio Data Integration& Big Data 731/721/701/64/63/621/562/552/531, Talend Administrator Console, Talend Management Console
  • Agile Methodologies

Timeline

ETL Data Engineer

Empower
01.2022 - Current

Talend AWS Developer

MassMutual Life Insurance Company
05.2020 - Current

Talend Developer

SIRVA Worldwide Moving & Relocation Services
10.2018 - Current

Talend Developer

Seacoast National Bank
06.2018 - 10.2018

Talend Big Data Developer

T ROW PRICE
02.2016 - 05.2018

ETL Developer

Agility E Services
02.2014 - 12.2015

Master of Science - Information Technology

Stratford University
Praveen Yarragunta