Summary
Overview
Work History
Education
Skills
Timeline
Generic

Tarun K

Dallas,USA

Summary

Highly experienced Data Engineer with 5+ years of expertise in architecting and implementing data

solutions across AWS, Azure, and Informatica IICS (CDI, CDQ, CDGC, IDMC). Strong experience with

Python and PySpark. Proven ability to develop and optimize cloud-native ETL/ELT pipelines using

Informatica IICS(CDI), Powercenter, AWS Glue and Azure Data Factory, ensuring efficient data migration

and transformation. Expertise in Master data management using Informatica MDM, Cloud MDM.

Strong proficiency in cloud data warehousing (Redshift, Synapse), data integration, and

governance(CDQ, CDGC) with a track record of successfully integrating diverse data sources and

enhancing data quality within cloud environments. Adept at leveraging cloud-native services for

real-time data processing and infrastructure management, delivering robust and scalable cloud data

solutions that drive business insights. Skilled in Agile and Waterfall methodologies.

Overview

6
6
years of professional experience

Work History

Informatica Developer

Hyundai Capital of America
10.2024 - 05.2025
  • Led a team of four in a comprehensive data quality migration project, transitioning from Informatica Data Quality (IDQ) to Cloud Data Quality (CDQ).
  • Implemented and configured CDQ rules and processes, ensuring data accuracy and consistency post-migration.
  • Developed and optimized data quality workflows within CDQ to streamline data cleansing and standardization.
  • Utilized Microsoft Fabric for ETL processes, data visualization, and real-time data integration, enhancing data accessibility and insights.
  • Designed and implemented data pipelines within Microsoft Fabric to facilitate seamless data flow from source to target.
  • Created interactive dashboards and reports using Microsoft Fabric to visualize data quality metrics and project progress.
  • Ensured data integrity and compliance throughout the migration process, adhering to established data governance standards.
  • Conducted thorough testing and validation of migrated data quality rules and processes to ensure accuracy and reliability.
  • Provided technical guidance and mentorship to team members, fostering a collaborative and productive work environment.
  • Developed Python scripts and PySpark jobs within Microsoft Fabric to process and transform data for real-time analytics.
  • Implemented real-time data intelligence solutions in Microsoft Fabric by integrating and processing Apache Kafka data streams.
  • Utilized Microsoft Fabric's streaming capabilities to build dashboards that provided live insights from Kafka data.
  • Automated data validation and quality checks using Python within Microsoft Fabric data pipelines, enhancing data reliability.
  • Optimized existing mappings through partitioning based on volume or complexity of the data sets being processed.
  • Documented all ETL processes according to corporate standards.

Informatica Developer

Paycom Payroll LLC
05.2023 - 08.2024
  • Developed Python solutions for real-time data ingestion from Kafka into Informatica MDM via Business Entity Services API.
  • Implemented data integration solutions using Informatica IDMC/Customer 360, connecting diverse data sources.
  • Integrated Informatica IDMC with Amazon Redshift for scalable analytics and reporting.
  • Utilized AWS Glue for data integration, transformation, and automated ETL pipelines, including AWS Step Functions.
  • Optimized Java code and AWS Glue ETL jobs to enhance data processing efficiency and reduce resource consumption.
  • Designed and managed APIs within Informatica IICS to facilitate application communication and automate workflows.
  • Implemented data quality checks, masking, and governance practices within IDMC and IICS for compliance and security.
  • Managed metadata in PowerCenter, ensuring accurate data lineage and documentation.
  • Integrated data from AWS, Azure, and Google Cloud platforms using Informatica IICS connectors.
  • Implemented data synchronization and replication between IDMC and Informatica MDM for consistent master data.
  • Developed data quality scorecards in IDMC focused on master data reliability.
  • Leveraged IDMC/Customer 360 for data lineage and impact analysis.

Informatica MDM Developer

Lowes
Dallas, TX
02.2022 - 05.2023
  • Configured Informatica MDM Hub, including stage tables, base objects, match rules, trust scores, and hierarchies, to establish a central data repository.
  • Developed and executed SOAP and RESTful API calls (SIF, BES) for data manipulation and integration within Informatica MDM.
  • Designed and configured E360 dashboards and provisioning tools for business entity management and real-time data access.
  • Implemented data cleansing, standardization, and validation processes, including lookup tables, cleanse functions, and audit trails, resulting in a 10% improvement in data quality using DNB.
  • Defined and implemented data models, landing tables, staging tables, and data integration workflows within Informatica MDM, ensuring alignment with business requirements.
  • Analyzed business data, defined match/merge rules, and configured system trust to ensure data accuracy and integrity within the MDM Hub.
  • Utilized project management tools like Jira and Git/GitHub in Agile environments to manage development and deployment.
  • Configured user roles, groups, and privileges for secure access to MDM systems.
  • Implemented data lineage and impact analysis using AWS Glue Data Catalog and Crawlers, tracking data transformations and dependencies.
  • Led the migration of on-premises ETL workloads to AWS Glue, improving scalability and cost efficiency.
  • Developed and implemented data integration solutions using Informatica IICS, ensuring seamless data flow across various systems.
  • Implemented data quality processes within Informatica IICS to cleanse, validate, and standardize data for improved accuracy.
  • Designed and developed real-time data integration solutions using IICS and JSON based API calls.
  • Collaborated with business users, data architects, and development teams to define data models, requirements, and solutions.
  • Performed data profiling using Informatica Data Analyst and worked with the Data Profiling team to analyze source system data.
  • Configured source systems, landing tables, staging tables, and lookups, managing large data sets and complex relationships.
  • Worked with Java developers to implement custom code for SIF calls and User Exits.

Data Engineer

TCS
Hyderabad, India
08.2020 - 08.2021
  • Collaborated with business analysts to gather requirements and develop effective technology solutions.
  • Conducted feasibility studies and organized software requirements for structured implementation.
  • Implemented data loading processes for customer data into Informatica MDM from diverse sources.
  • Utilized Informatica BDM IDQ 10 for data ingestion and transformation between AWS S3 and Redshift.
  • Worked with ETL developers to create external batches and mappings for data integration from various sources into Informatica MDM.
  • Scheduled jobs and workflows in PowerCenter, creating mappings, tasks, sessions, and worklets for data processing.
  • Designed and created base objects, staging tables, mappings, and transformations based on business requirements.
  • Used Metadata Manager for repository management across development and testing environments.
  • Identified Golden Records (BVT) for customer data through analysis of duplicate records.
  • Scheduled shell scripts and Informatica jobs using Autosys.
  • Supported the testing team for data integrity and consistency.
  • Configured schema, landing, staging, base, and lookup tables, foreign-key relationships, packages, and queries.
  • Defined trust and validation rules, and configured match/merge rule sets for master records.
  • Configured match rule set properties for search by rules in MDM based on business rules.
  • Created unit test case, detailed design, supplementary, and knowledge transfer documentation.
  • Reviewed and discussed Security Access Management (SAM) for user roles and privileges.
  • Created data validation, unit test case, technical design, and Informatica migration request documentation.

Jr. Data Engineer

Ridhan Technologies
Hyderabad, India
09.2019 - 06.2020
  • Expertise in Informatica tools such as PowerCenter, MDM hub, Provisioning tool, IDQ, IDD, and databases.
  • Experience on data profiling & various data quality rules development & enhancement in production.
  • Support environment using Informatica data quality (IDQ).
  • Managed the respective jobs through Redwood. Tasks such as scheduling, verifying, tracking, and error checking. Used to coordinate with the development team to debug in case of any issues.
  • Part of the testing team for MDM upgrade. Involved in regression testing where created test cases while working closely with the development team and fool proofed the development.
  • Monitoring Batch jobs and Debugging and Reporting errors.
  • Worked on data cleansing using the cleanse functions in Informatica MDM.
  • Generating and publishing the reports to the downstream vendors on a weekly and monthly basis.
  • Working on the tickets by coordinating with Informatica GCS (global support) team.
  • Performed unit testing at various levels of the ETL. Debugging data rules, data mapping in IDQ.
  • Worked in an architecture that collects data from multiple upstream resources to parse and send downstream for storage and visualization.

Education

Master of Science - Information Technology

The University of Texas At Arlington
Arlington, TX
05-2023

Bachelor of Science - Computer Science

Jawaharlal Nehru Technological University-Hyd
04-2020

Skills

ETL tools: IICS, CDI, CAI, Informatica Power Center(10x), AWS Database

Migration Service, AWS Glue, Apache Airflow

Cloud technologies: AWS (Glue, S3, Redshift, Lambda,etc), Azure(Databricks,

Datalake, Microsoft Fabric) etc

RDBMS: Oracle, SQL Server, MySQL, AWS S3, Microsoft Azure SQL

Database

Job scheduling: IICS, Informatica Administrator, Control-M, Autosys

MDM packages: IDMC, Informatica MDM MultiDomain Edition 10X(C360,

S360, E360), AVOS, Informatica Data Director (IDD) 10X,

Informatica Data Quality (IDQ) 10X, DPM , EDC, AXON, JMS

Programming languages: Java, Python, PySpark, SQL, PL/SQL, React, HTML

Productivity tools: MS Office, MS Visio, TOAD, SQL Developer, MS SSMS

File transfer tools: MS Office, MS VISIO, TOAD, SQL-Developer, MS SSMS,

WinSCP, , Putty, File Zilla

Project management tools: JIRA, Service Now

Timeline

Informatica Developer

Hyundai Capital of America
10.2024 - 05.2025

Informatica Developer

Paycom Payroll LLC
05.2023 - 08.2024

Informatica MDM Developer

Lowes
02.2022 - 05.2023

Data Engineer

TCS
08.2020 - 08.2021

Jr. Data Engineer

Ridhan Technologies
09.2019 - 06.2020

Master of Science - Information Technology

The University of Texas At Arlington

Bachelor of Science - Computer Science

Jawaharlal Nehru Technological University-Hyd
Tarun K