Arun Kumar

Summary

Senior Data Quality Engineer with 8+ years of experience across healthcare and banking domains, specializing in cloud-based data platforms, data engineering validation, and analytics readiness. Experienced in supporting enterprise-scale data pipelines in highly regulated environments, ensuring the accuracy, completeness, and trustworthiness of critical business data. Hands-on experience with Azure Data Factory, Databricks, ADLS Gen2, Snowflake, SQL, Python, and PySpark, with strong domain expertise in healthcare data (claims, enrollment, provider, member, EDI 834/837) and financial/banking data (data migrations, reconciliations, reporting, and regulatory datasets). Known for partnering closely with data engineers, product owners, and business stakeholders to identify data issues early, prevent production defects, and deliver reliable, analytics-ready data for decision-making.

Overview

13
years of professional experience

Work History

Senior Data Quality Engineer / QA Engineer – Data

UCare
Minneapolis, Minnesota
07.2019 - Current
  • Serve as the primary data quality and validation partner for cloud-based healthcare data platforms, supporting Claims, Enrollment, Provider, Member, and Encounter data.
  • Validate CMS 834 enrollment and EDI 837 claims data, ensuring accurate eligibility, PCP assignment, member effective dates, and coverage logic prior to downstream reporting and analytics.
  • Work closely with data engineers to validate end-to-end ETL/ELT pipelines built on Azure Data Factory, Databricks, ADLS Gen2, Azure SQL, Snowflake, and Synapse.
  • Develop and execute Python- and PySpark-based validation logic in Databricks notebooks to perform record-level reconciliation, schema validation, and data completeness checks.
  • Perform data ingestion and transformation testing for flat files, JSON, HL7, and EDI formats across batch and API-driven pipelines.
  • Validate REST APIs using Postman and Swagger, covering authentication, response schemas, pagination, and error handling.
  • Support production issue triage by reproducing defects, analyzing root causes, and collaborating with engineering teams to reduce recurrence and improve data reliability.
  • Design and execute data quality checks focused on completeness, duplicates, referential integrity, and business rule enforcement.
  • Support UAT and regression testing in staging environments, ensuring new releases meet business and regulatory expectations.

Environment: Azure Data Factory, Azure Databricks, ADLS Gen2, Azure SQL, Snowflake, Synapse, Informatica, dbt, Airflow, Python, PySpark, SQL, Postman, REST APIs, Azure DevOps, Git, Jenkins, Control-M, Power BI, Tableau, HL7, EDI 834/837.

Senior QA Engineer

State of Minnesota - DHS
St Paul, USA
07.2018 - 05.2019
  • Performed system, integration, and regression testing across multiple state healthcare and human services applications.
  • Executed backend data validation using complex SQL queries involving multi-table joins, views, and stored procedures.
  • Supported ETL testing and data migration validation during system modernization initiatives.
  • Conducted Section 508/WCAG accessibility testing using JAWS and web accessibility tools.
  • Collaborated with cross-functional teams to improve test coverage, defect tracking, and delivery quality.

Environment: SQL Server, Informatica, HTML, CSS, JavaScript, JAWS, RTC, Visual Studio, Windows, WCAG 2.0.

QA Engineer – Data & ETL

US Bank
Minneapolis, USA
07.2015 - 06.2017
  • Validated ETL workflows and data transformations using Informatica PowerCenter for enterprise financial systems.
  • Performed backend data validation using advanced SQL and PL/SQL to verify transformation logic and reconciliation rules.
  • Supported data migration and reporting validation across SQL Server, Teradata, and Oracle environments.
  • Partnered with stakeholders across SDLC phases to ensure data accuracy and reporting consistency.

Environment: Informatica PowerCenter, SQL Server, Teradata, Oracle, MicroStrategy, UNIX, HP ALM.

QA Analyst

UnitedHealth Group
Eden Prairie, USA
05.2013 - 06.2015
  • Performed functional, integration, and UAT testing for healthcare enrollment and claims processing systems.
  • Executed manual and automated testing for software applications.
  • Validated HIPAA-compliant EDI 834 enrollment data and backend SQL databases.
  • Executed SQL-based data validation to ensure data accuracy and integrity across systems.

Environment: SQL, Oracle, EDI 834, HIPAA, Quality Center, JIRA.

Education

Master of Science - Information Technology

Concordia University
Saint Paul, MN

Skills

Cloud and data platforms

  • Azure Data Factory (ADF)
  • Azure Databricks
  • Azure Data Lake Storage Gen2 (ADLS Gen2)
  • Azure SQL, Synapse Analytics
  • Snowflake
  • Informatica

Data engineering and analytics

  • ETL / ELT pipeline validation
  • Data ingestion and transformation
  • Lakehouse and data lake architectures
  • Schema validation and schema drift handling
  • Delta processing, reconciliation, and balancing

Data quality and governance

  • Data quality frameworks and controls
  • Referential integrity, completeness, duplicates
  • Business rule and coverage validation
  • Healthcare data (claims, enrollment, provider, member)
  • EDI 834, 837, HL7, JSON, CSV, and flat files

Programming and querying

  • Python
  • PySpark
  • SQL (CTEs, window functions, stored procedures)

Orchestration, DevOps, and analytics tools

  • dbt, Airflow
  • Azure DevOps (Boards, Test Plans)
  • Git, Jenkins, Control-M
  • Postman, REST API Testing
  • Power BI, Tableau, SSRS
