Summary
Overview
Work History
Education
Skills
Software
Timeline
Generic

Priyanka Kumari

Rowlett,Texas

Summary

AWS Data Engineer with 5 years of experience managing RxClaim adjudication data within CVS pharmacy benefits management. Skilled in designing and implementing end-to-end data ingestion pipelines using AWS S3, AWS Glue, and PySpark, along with developing insights through data analysis and visualization. Adept at collaborating with cross-functional teams to drive data-driven decisions that optimize pharmacy operations and reduce costs.

Overview

8
8
years of professional experience

Work History

AWS Data Engineer – RxClaim Adjudication

Client: CVS Health
11.2019 - Current

Key Responsibilities and Contributions


  • End-to-End Data Ingestion
  • Designed automated data pipelines to ingest large volumes of RxClaim adjudication data into Amazon S3, leveraging AWS Glue Crawlers to catalog schemas.
  • Configured AWS DMS for continuous replication of on-premise databases, ensuring timely updates of claims records in the data lake.
  • ETL & Data Transformation
  • Developed and scheduled AWS Glue Jobs using PySpark to cleanse, enrich, and transform raw claims data into optimized Parquet files.
  • Applied partitioning strategies (e.g., by date or claim type) to improve query performance and reduce costs.
  • Analysis & Insight Generation
  • Utilized Spark SQL to run complex joins and aggregations, uncovering trends in prescription usage, cost patterns, and seasonal fluctuations.
  • Collaborated with actuarial and finance teams to create predictive models for cost forecasting, leveraging large-scale claim history.
  • Visualization & Reporting
  • Integrated Amazon QuickSight with S3 and Athena to build interactive dashboards tracking claim volume, cost breakdowns, and adjudication SLAs.
  • Provided self-service analytics to non-technical stakeholders, reducing ad-hoc report turnaround times by 40%.
  • Security & Compliance
  • Ensured HIPAA and PHI compliance by encrypting S3 data at rest and restricting access via IAM policies and VPC endpoints.
  • Collaborated with InfoSec teams to implement robust monitoring and logging (CloudTrail, CloudWatch), keeping a full audit trail of data access.
  • Notable Achievements
  • Reduced ETL runtime by 30% through optimized Spark job configurations and effective data partitioning.
  • Saved 25% on storage costs by transitioning historical claim data to S3 Glacier and adopting columnar formats.
  • Led a cross-functional initiative that significantly improved accuracy in claims reporting, supporting key business decisions on cost management.

Data Researcher

University of Texas Dallas
01.2017 - 12.2018
  • Designed and implemented data engineering pipeline to automate data processing tasks

Education

Master of Science - Bioinformatics (Computer Science Track)

University of Texas At Dallas
Richardson, TX
12-2018

Skills

  • AWS Cloud Services: S3, Glue, Lambda, EC2, IAM, Athena
  • Data Pipeline & ETL: PySpark, Spark SQL, Glue Jobs, AWS Database Migration Service (DMS)
  • Pharmacy Domain Knowledge: RxClaim adjudication workflows, NCPDP transaction formats, PHI/HIPAA compliance
  • Data Modeling & Architecture: Designing scalable schemas, partitioning, and performance optimization
  • Analytics & Visualization: Amazon QuickSight, Tableau, Python libraries (pandas, NumPy, Matplotlib, Seaborn)
  • Programming & Scripting: Python, SQL, Shell Scripting
  • Collaboration & Leadership: Agile methodologies, cross-functional team coordination, stakeholder communication

Software

Programming Languages: Python, SQL, R, PySpark

Big Data Technologies: Apache Spark, Hive

Databases: PostgreSQL, MySQL, IBM DB2

Cloud Platforms : AWS Glue, Amazon EMR , Amazon Quick Sight, AWS Lambda, Amazon CloudWatch ,AWS IAM

Machine Learning: Regression, Classification, Clustering, Time Series Analysis

Timeline

AWS Data Engineer – RxClaim Adjudication

Client: CVS Health
11.2019 - Current

Data Researcher

University of Texas Dallas
01.2017 - 12.2018

Master of Science - Bioinformatics (Computer Science Track)

University of Texas At Dallas
Priyanka Kumari