Avinash Maddineni

Dallas, TX

Summary

Senior Data Engineer with 14+ years of experience designing and implementing data solutions across diverse industries, including Travel, Utilities, Healthcare, and Financial Services. Expertise in building and optimizing end-to-end data pipelines on cloud (AWS, GCP, Snowflake) and on-premise platforms, specializing in ETL development. Proficient in data engineering using Spark, Python, and CI/CD automation frameworks. Proven track record of integrating data management and governance functions like data quality, lineage, metadata management, and regulatory compliance into scalable data architectures supporting analytics, reporting, and operational intelligence.

Overview

13 years of professional experience
1 Certification

Work History

Senior Data Engineer

Expedia Group Inc
10.2017 - Current
  • Spearheaded migration of critical travel-related workloads from Teradata and Hadoop to AWS (S3, Redshift), optimizing for scalability, cost-efficiency, and high availability in a cloud-native environment.
  • Developed and maintained data pipelines using Python, SQL, and PySpark to ingest, transform, and analyze large-scale booking, search, and user interaction datasets across various travel platforms.
  • Tuned and optimized EMR and Spark clusters to handle high-volume data processing during peak travel seasons, reducing processing times by 40% while controlling infrastructure costs.
  • Automated deployment and testing workflows using GitHub Actions, implementing CI/CD pipelines that streamlined release processes and minimized production issues.
  • Established data quality frameworks, including validation checks, schema monitoring, and automated alerting, ensuring data integrity and minimizing downstream errors.
  • Conducted hands-on evaluations of Snowflake, Dremio, and Querybook, providing recommendations to enhance analytics and BI capabilities while supporting compliance with GDPR and CCPA standards.

Sr. Data Warehouse Engineer

Bank of America
08.2015 - 09.2017
  • Designed and optimized large-scale ETL pipelines using Teradata and Informatica, processing securities, trade, and reconciliation data to support capital markets systems and ensure accurate financial reporting.
  • Migrated critical datasets from Teradata to Hadoop using Sqoop and Spark, enhancing scalability and enabling advanced analytics for securitization and regulatory reporting.
  • Led migration of financial data from Teradata to Netezza, refactoring legacy logic and optimizing complex queries to improve performance for high-volume financial transactions and reconciliations.
  • Designed and executed a POC for data replication from on-prem Hadoop to Google Cloud Platform (GCP), utilizing Cloud Storage, BigQuery, and Dataflow to enable cloud-native reporting and real-time risk analytics.
  • Built foundational data pipelines on GCP using Pub/Sub, Cloud Dataflow, and BigQuery, enabling near-real-time trade monitoring and portfolio performance analysis.
  • Defined and implemented semantic layers and dimensional models to support business reporting and financial reconciliations across asset classes, ensuring consistency and accuracy in data analysis.

Sr. Data Warehouse Engineer

QSSI
07.2014 - 08.2015
  • Built and maintained high-volume healthcare data pipelines using Teradata, Hadoop, and Informatica to support marketplace applications like claims processing, eligibility, and provider matching.
  • Led migration of healthcare datasets from Teradata to Hadoop, utilizing Sqoop and Spark to enhance scalability and query performance for claims processing and risk analysis.
  • Developed a POC for replicating data from Hadoop to AWS, leveraging S3, Glue, and Redshift to ensure secure, compliant data transfer for healthcare applications and analytics.
  • Designed and optimized data transformation logic with Spark and AWS Lambda, enabling real-time data processing for marketplace analytics while ensuring regulatory compliance and data integrity.

Sr. ETL & Database Engineer

Southern California Edison
09.2012 - 07.2014
  • Designed and implemented end-to-end ETL workflows using Informatica PowerCenter and IBM DataStage, handling large-scale batch loads, data transformation, and orchestration across multiple business domains.
  • Optimized SQL, Teradata BTEQ, and TPT scripts, improving stored procedures, partitioned queries, and advanced analytics workloads, resulting in up to a 40% reduction in job runtimes.
  • Led a POC for Hadoop integration, utilizing Sqoop, Hive, and Spark to migrate legacy workloads from Teradata, setting the foundation for big data platform adoption.
  • Automated ETL deployments and job scheduling through Git, Control-M, and Autosys, while implementing data quality checks and audit trails to ensure compliance and consistency across data processing pipelines.

Education

Master of Science - Computer Science

University of Illinois
Springfield, IL
04.2011

Bachelor of Science - Information Technology

Vignan’s Engineering College
India
05.2009

Skills

    Cloud & Modern Data Platforms

  • AWS (S3, EC2, Lambda, CloudWatch, VPC, SNS)
  • Snowflake, IBM Netezza, Hadoop (Hive, Sqoop, HDFS)
  • Apache Spark, Apache Airflow

    ETL & Data Integration

  • Informatica (v8.x–v10.x), IBM DataStage
  • Teradata (v14–v16.10), Oracle (10g–12c), MS SQL Server

    CI/CD & Workflow Orchestration

  • Jenkins, Git, GitHub Actions

    Scripting & Automation

  • Python, Shell Scripting (Unix/Linux), BTEQ, TPT

    Data Modeling & Governance

  • CA Erwin, Dimensional Modeling, Semantic Layer Design
  • Metadata Management, Data Quality & Reconciliation

    Analytics & BI Tools

  • Tableau, Looker, SAP BusinessObjects

Certification

Amazon Web Services Solutions Architect Associate, 02/01/20, NKWX60C2FN1Q13CV

Accomplishments

  • Designed and implemented the Net Flown Revenue and Repeat Customer Model for the Air business, unlocking $7M+ in incremental revenue through advanced customer behavior analytics.
  • Prototyped a dynamic GDS Contract Optimization Model, enabling revenue maximization from global distribution systems starting in 2019.
  • Identified and mitigated $4M in refund risk exposure due to duplicate payouts for COVID-related ticket cancellations, preventing substantial financial leakage.
  • Engineered a real-time Smart Meter Rules Engine pipeline, allowing business users to configure and trigger reporting logic, reducing reporting latency from 24 hours to under 30 minutes (a reduction of roughly 98%).
  • Developed an automated memory allocation mechanism tied to user registration traffic during open enrollment, increasing system availability by 20% during peak usage periods.
