Avinash Maddineni

Dallas, TX

Summary

Senior Data Engineer with 14+ years of experience designing and implementing data solutions across diverse industries, including Travel, Utilities, Healthcare, and Financial Services. Expertise in building and optimizing end-to-end data pipelines on cloud (AWS, GCP, Snowflake) and on-premises platforms, specializing in ETL development. Proficient in data engineering using Spark, Python, and CI/CD automation frameworks. Proven track record of integrating data management and governance functions such as data quality, lineage, metadata management, and regulatory compliance into scalable data architectures supporting analytics, reporting, and operational intelligence.

Work History

Senior Data Engineer

Expedia Group Inc

Dallas, TX

10.2017 - Current

Spearheaded migration of critical travel-related workloads from Teradata and Hadoop to AWS (S3, Redshift), optimizing for scalability, cost-efficiency, and high availability in a cloud-native environment.
Developed and maintained data pipelines using Python, SQL, and PySpark to ingest, transform, and analyze large-scale booking, search, and user interaction datasets across various travel platforms.
Tuned and optimized EMR and Spark clusters to handle high-volume data processing during peak travel seasons, reducing processing times by 40% while controlling infrastructure costs.
Automated deployment and testing workflows using GitHub Actions, implementing CI/CD pipelines that streamlined release processes and minimized production issues.
Established data quality frameworks, including validation checks, schema monitoring, and automated alerting, ensuring data integrity and minimizing downstream errors.
Conducted hands-on evaluations of Snowflake, Dremio, and Querybook, providing recommendations to enhance analytics and BI capabilities while supporting compliance with GDPR and CCPA standards.

Sr. Data Warehouse Engineer

Bank of America

Charlotte, NC

08.2015 - 09.2017

Designed and optimized large-scale ETL pipelines using Teradata and Informatica, processing securities, trade, and reconciliation data to support capital markets systems and ensure accurate financial reporting.
Migrated critical datasets from Teradata to Hadoop using Sqoop and Spark, enhancing scalability and enabling advanced analytics for securitization and regulatory reporting.
Led migration of financial data from Teradata to Netezza, refactoring legacy logic and optimizing complex queries to improve performance for high-volume financial transactions and reconciliations.
Designed and executed POC for data replication from on-prem Hadoop to Google Cloud Platform (GCP), utilizing Cloud Storage, BigQuery, and Dataflow to enable cloud-native reporting and real-time risk analytics.
Built foundational data pipelines on GCP using Pub/Sub, Cloud Dataflow, and BigQuery, enabling near-real-time trade monitoring and portfolio performance analysis.
Defined and implemented semantic layers and dimensional models to support business reporting and financial reconciliations across asset classes, ensuring consistency and accuracy in data analysis.

Sr. Data Warehouse Engineer

QSSI

Columbia, MD

07.2014 - 08.2015

Built and maintained high-volume healthcare data pipelines using Teradata, Hadoop, and Informatica to support marketplace applications like claims processing, eligibility, and provider matching.
Led migration of healthcare datasets from Teradata to Hadoop, utilizing Sqoop and Spark to enhance scalability and query performance for claims processing and risk analysis.
Developed a POC for replicating data from Hadoop to AWS, leveraging S3, Glue, and Redshift to ensure secure, compliant data transfer for healthcare applications and analytics.
Designed and optimized data transformation logic with Spark and AWS Lambda, enabling real-time data processing for marketplace analytics while ensuring regulatory compliance and data integrity.

Sr. ETL & Database Engineer

Southern California Edison

La Palma, CA

09.2012 - 07.2014

Designed and implemented end-to-end ETL workflows using Informatica PowerCenter and IBM DataStage, handling large-scale batch loads, data transformation, and orchestration across multiple business domains.
Optimized SQL, Teradata BTEQ, and TPT scripts, improving stored procedures, partitioned queries, and advanced analytics workloads, resulting in up to a 40% reduction in job runtimes.
Led a POC for Hadoop integration, utilizing Sqoop, Hive, and Spark to migrate legacy workloads from Teradata, setting the foundation for big data platform adoption.
Automated ETL deployments and job scheduling through Git, Control-M, and Autosys, while implementing data quality checks and audit trails to ensure compliance and consistency across data processing pipelines.

Education

Master of Science - Computer Science

University of Illinois

Springfield, IL

04.2011

Bachelor of Science - Information Technology

Vignan’s Engineering College

India

05.2009

Skills

Cloud & Modern Data Platforms

AWS (S3, EC2, Lambda, CloudWatch, VPC, SNS)
Snowflake, IBM Netezza, Hadoop (Hive, Sqoop, HDFS)
Apache Spark, Apache Airflow

ETL & Data Integration

Informatica (v8.x – v10.x), IBM DataStage
Teradata (v14 – v16.10), Oracle (10g – 12c), MS SQL Server

CI/CD & Workflow Orchestration

Jenkins, Git, GitHub Actions

Scripting & Automation

Python, Shell Scripting (Unix/Linux), BTEQ, TPT

Data Modeling & Governance

CA Erwin, Dimensional Modeling, Semantic Layer Design
Metadata Management, Data Quality & Reconciliation

Analytics & BI Tools

Tableau, Looker, SAP BusinessObjects

Websites

https://www.linkedin.com/in/avinash-maddineni/

Certification

Amazon Web Services Solutions Architect Associate, 02/01/20, NKWX60C2FN1Q13CV

Accomplishments

Designed and implemented the Net Flown Revenue and Repeat Customer Model for the Air business, unlocking $7M+ in incremental revenue through advanced customer behavior analytics.
Prototyped a dynamic GDS Contract Optimization Model, enabling revenue maximization from global distribution systems starting in 2019.
Identified and mitigated $4M in refund risk exposure due to duplicate payouts for COVID-related ticket cancellations, preventing substantial financial leakage.
Engineered a real-time Smart Meter Rules Engine pipeline, allowing business users to configure and trigger reporting logic, reducing reporting latency from 24 hours to under 30 minutes (~95% improvement).
Developed an automated memory allocation mechanism tied to user registration traffic during open enrollment, increasing system availability by 20% during peak usage periods.
