Summary
Overview
Work History
Education
Skills
Personal Information
Ai Ml Exposure
Core Competencies
Accomplishments
Timeline
Generic

Kingsly David Abraham

Bentonville,AR

Summary

Dynamic data engineering leader with a proven track record at Walmart Inc., driving cost optimization and predictive analytics. Expert in Spark and GCP, achieving a 30% reduction in infrastructure costs. Adept at fostering collaboration and innovation, leveraging strong problem-solving skills to deliver impactful data solutions. Passionate about transforming data into strategic insights.

Overview

11
11
years of professional experience

Work History

Team Lead – Total Cost of Ownership (TCO)

Walmart Inc.
Bentonville, US
03.2024 - Current
  • Leading data engineering efforts for Walmart’s Total Cost of Ownership (TCO) platform, designed to calculate and optimize cost across supply chain, logistics, and retail operations.
  • Built scalable Spark and BigQuery pipelines to process multi-source cost data (inventory, transportation, vendor contracts) enabling real-time cost insights.
  • Partnered with finance and operations stakeholders to design cost models and integrate predictive analytics for cost forecasting.
  • Leveraged Vibe Coding (AI-assisted development) to accelerate pipeline development, improve efficiency, and reduce delivery timelines.
  • Optimized GCP resource utilization, reducing overall infrastructure cost by 25% while maintaining SLA performance.
  • Established data quality frameworks and audit checks to ensure accuracy of cost reporting for executive decision-making.

Team Lead – Finance DataLake

Walmart Inc.
Bentonville, US
09.2020 - 02.2024
  • Led a 10-member engineering team delivering Walmart’s Finance DataLake used for quarterly earnings reports to Wall Street.
  • Designed and optimized Spark pipelines, reducing GCP cluster costs by 30% through performance tuning and reusable components.
  • Partnered with business teams to integrate ML-based anomaly detection for POS data to improve fraud/risk insights.
  • Spearheaded the on-prem to GCP migration, ensuring secure, scalable, and cost-efficient data infrastructure.
  • Leveraged Vibe Coding (AI-assisted development) to accelerate Spark pipeline development, reduce boilerplate coding, and enhance developer productivity.
  • Built reusable frameworks for ingestion, data quality checks, and ML model orchestration.

Big Data Engineer – Retail Data Analysis (Walmart)

Miracle Software Systems Inc.
Bentonville, US
07.2019 - 09.2020
  • Designed and maintained ETL pipelines processing multi-TB datasets daily.
  • Implemented Spark jobs and Hive queries for business transformations and retail analytics dashboards.
  • Migrated structured data via Sqoop to Teradata for downstream BI analysis.
  • Collaborated with analysts to support AI-driven insights via ThoughtSpot and GDP portal.
  • Managed version control and CI/CD workflows using Git and Automic.

Hadoop Developer – Northern Trust Inc.

Hexaware Technologies Ltd.
Chennai, India
04.2017 - 07.2019
  • Built Spark SQL pipelines to process data and integrate with Oracle EDW.
  • Developed APIs to expose processed datasets for customer applications.
  • Implemented Sqoop for bi-directional data movement between Hadoop and Oracle.
  • Improved system efficiency by performance-tuning Impala and Hive queries.

Project: Text Analytics for Global Education Firm

08.2015 - 03.2017
  • Developed NLP-based plagiarism detection system using Spark, Hadoop, and Apache Tika.
  • Implemented n-gram models and Jaccard similarity coefficient for multi-language text comparison.
  • Deployed Hadoop clusters on AWS EC2 with Ambari for scalable NLP workloads.
  • Integrated ML pipelines for document similarity and reporting with Hive.

Project: Log Analytics – Global Banking Client

04.2014 - 07.2015
  • Designed AI-powered log analytics platform for anomaly detection in web/IIS server logs.
  • Used Flume for real-time ingestion, Pig/Hive for preprocessing, and ElasticSearch for visualization.
  • Automated system monitoring, improving incident detection time by 40%.

Education

B.E. - Information Technology

SRM Valliammai Engineering College
Chennai, TN
01.2013

Skills

  • Hadoop ecosystem tools
  • Data processing frameworks
  • Python and Scala programming
  • Cloud platforms and services
  • Database management systems
  • Data visualization techniques
  • Version control with Git
  • Workflow orchestration with Airflow

Personal Information

Work Permit: Authorized to work in US (H1B)

Ai Ml Exposure

  • Integrated ML models into Spark & GCP pipelines for anomaly detection and forecasting.
  • Developed NLP applications for plagiarism detection and log analytics.
  • Knowledge of Vertex AI, ML orchestration with Airflow, and AI-assisted development using Vibe Coding.

Core Competencies

Hadoop, Spark (Scala/Python), Hive, Impala, Sqoop, Flume, HDFS, Hortonworks, Cloudera, Dataproc, BigQuery, Google Cloud (GCP), AWS (exposure), Vertex AI (exposure), NLP, anomaly detection, sales forecasting integration, Vibe Coding (AI-assisted development), Airflow, Automic, YARN, Teradata, Oracle, SQL, ThoughtSpot, Git, Ambari, Hue, JIRA, Linux, Shell scripting

Accomplishments

  • For cost optimization
  • Designing framework for Total Cost of Ownership

Timeline

Team Lead – Total Cost of Ownership (TCO)

Walmart Inc.
03.2024 - Current

Team Lead – Finance DataLake

Walmart Inc.
09.2020 - 02.2024

Big Data Engineer – Retail Data Analysis (Walmart)

Miracle Software Systems Inc.
07.2019 - 09.2020

Hadoop Developer – Northern Trust Inc.

Hexaware Technologies Ltd.
04.2017 - 07.2019

Project: Text Analytics for Global Education Firm

08.2015 - 03.2017

Project: Log Analytics – Global Banking Client

04.2014 - 07.2015

B.E. - Information Technology

SRM Valliammai Engineering College