Summary
Overview
Work History
Education
Skills
Certification
Timeline
Awards and Highlights
Research Publications and Professional Online Works
Generic
Manoj Kuppam

Manoj Kuppam

Frisco

Summary

  • Highly accomplished Site Reliability Engineering (SRE) leader with proven track record of designing and implementing reliability frameworks at Fortune 100 companies driving significant improvements in system performance and incident metrics (TTx) driven by cost efficient Observability frameworks.
  • Author of the book “Enterprise Digital Reliability” – Amazon/Apress-Springer and reliability framework models and methods published in prestigious IEEE and the world largest digital library ACM (American Computing Machinery).

Overview

2026
2026
years of professional experience
1
1
Certification

Work History

Sr VP of SRE

JPMorgan Chase
2024 - Current

Leading enterprise SRE transformation for systems supporting 80M+ customers and 6M+ small businesses across 4,800 branches

Key Objectives:

  • Leadership and Strategy (Defining SRE strategy, quarterly and annual planning, live site reviews, resolve conflicts)
  • Observability (YBIYRI framework, Rel360 – CUJ, AIOps – Z-score, Davis AI, toil reduction)
  • Resiliency Improvement (Architecture reviews, FMEA)
  • GT SRE (Mentoring and Training global SRE talent pool)

Landings:

  • Architected and implemented unified observability platform providing single-pane-of-glass visibility across infrastructure, applications, network, and business KPIs, reducing MTTD by 45% and MTTR by 38%
  • Led implementation of AI-driven observability using Dynatrace Davis AI for predictive anomaly detection, reducing critical incidents by 30% through thematic analysis and proactive identification
  • Productivity improvement by automating the routine tasks and reducing toil – automated the branch resiliency data and alerting timely to identify branch teller issues upto 4 hours ahead with auto-detection.
  • Led and build YBIYRI framework for consumer business and core banking business supporting the observability improvement and Mission Control resiliency assessment through FMEA assessments.
  • Gamified CUJs using scoring frameworks to train global talent pool on the 6 dimensions of reliability engineering – Observability, Optimal Alerting, Test coverage, Quality release, Idempotency, Incident Management

Sr. SRE Lead

Medline.com
01.2021 - 01.2024

Established and scaled enterprise SRE practice from ground up for $34B healthcare leader's digital transformation

Key Objectives:

  • SRE Strategy (Establish SRE CoE)
  • Observability Implementation (Ecommerce modernization program – MK SRE Scoring Framework, end-end monitoring, Finops – cost optimization and tool consolidation)
  • Resiliency Improvement (Architecture reviews, SRE Runbooks)

Landings:

  • Built SRE practice from scratch, implementing comprehensive Observability across Azure cloud, Kubernetes clusters, and legacy on-premise systems
  • Created award-winning MK Scoring Framework for observability maturity assessment, adopted as industry best practice and winning 2023 International Titan Gold Award
  • Optimized observability costs by 40% through intelligent sampling, data retention policies, and vendor consolidation while improving coverage by 60%
  • Implemented distributed tracing across microservices architecture, reducing cross-service debugging time from hours to minutes
  • Led team of 15 engineers in observability tool implementation, custom instrumentation development, and runbook automation

Enterprise Monitoring Engineer

Volkswagen Credit Inc
2014 - 2021

Transformed enterprise monitoring from reactive log-based approach to proactive full-stack observability platform

Key Objectives:

  • Head of Observability (Modernize monitoring, Implement APM solutions, Vendor selection, Application onboarding, Build unified observability)

Landings:

  • Led APM tool evaluation and implementation, selecting and deploying AppDynamics across all enterprise applications, achieving 360-degree visibility
  • Pioneered Method Invocation Data Collectors (MIDC) framework for custom telemetry, presented at Cisco AppDynamics Conference to 100+ Fortune 500 companies
  • Implemented end-to-end observability for connected car platform, providing real-time insights into API performance and user experience
  • Deployed Splunk SignalFx for infrastructure monitoring and ThousandEyes for network observability, creating unified observability ecosystem
  • Achieved 100% KPI visibility for executive dashboards, enabling data-driven decision making during critical COVID-19 period

Programmer - Technical Program Manager

Infosys
2005 - 2014
  • Designed and led the Salesforce Automation program for customer Kellogg’s moving from paper binder to tablet based route planning and shelf inventory planning cutting down time to market from 2 weeks to 2 days.
  • Solution Architecture, Requirements Elicitation, High Level Design, Project and Conflict management, and Finops.
  • Building Technical work products.

Education

Bachelor of Science - Electronics And Electrical Engineering

JNTU, Hyderabad
Hydrabad, India
05-2005

Skills

  • SRE: Reliability GQM methods, FMEA, RCA and Blameless
  • Observability – AppDynamics, Splunk, Foglight, Orion, Dynatrace, Thousand eyes and many more
  • Azure – Monitor, AKS, AIS, ADF
  • AWS – ECS, ECR, EKS, Cloud Watch, EC2, S3, EBS, ASG, Route53, RDS, Cloud Front, etc
  • Scripting - PowerShell, JS, Bash, Groovy
  • Microsoft Framework - Dot Net, VBNet, TFS, IIS, Release Manager
  • Backend - SQL, PL/SQL, SSIS/SSRS, Data Modeling, Informatica

Certification

  • Splunk Power User Certified - 2025
  • AWS Solution Architect Associate - 2021
  • AppDynamics Performance Analyst - 2020
  • AWS Cloud Practitioner - 2020
  • Project Management Elite – Infosys - 2014
  • HP IT Governance Specialist - 2007
  • Competent Communicator Leader – Toastmasters International – 2018

Timeline

Sr. SRE Lead

Medline.com
01.2021 - 01.2024

Sr VP of SRE

JPMorgan Chase
2024 - Current

Enterprise Monitoring Engineer

Volkswagen Credit Inc
2014 - 2021

Programmer - Technical Program Manager

Infosys
2005 - 2014

Bachelor of Science - Electronics And Electrical Engineering

JNTU, Hyderabad

Awards and Highlights

Gold winner for Best Technical Strategy for IT implementation - Intl. Titan Awards 2023.

Finalist in Intl. Devops Excellence Awards 2024.

National level IT Aptitude 2004 – Top 1% certified.

Most Valuable Player (MVP) with Infosys 2009.

Top 1% coach in SRE with TopMate and GMI.

LinkedIn Top Voice in Team Leadership and IT Ops.

Research Publications and Professional Online Works

  • Google Scholar - https://scholar.google.com/citations?user=sA8dE8AAAAJ&hl=en
  • ResearchGate - https://www.researchgate.net/profile/Manoj-Kuppam/research
  • LinkedIn - https://www.linkedin.com/in/manojkuppam/
Manoj Kuppam