Supreetha Thedlapally

Summary

Results-driven Senior Cloud & SRE Engineer with 13 years of experience building, scaling, and operating enterprise-grade cloud infrastructure on AWS. Proven ability to design robust, secure, and highly available systems, from Kubernetes-native microservices and custom operators to Terraform-automated infrastructure and zero-trust network architectures. Deep expertise in embedding SRE principles (SLIs, SLOs, error budgets, and automated incident response) into the full software development lifecycle. Experienced in operating in regulated environments (HIPAA, FedRAMP) where security, compliance, and uptime are non-negotiable. Adept at leading cross-functional engineering initiatives, mentoring junior talent, and driving continuous reliability and cost-efficiency improvements. Passionate about building scalable, automated cloud solutions that teams can depend on.

Overview

14 years of professional experience
1 Certification

Work History

Sr. Cloud DevOps Engineer

Centers for Medicare & Medicaid Services
GA
02.2025 - Current
  • Architected and delivered secure, scalable, and highly available AWS cloud platforms (EC2, ECS, S3, RDS, EKS, VPC, Lambda, IAM, KMS, API Gateway, SAML 2.0, Redshift) supporting large-scale production workloads with strict uptime, security, and cost-efficiency targets.
  • Built end-to-end Infrastructure as Code using Terraform to provision VPCs, subnets, routing, security groups, NACLs, VPC peering, and private endpoints, reducing manual provisioning by 40% and improving deployment consistency and recovery times.
  • Engineered secure, scalable AWS networking architectures including Transit Gateways, Site-to-Site VPN, Hub-and-Spoke VPCs, NAT/Internet Gateways, OpenVPN, and VPC endpoints (gateway and interface).
  • Designed and implemented cloud infrastructure security and compliance controls across AWS, including RBAC, MFA, SAML 2.0 SSO, IAM policies, KMS encryption, and network isolation, while partnering with security and compliance teams to enforce governance frameworks aligned with SOC 2, ISO 27001, HIPAA, and FedRAMP requirements.
  • Hardened and optimized NGINX as a reverse proxy (TLS termination, routing, caching, and security controls), deploying services under non-root users to enforce least-privilege security standards.
  • Designed and implemented Auto-Scaling policies based on real-time traffic patterns and performance metrics, improving service availability and reducing compute costs.
  • Led root-cause analysis and remediation of complex identity and access issues spanning AWS services, federated IdPs, and CI/CD pipelines, improving platform reliability and security posture.
  • Deployed Datadog APM and logging for ECS workloads using agent instrumentation and ECS-native tagging, improving microservice observability and transaction tracing and reducing mean time to resolution (MTTR).
  • Built centralized observability with Amazon CloudWatch dashboards and Splunk ingestion (CloudTrail, VPC Flow Logs, ALB/ELB, EC2, ECS, Lambda, AWS Glue), enabling faster detection of latency, saturation, and systemic failures.
  • Designed and implemented Apache Iceberg tables for scalable, ACID-compliant Lakehouse storage, exposed as unmanaged tables in Snowflake, optimizing table design to enable 50% faster query performance for downstream analytics teams.
  • Centralized secrets management with AWS Secrets Manager and SSM Parameter Store, eliminating plaintext credentials and strengthening auditability and compliance.
  • Applied SRE principles by defining SLIs/SLOs, managing error budgets, and driving reliability improvements using AWS-native and third-party observability platforms.
  • Delivered zero-downtime Blue-Green deployments using load balancers and automated traffic shifting, minimizing release risk and enabling rapid rollback during production incidents.
  • Led GitFlow-based branching and release workflows, coordinating feature development, integration, and production promotion across teams.
  • Partnered with Product, QA, and Platform teams on release planning and execution within Agile Scrum environments, leveraging strong understanding of SDLC processes to deliver on-time, low-risk production deployments.
  • Sparksoft Corporation

Sr. Cloud DevOps Engineer

Fannie Mae
Atlanta, GA
03.2021 - 01.2025
  • Architected, automated, and operated production-grade, multi-account/multi-region AWS environments using Terraform and CloudFormation, provisioning VPCs, subnets, EC2, ECS, Fargate, EKS, S3, RDS, Lambda, ELB, and secure networking components to deliver highly available, scalable, and secure platforms while reducing manual configuration errors by 50% through repeatable Infrastructure-as-Code deployments.
  • Designed and maintained CI/CD pipelines with Jenkins, Git, Docker, and ECR to automate builds and deployments to EKS, enabling rolling and blue-green releases with 99.9% uptime and 40% fewer deployment incidents.
  • Designed and operationalized enterprise-wide observability strategy leveraging multi-tool telemetry (CloudWatch, Prometheus, Grafana, Datadog, Splunk, Dynatrace, ELK), cutting MTTD by 45% and enabling faster, data-driven root-cause analysis.
  • Engineered secure, low-latency AWS networking using Transit Gateways, VPC peering, VPC endpoints, and private connectivity, supporting reliable service-to-service communication at scale.
  • Led end-to-end database migrations using AWS Database Migration Service (DMS), enabling near-zero downtime cutover from on-prem/legacy systems to AWS RDS/Aurora environments.
  • Applied SRE practices by defining SLIs/SLOs/SLAs and implementing real-time SLA tracking and automated incident workflows with ServiceNow, PagerDuty, and Slack, improving uptime and customer satisfaction.
  • Designed and shipped Kubernetes controllers with Custom Resource Definitions (CRDs) to automate custom resource lifecycles and enforce desired state, standardizing deployments via Helm and reducing configuration drift across environments.
  • Designed and implemented serverless ETL pipelines using AWS Glue (Spark), Crawlers, and S3 to ingest and curate structured and semi-structured data, enabling schema evolution with minimal operational overhead.
  • Deployed and operated Airflow in containerized cloud environments, integrating CI/CD pipelines for DAG versioning, automated validation, and controlled, rollback-safe releases across environments.
  • Led end-to-end vulnerability management by leveraging Rapid7 for infrastructure and endpoint security and Fortify (SAST) with DAST/SCA tools for application security, integrating automated security scans into AWS CI/CD pipelines (via Terraform) to enforce pre-release compliance and ensure secure, policy-driven deployments.
  • Collaborated with security and compliance teams to implement AWS security controls aligned with SOC 2 and ISO 27001, ensuring cloud workloads met enterprise governance and regulatory requirements.
  • Drove cloud cost optimization by analyzing AWS billing data, rightsizing underutilized resources, and implementing governance controls, achieving up to 30% savings in cloud spend.
  • Led end-to-end release planning across Dev, QA, UAT, and Production environments, defining scope, risk mitigation, rollback strategies, and deployment timelines.
  • Led brown-bag sessions and code reviews to share knowledge and maintain code quality across teams.

Cloud Engineer/DevOps Engineer

State of Georgia (Deloitte)
Atlanta, GA
08.2015 - 02.2021
  • Led end-to-end migration from managed hosting to AWS, owning service architecture, network design, data migration, automation, monitoring, deployments, cutover strategy, cost modeling, and delivery timelines.
  • Designed and provisioned secure, highly available AWS environments (VPCs, public/private subnets, NAT Gateways, bastion hosts, route tables, NACLs, security groups, ELB/ALB) distributed across multiple Availability Zones.
  • Built and operated CI/CD pipelines using Git, Jenkins, SonarQube, Nexus, and Ansible, automating deployments across Dev, QA, UAT, and Production to improve release reliability and reduce manual effort.
  • Designed Infrastructure as Code using AWS CloudFormation to automate instance provisioning and environment setup, accelerating sprint delivery and ensuring repeatable infrastructure.
  • Containerized applications using Docker and deployed to Kubernetes with Ansible automation, enabling scalable rollouts, configuration consistency, and reduced deployment failures.
  • Engineered dynamic routing and load balancing with NGINX Ingress Controllers and AWS ELB/ALB, supporting high-traffic application scaling and resilient service delivery.
  • Built end-to-end observability using CloudWatch, AppDynamics, and Splunk, configuring alarms and dashboards to proactively detect performance issues and reduce incident response times.
  • Installed, upgraded, and operated IBM WebSphere Application Server and IHS on Red Hat Linux, including SSL/TLS enablement, TLS 1.2 upgrades, and zero-downtime version migrations.
  • Built automation scripts using Bash, Python, and PowerShell, reducing manual workloads and increasing process efficiency across security and operations teams.
  • Dept: Department of Human Services.

Sr. Middleware Administrator

State of Florida (Deloitte)
Tallahassee, FL
08.2013 - 07.2015
  • Led installation and configuration of IBM WebSphere Application Server (6.x–9.x) on Linux, designing clustered, high-availability environments with horizontal and vertical scaling via Deployment Manager.
  • Implemented enterprise security including SSL/TLS, Single Sign-On (SSO), LDAP/Active Directory integration, certificate management, and TLS 1.2 upgrades.
  • Architected and maintained secure communication between WebSphere and IBM HTTP Server (IHS) using SSL/TLS and plug-in configurations.
  • Created and managed JDBC Providers and Data Sources across cell, node, and server scopes to support Oracle 9i/10g/11g backend systems.
  • Deployed and managed EAR/WAR applications, configuring JVM settings and Web Container parameters using Admin Console and wsadmin automation scripts.
  • Tuned JVM and application performance using Tivoli Performance Viewer, optimizing heap sizes, connection pools, thread pools, and system throughput.
  • Implemented queue manager clustering and deployed Broker Archive (BAR) files to enable workload balancing and reduce administrative overhead.
  • Led IBM PMR engagements, applied WebSphere fix packs and interim fixes, and resolved critical production incidents.
  • Administered IBM Rational ClearCase and ClearQuest, performing VOB creation, snapshot views, production rebases, and merge conflict resolution.
  • Dept: Florida Department of Children & Families.

WebSphere Administrator

3M
St. Paul, MN
04.2012 - 07.2013
  • Installed and configured WebSphere Application Server (6.x/7.x) and Apache on AIX and Windows platforms.
  • Deployed and managed WebSphere Portal environments, including clustering, security, and virtual portals.
  • Implemented enterprise security with LDAP integration, SSO, SSL/TLS, and certificate management.
  • Supported full application deployment lifecycle across multiple WAS environments.
  • Tuned JVM performance using Tivoli Performance Viewer, Dynatrace, and Wily Introscope.
  • Implemented high-availability clustering and workload management.
  • Diagnosed JVM memory leaks using heap dumps and thread dumps.
  • Coordinated with DBAs to resolve JDBC and database connectivity issues.
  • Led PMR engagements with IBM and applied system fix packs.
  • Mentored teams on WebSphere operations and troubleshooting.

WebSphere Administrator

Blue Shield of California
El Dorado Hills, CA
11.2011 - 03.2012
  • Installed and configured WebSphere Application Server (6.1/7.x) on Solaris 9/10 and integrated IBM HTTP Server.
  • Designed load-balanced, high-availability WebSphere clusters using WLM and F5 load balancers.
  • Configured JDBC providers, data sources, and connection pooling for Oracle integrations.
  • Led WAS and OS migrations (6.1 → 7.0, Solaris 9 → 10).
  • Supported full application lifecycle from build to production deployment.
  • Diagnosed JVM hung states and memory leaks using heap and thread dump analysis.
  • Tuned JVM and server performance for stability and scalability.

Education

Master’s - Computer Engineering

International Technological University
01-2012

Bachelor’s - Electronics and Communication Engineering

JNTU
Hyderabad
01-2010

Skills

  • Golang
  • Python
  • AWS
  • EC2
  • ECS
  • EKS
  • VPC
  • S3
  • RDS
  • IAM
  • Lambda
  • KMS
  • CloudWatch
  • API Gateway
  • Redshift
  • Jenkins
  • UrbanCode Deploy
  • GitHub
  • Grafana
  • Splunk
  • Datadog
  • New Relic
  • Prometheus
  • Dynatrace
  • Docker
  • Kubernetes
  • SQL Server
  • Oracle
  • DynamoDB
  • VS Code
  • Terraform
  • Ansible
  • PowerShell scripting

Certification

AWS Certified Solutions Architect – Associate, 2XC5ZQ1LD1Q41L5N, http://aws.amazon.com/verification

Timeline

Sr. Cloud DevOps Engineer

Centers for Medicare & Medicaid Services
02.2025 - Current

Sr. Cloud DevOps Engineer

Fannie Mae
03.2021 - 01.2025

Cloud Engineer/DevOps Engineer

State of Georgia (Deloitte)
08.2015 - 02.2021

Sr. Middleware Administrator

State of Florida (Deloitte)
08.2013 - 07.2015

WebSphere Administrator

3M
04.2012 - 07.2013

WebSphere Administrator

Blue Shield of California
11.2011 - 03.2012

Master’s - Computer Engineering

International Technological University

Bachelor’s - Electronics and Communication Engineering

JNTU