Summary
Overview
Work History
Education
Websites
Certification
Timeline
Generic
SAI DIKSHIT PASHAM

SAI DIKSHIT PASHAM

Cloud Consultant
Alpharetta,GA

Summary

Seasoned Cloud and DevOps Engineer with nearly 9 years of experience in designing, implementing, and managing highly available, scalable, and resilient cloud infrastructures. Expertise in AWS, Kubernetes, Docker, and CI/CD pipelines to streamline deployments and enhance operational efficiency. Proficient in migrating applications from on-premises to the cloud, implementing service meshes like Istio, and architecting disaster recovery plans for mission-critical systems. Demonstrates strong skills in Infrastructure as Code (IaC) using Terraform and Helm, and in observability with OpenTelemetry and Splunk for real-time system monitoring and troubleshooting. Skilled in collaborating with cross-functional teams to deliver innovative solutions for distributed systems, containerized environments, and hybrid cloud setups. Proven ability to enhance uptime, optimize workflows, and meet evolving technical and business demands with a focus on scalability and reliability

Overview

11
11
years of professional experience
1
1
Certification

Work History

Senior Cloud Infrastructure Consultant

Transcend IT Solutions
Alpharetta, GA
10.2023 - Current
  • Designed and implemented high-availability architecture for card issuing services, including authorization, tokenization, and essential APIs, ensuring seamless operations and resilience for critical banking applications.
  • Migrated collection services from on-premises to the cloud, establishing global URLs to enable efficient data access between on-premises and cloud environments.
  • Transitioned Argo CD from an "Apps of Apps" model to an "App Set" solution, streamlining deployments and implementing a Hub-and-Spoke model for centralized management.
  • Deployed Istio service mesh for load balancing and traffic management, optimizing performance for beta-released services and ensuring efficient routing across environments.
  • Developed disaster recovery (DR) plans for application resiliency on AWS Cloud, utilizing Application Recovery Controller to implement multi-region failover setups for Tier 1, 2, and 3 applications in both active-active and active-passive configurations.
  • Built smoke testing frameworks using K6 for critical business APIs, supporting major banking clients like Capital One and JPMC, and implemented heartbeat monitoring for applications using CloudWatch Synthetics.
  • Configured on-call alerts with CloudWatch Alarms, ensuring timely notifications and rapid response for production incidents.
  • Migrated Kubernetes workloads from Cluster Autoscaler to Karpenter on EKS and EKS Anywhere, enhancing scalability and resource efficiency.
  • Engineered HAProxy configurations to serve as a global proxy between on-premises and cloud applications, enabling seamless application failover for active-active and active-passive scenarios.
  • Implemented Kafka replication using Amazon MSK, ensuring fault-tolerant and highly available messaging across distributed systems to support scalable and resilient applications.
  • Integrated OpenTelemetry (OTel) Collector with Splunk to enhance observability and provide real-time insights into system performance and troubleshooting.

System Development Engineer II

Amazon
Austin, TX
05.2022 - 10.2023
  • Designed and implemented a build infrastructure for game SDKs using Kubernetes, optimizing resource utilization and ensuring seamless integration with workflows across Windows, Linux, and MacOS platforms.
  • Managed and maintained high-availability build servers, reducing downtime and enhancing productivity, while creating monitoring systems to track build performance and proactively resolve issues.
  • Automated testing and validation of game SDK builds using test rails, ensuring high-quality releases and faster time-to-market, and conducted performance testing using JMeter and LoadRunner to identify and address bottlenecks.
  • Utilized AWS services such as EC2, S3, and CloudFront to host and distribute SDK builds, ensuring global accessibility and reliability for developers and customers.
  • Developed CI/CD pipelines in Jenkins to automate the retrieval, compilation, testing, and deployment of SDK builds, integrating Nexus for artifact storage and leveraging Ansible for deployment automation.
  • Created a private cloud infrastructure using Kubernetes and Helm to support development, testing, and production environments, managing Kubernetes manifests and releases for reproducible builds and scalable environments.
  • Built and managed GitOps strategies for automated updates to the release portal, streamlining SDK release processes and aligning with Unreal Engine CI build requirements.
  • Collaborated with cross-functional teams, including developers, QA engineers, and product managers, to ensure successful SDK integrations, meeting customer needs and business objectives.
  • Implemented new strategies to reduce costs and improve efficiency of engineering team.

Sr Devops Engineer

Global Payments
Phoenix, AZ
05.2021 - 05.2022
  • Designed and deployed multi-tier applications using AWS services such as EC2, Route53, S3, RDS, DynamoDB, SNS, and SQS, ensuring high availability, fault tolerance, and auto-scaling with AWS CloudFormation and Service Catalog for self-service deployments.
  • Leveraged AWS services like EC2, auto-scaling, and VPC to build secure, scalable systems for handling unpredictable load bursts, integrating AWS CloudWatch, CloudTrail, CloudFront, and CLI for resource provisioning and monitoring.
  • Implemented scalable, production-ready Kubernetes infrastructure with Helm for microservices orchestration and containerized deployments, creating reproducible builds and managing Kubernetes manifest files for multiple environments.
  • Developed CI/CD pipelines in Jenkins for end-to-end automation, including retrieving code, compiling applications, running tests, and deploying build artifacts to Nexus, while utilizing Terraform for IaC and Helm for Kubernetes charts.
  • Planned and implemented hybrid cloud solutions integrating on-premises servers with AWS for highly sensitive data, designed scalable DNS systems within AWS Cloud, and ensured traffic routing via AWS Direct Connect.
  • Participated in production RCAs and postmortems, building strategies to improve uptime within SLA, and created playbooks to enhance deployment processes and mitigate similar future issues.
  • Automated testing pipelines with TDD/BDD frameworks like Cucumber and Behave for Java and Python scripts, integrating them into Jenkins pipelines to enhance code quality and deployment reliability.
  • Deployed and monitored MySQL and Oracle databases on RDS across multiple availability zones, setting alarms for CPU utilization and database connections, with Splunk for logging and performance analysis.

Cloud Engineer

Brillius Technologies
San Jose, California
08.2017 - 04.2021
  • Orchestrated AWS infrastructure setup and deployments from QA to Production, including AMI-based and containerized deployments using Jenkins and Harness pipelines. Streamlined Kubernetes (EKS/KOPS) production deployments and troubleshooting processes, ensuring high availability and fault tolerance.
  • Designed and validated architecture solutions to enhance scalability and efficiency, implementing API Gateway and NGINX as reverse proxy servers with Consul for service discovery and management.
  • Automated AMI creation using Packer and developed Python-based DNS mapping to dynamically scale instances with MySQL integration. Utilized Terraform for Infrastructure as Code, integrating with Git and deploying Linux-based applications.
  • Architected and deployed applications leveraging AWS services (EC2, S3, RDS, Route53, Lambda, SQS, SNS, CloudWatch), focusing on scalability, auto-scaling, fault tolerance, and high availability.
  • Monitored application performance using tools like AppDynamics, Grafana, Prometheus, and AWS CloudWatch, while troubleshooting issues with Splunk log analysis. Resolved P1/P2 production issues in collaboration with customer care teams.
  • Addressed Linux vulnerabilities through Qualys scans, implemented security patches, and supported network services by analyzing packet flows using Wireshark. Created detailed RCA documentation for stakeholders.
  • Delivered innovative solutions for AWS cloud integration, monitoring, and automation for financial enterprises, including custom instance types and private networks for specialized applications.
  • Collaborated with cross-functional teams to ensure the timely delivery of applications to production, adhering to Agile methodologies and continuously optimizing workflows to meet customer requirements.
  • Provided 2nd and 3rd level technical support and troubleshooting to internal and external clients.

Release Engineer

ACUMEN INFINITE SOLUTIONS
Bangalore, India
07.2013 - 01.2015
  • Responsible for Infra setup on AWS for Non-Prod Environments
  • DevOps for load balanced environments & Multi-regional server environments (AWS Regional nodes managed via Roles)
  • Participated in weekly release meetings with technologies take holders to identify and mitigate potential risks associated with the releases
  • Release Engineer for a team that involved different development and multiple simultaneous software releases
  • Jenkins is used as a continuous integration tool for automation of daily process

Education

Master's - information system

University of Illinois
01.2016

Certification

  • AWS Certified Developer - Associate, AWS-ADEV-11060
  • HashiCorp Certified: Terraform Associate

Timeline

Senior Cloud Infrastructure Consultant

Transcend IT Solutions
10.2023 - Current

System Development Engineer II

Amazon
05.2022 - 10.2023

Sr Devops Engineer

Global Payments
05.2021 - 05.2022

Cloud Engineer

Brillius Technologies
08.2017 - 04.2021

Release Engineer

ACUMEN INFINITE SOLUTIONS
07.2013 - 01.2015

Master's - information system

University of Illinois
SAI DIKSHIT PASHAMCloud Consultant