Sai Krishna Reddy Gujjula

Summary

Site Reliability Engineer with 9 years of experience maintaining and improving system reliability through automation and proactive monitoring. Troubleshoots and resolves complex production issues efficiently. Combines infrastructure optimization expertise with close cross-team collaboration to improve system performance and reliability.

Overview

9 years of professional experience
1 Certification

Work History

Site Reliability Engineer

Apple
06.2022 - Current
  • Managed quarterly production releases and deployments using AWS and EKS.
  • Monitored system health and network observability to ensure high availability.
  • Collaborated with the performance team on capacity planning and infrastructure scaling.
  • Supported developers in non-production environments to resolve blockers and meet release deadlines.
  • Optimized infrastructure efficiency and saved 80,000 monthly through capacity planning, resource utilization audits, automated weekend scale-downs, and remediation of misconfigured compute instances.
  • Provided 24/7 on-call support and root cause analysis for production incidents.

DevOps Engineer

Capco
11.2021 - 06.2022
  • Deployed microservices on Kubernetes across multiple regions for scalability.
  • Automated software releases using Azure cloud technologies.
  • Provided technical guidance to developers on DevOps best practices and process improvements.

DevOps Engineer

Vanguard
06.2021 - 11.2021
  • Monitored system health using Splunk for early detection of issues.
  • Collaborated with developers to troubleshoot and resolve application issues.
  • Deployed infrastructure automation and configuration management tools.

Site Reliability Engineer

Apple
04.2020 - 06.2021
  • Executed on-premise to AWS migrations and transitioned services from EC2 to EKS.
  • Partnered with performance teams and developers to maintain stability during cloud transitions.
  • Performed root cause analysis and updated incident response documentation.

DevOps Engineer

Stanford
01.2020 - 04.2020
  • Built Ansible automation scripts for configuring virtual machines at scale.
  • Streamlined deployment workflows by implementing automated configuration tools.

Site Reliability Engineer

Apple
07.2019 - 12.2019
  • Managed high availability for applications within on-premise data center environments.
  • Monitored server resource utilization and performed hardware-level optimizations.
  • Investigated root causes of production issues related to on-premise application releases.

DevOps Engineer/Big Data Admin

Verizon Media (Verizon Yahoo)
05.2017 - 07.2019
  • Automated installation of apps and services using Ansible playbooks.
  • Configured Cloudera data nodes and created Splunk dashboards for log analysis.
  • Participated in disaster recovery exercises to test system reliability and uptime.

Education

Master's in Information Technology and Management

Campbellsville University
Campbellsville, KY
12.2021

Skills

  • Incident management
  • Microservices architecture
  • Scripting languages
  • Infrastructure automation
  • Capacity planning
  • System monitoring

Certification

  • Golden Kubestronaut (CNCF)
  • AWS Certified DevOps Engineer - Professional
  • AWS Certified Solutions Architect - Associate
  • AWS Certified SysOps Administrator - Associate
  • HashiCorp Certified: Terraform Associate
  • AWS Certified Cloud Practitioner
