Summary
Overview
Work History
Education
Skills
Container Orchestration
Configuration Management And Automation
Monitoring And Observability
Ci Cd
Infrastructure As Code
Scripting And Automation
Databases And Datastores
Security Best Practices
Version Control Systems
Synthetic Monitoring
Network Understanding
Analytical Skills
Enterprise It Management Technologies
Cloud Platforms
Timeline
Generic

Sai Krishna

Bangalore,India

Summary

Overall, 10 years of experience as Cloud, DevOps, SRE and Platform Engineer. Having extensive experience in SCM, AWS, SDLC, CI/CD, Cloud Computing and Build/Release Management and Agile Methodologies. Competent Engineering professional offering foundation in engineering project management and design. History of success in performing load and cost calculations and establishing clear parameters. Detail-oriented with strong knowledge of SRE and DevOps.

Overview

12
12
years of professional experience

Work History

DevOps, Platform & SRE Engineer

Adobe Systems
03.2022 - Current
  • Currently working as SRE within Adobe Commerce(Magento) team at Adobe
  • Responsibilities include leading comprehensive SRE Dashboard creation, optimizing performance, resolving production incidents promptly, delivering SRE training sessions, designing strategies for scalable infrastructure growth, and spending time on proactive improvements to system reliability and resilience
  • Also experienced in troubleshooting pipeline deployments, Redis, Elasticsearch (Opensearch), MySQL, and identifying performance issues using tools like NewRelic, Grafana, and Fastly interfaces
  • Hands-on experience in Distributed tracing for web applications
  • Experience working in ELK stack, Prometheus, Thanos, Grafana, and alert manager with different Exporters
  • Parallelly working on incidents raised by customers around the globe through the ticketing tool Zendesk
  • Involved in Root cause analysis (RCA), troubleshooting of issues through NewRelic APM
  • Dealing with Deployments, dep-failures, Sendgrid, GraphQL API, Databases, Fastly, Redis, Elastic search, performance, MySQL, Bots/XSS/Ddos, crons, infra, and other Magento core issues.

Sr. Site Reliability Developer

Oracle
03.2020 - 03.2022
  • Worked on Node, Windows, Process, JMX, Nginx, Jetty, Oracle DB, Consul, and Kafka exporter for standalone, Docker, and Kubernetes Environment
  • Created Docker file, Kubernetes deployment, services, configmaps, and secrets
  • Also made pipeline changes for all above exporters
  • Worked on Prometheus configuration by adding multiple jobs to see metrics in Prometheus
  • Added multiple alerts in Thanos and tested and refined in dev
  • Created alertmanager configuration with multiple routes, features, and integrated with SLACK, SMTP, and PagerDuty
  • Implemented detailed and beautiful Grafana Dashboards for all exporters/metrics
  • Integrated Kibana watcher with PagerDuty and JIRA, and integrated slack V2 to complete the flow for the operations team
  • Created Kibana visualizations and dashboards via pipeline to make persistent by creating respective configmaps in MSP stack
  • End-to-end configuration and setup of PagerDuty for multiple projects
  • Worked with Cloud Reliability services team across the globe.

Cloud Devops Engineer

Mediakind
03.2016 - 03.2020
  • Worked with clients such as Telus, Vodafone Qatar, and Bell Canada
  • Responsibilities included creating new infrastructure according to full components based on slot and Geo redundancy, writing Docker files, deployments, configmaps, and secrets
  • Worked with AKS, Azure container registry with Azure DevOps, and made helm changes for K8S objects and DCS changes
  • Created deployment yaml files, helm changes for that service, and troubleshooting K8S pods
  • Involved in bug fixing and bug follow-up with US counterparts
  • Deployed Kubernetes container applications using Azure Kubernetes Service (AKS), ACS, Azure Active Directory, Azure Virtual Network, Azure Storage, and Azure Database for MySQL.

CloudOps Engineer

Indecomm Global Services
07.2013 - 03.2016
  • Worked with clients such as Ingenico
  • Responsibilities included designing and deploying a multitude of applications utilizing almost all of the AWS stack (including IAM, VPC, EC2, EBS, RDS, S3, Glacier, Lambda, ELB, Auto Scaling, Elastic Beanstalk, Route53, CloudFront, CloudWatch, CloudTrail, CloudFormation, SQS, and SNS) focusing on high availability, fault tolerance, and autoscaling in AWS cloud formation
  • Worked on creating server-less Microservices by integrating AWS Lambda, S3, Cloud watch, API Gateway, and deploying Elastic Beanstalk applications to various environments on AWS
  • Experienced in Terraform setup, managing hosts file, wrote Terraform modules to automate AWS services like Launching EC2, Provisioning IAM, Configuring VPC, EBS.

Middleware Developer

Tarento Technologies
02.2012 - 05.2013
  • Worked with clients such as 20:20 Mobile
  • Responsibilities included working on Enterprise Software Implementation and Integration experience that includes Architecture, Analysis, Design, and Development of Oracle SOA.

Education

Master of Science - Computers

Andhra University
Visakhapatnam
09.2008

Skills

  • Cloud Computing: AWS, Microsoft Azure, Google Cloud Platform, Openstack
  • Scripting and programming: Python, Bash
  • Web/Application Tools: Nginx, Web Logic, Apache Tomcat, Jetty, Apache2, PHP
  • Automation Tools: Jenkins, Python, Terraform, GraphQL
  • Networking: DNS, DHCP, TCP/IP, SMTP, LDAP
  • Build Tools: ANT, Maven, Msbuild, Postman
  • Configuration Tools: Ansible, Helm
  • Bug Tracking Tools: Service NOW, JIRA, Zendesk
  • Repository Manager Tools: Nexus, JFrog
  • Operating Systems: RHEL, CentOS, Ubuntu, Solaris 10, Windows 2012 R2
  • Databases: MySQL, Oracle, Kafka, PostgreSQL, Redis, Cassandra
  • Monitoring Tools: Nagios, Cloud Watch, Grafana, NewRelic, Prometheus, thanos ELK, PagerDuty, Rundeck, Stackstorm
  • Version control tools: Git, GitHub, SVN, Bitbucket, Gitlab
  • Virtualization/Container: Docker, Kubernetes
  • Engineering Documentation

Container Orchestration

Strong experience with container orchestration platforms such as Docker, Kubernetes, including deployment, scaling, and management of containerized applications.

Configuration Management And Automation

Proficient in configuration management tools such as Ansible with a strong emphasis on automation and infrastructure as code (IaC) practices.

Monitoring And Observability

Hands-on experience with monitoring and observability tools such as Prometheus, Grafana, ELK stack (Elasticsearch, Logstash, Kibana), and NewRelic for real-time system monitoring, logging, tracing, and alerting.

Ci Cd

Experience with CI/CD pipelines and tools such as Jenkins, GitLab CI/CD, including automated testing, deployment, and rollback strategies.

Infrastructure As Code

Proficiency in IaC tools such as CloudFormation for provisioning and managing infrastructure resources declaratively.

Scripting And Automation

Strong scripting skills in languages such as Python, Shell for automating repetitive tasks, managing configurations, and orchestrating deployments.

Databases And Datastores

Experience with relational databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra), time series databases Including performance tuning, replication, and high availability configurations.

Security Best Practices

Familiarity with security best practices for cloud environments, including identity and access management (IAM), encryption, network security, and compliance standards such as PCI-DSS and GDPR.

Version Control Systems

Proficient in version control systems such as Git, TFS, ADO, including branching strategies, code reviews, and collaboration workflows.

Synthetic Monitoring

Extensive experience with synthetic monitoring tools such as New Relic Synthetics, monitoring application performance from external locations.

Network Understanding

Strong understanding of networking, distributed systems, microservices architecture, and other relevant architectural concepts.

Analytical Skills

Excellent problem-solving skills and the ability to troubleshoot complex issues in production environments.

Enterprise It Management Technologies

Extensive experience with the following Enterprise IT Management technologies: SLA Monitoring, Application Monitoring, End User Response Time Monitoring, Server Monitoring, Browser Monitoring, synthetic Monitoring.

Cloud Platforms

Advanced proficiency in one or more cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), including expertise in services such as EC2, S3, RDS, and VPC networking.

Timeline

DevOps, Platform & SRE Engineer

Adobe Systems
03.2022 - Current

Sr. Site Reliability Developer

Oracle
03.2020 - 03.2022

Cloud Devops Engineer

Mediakind
03.2016 - 03.2020

CloudOps Engineer

Indecomm Global Services
07.2013 - 03.2016

Middleware Developer

Tarento Technologies
02.2012 - 05.2013

Master of Science - Computers

Andhra University
Sai Krishna