AWS DevOps Site Reliability Engineer with hands-on experience in designing and managing scalable infrastructure on AWS and GCP. Proficient in deploying and managing containerized applications using Kubernetes (EKS), creating and optimizing CI/CD pipelines, and automating infrastructure with Terraform and CloudFormation. Strong background in Site Reliability Engineering (SRE) practices including incident response, root cause analysis, and service monitoring. Experienced in setting up observability platforms using Prometheus, Datadog, CloudWatch, and Grafana for end-to-end infrastructure and application monitoring. Adept at implementing agile DevOps practices to increase deployment speed, system resilience, and operational efficiency.
Academic Project Achievement
Operational Excellence
Cross-Team Impact