
Site Reliability Engineer with 9+ years of experience in the Information Technology industry, specializing in cloud infrastructure, CI/CD automation, and production support. Extensive experience with Google Cloud Platform (GCP) and AWS, including infrastructure automation, IAM management, and Kubernetes-based deployments. Proven expertise in build and release management, GitOps workflows, and containerization using Docker. Skilled in implementing monitoring and observability solutions with Prometheus, Grafana, OpenTelemetry, Tempo, and Promtail to enable proactive issue detection and reduce MTTR. Adept at maintaining high system availability, optimizing deployment pipelines, and collaborating with cross-functional teams to deliver scalable, reliable solutions.