Cloud & Platform Engineer with over 8 years of expertise in architecting and automating secure infrastructures across AWS, Azure, and GCP. Focused on multi-cloud Kubernetes platforms, integrating GitOps, service mesh, and observability while ensuring compliance and security. Achieved cost reductions exceeding $1M annually and enhanced operational efficiency through automation. Committed to building resilient platforms that drive business success.
• Engineered a multi-cloud container platform across AWS (EKS), Azure (AKS), and GCP (GKE), standardizing workloads on Kubernetes with secure private networking and integrated CI/CD, observability, and compliance.
• Led Kubernetes upgrades from v1.14 to v1.30 across 150+ clusters, performing detailed middleware compatibility checks to ensure Flux CD, Argo CD, Cilium, Kyverno, CSI drivers, and other tools remained fully operational.
• Automated cluster provisioning with Terraform and custom Python scripts, embedding end-to-end GitOps pipelines, service mesh, observability, and compliance — saving 5,000+ engineering hours annually.
• Identified and corrected node pool inefficiencies and VMSS SKU skews, achieving $100K monthly savings (~$1.2M annually) through strategic resource tuning and Kubecost analytics.
• Managed 500+ ServiceNow requests and resolved 100+ critical incidents, ensuring zero downtime for high-impact workloads and driving platform reliability.
• Enhanced security posture by migrating legacy applications to private networking, removing public endpoints, enforcing workload identity, and deploying Kyverno to uphold compliance standards.
• Implemented Velero backup for Kubernetes across AKS, EKS, and GKE, ensuring disaster recovery readiness and rapid cluster restoration capabilities.
• Worked on Azure-native services such as Function Apps for event-driven automation, Service Bus for decoupled messaging, and integrated App Insights for monitoring application health and performance.
⸻
📊 Notable Achievements
• $1M+ annual cost savings: Optimized multi-cloud infrastructure costs through strategic node pool rightsizing and automation.
• 5,000+ engineering hours saved: Delivered fully automated Kubernetes clusters with pre-integrated middleware, eliminating repetitive manual setups.
• Enterprise-grade security: Transitioned workloads to private networks, removed static secrets with workload identity, enforced vulnerability scanning (Wiz.io) & compliance (Kyverno).
• Six Sigma methodologies for cost, process, and infrastructure optimization