

Tech Lead MLOps / DevOps Architect with over 10 years of experience building and operationalizing large-scale ML and AI infrastructure across cloud and hybrid environments. Certified Google Cloud Architect and Kubernetes expert skilled in designing end-to-end MLOps platforms covering model training, validation, deployment, monitoring, and governance.
Expert in Python-based automation, Kubernetes (GKE), container orchestration, and event-driven pipelines, integrating MLflow for model lifecycle management. Proven track record of building cost-optimized GPU clusters, CI/CD frameworks, and auto-remediation systems that drive operational excellence, reproducibility, and scalability.
Collaborative leader bridging data science, AI research, and engineering to ensure secure, compliant, and high-availability ML systems. Passionate about advancing automation, observability, and continuous improvement across the full AI delivery pipeline.
My role sits at the intersection of AI research, DevOps, and SRE. I actively drive AI/ML initiatives using MLflow, PyTorch, and TensorFlow frameworks for training, validating, and deploying production-grade models with strong governance and reproducibility. I’ve also contributed to building pretrained model workflows and documenting reliability frameworks aligned with SRE best practices for AI-driven systems.