Innovative Cloud & Platform Engineering Leader with 18 years of experience in SRE, DevOps, and AI/ML Ops transformation for global enterprises. Expertise in multi-cloud architecture, CI/CD modernization, and Kubernetes platforms. Led global engineering teams to enhance reliability, automate operations, and deliver secure, high-availability platforms across AWS, Azure, and GCP.
Managed global SRE cloud operations and platform engineering, improving system uptime, performance, and scalability in enterprise environments.
Designed and operationalized AI/ML platforms (Domino, DataRobot, Databricks, Dremio), enabling secure model workflows and automated pipelines.
Developed strategic direction for deploying microservices-driven SaaS platforms using Kubernetes, Helm, Docker, and cloud-native design patterns.
Led operations for ADLS Gen2, Hadoop, Hive, Snowflake, Waterline Data, and Privacera platforms, reinforcing data pipeline security and stability through streamlined processes and proactive monitoring.
Specialized in CI/CD pipeline modernization and end-to-end automation & toolchain integration.
Senior DevOps / Cloud Lead
Cognizant Technology Solutions
12.2015 - 03.2019
Engineered large-scale platforms across Azure, AWS, and GCP with expertise in automation and compliance oversight.
Directed comprehensive release governance and migration planning for critical systems with a commitment to 24/7 operational support.
Developed orchestration solutions for cloud migrations, integrating DevOps practices to streamline release processes.
Advanced reliability operations for high-availability services, ensuring robust support for mission-critical systems.
Mentored global SRE and DevOps teams, enhancing collaboration and technical skills across diverse projects.
Administered enterprise database environments including Cassandra, MongoDB, Oracle, and SQL Server to maintain system integrity.
Interacted with clients at conferences to align technology approaches with strategic business objectives.
DevOps / Build and Release Engineer
Patni Computer Systems (acquired by IGATE/Capgemini)
10.2011 - 12.2015
Optimized operational efficiency through management of cloud and data center resources, enhancing resource utilization.
Executed software builds and maintained project release schedules, ensuring timely delivery of software updates.
Reduced manual intervention through automation of deployment processes.
Streamlined operations by developing AMI creation and cloud deployment strategies.
Collaborated with development teams for smooth code integration.
Implemented automated Unix scripts to enhance system monitoring capabilities.
Ensured optimal system performance using New Relic for monitoring and proactive issue resolution.
Identified opportunities for improvement through comprehensive business process analysis.
Software Engineer / Build and Release Engineer
Patni Computer Systems (acquired by IGATE/Capgemini)
01.2008 - 10.2011
Automated deployment scripts to enhance efficiency of production migration processes.
Implemented monitoring scripts to track application functionality and performance.
Evaluated client requirements to inform project design and analysis.
Developed customized applications incorporating user feedback to improve functionality.
Coordinated with onsite teams to facilitate project progress and resolve challenges effectively.
Ensured thorough debugging of accounts during backend updates.
Performed unit and integration testing throughout development phase.
Supported user acceptance testing while managing diverse environments.
Education
Bachelor of Technology (B.Tech) - Information Technology
Sathyabama University
Greater Chennai, India
04.2007
Skills
Cloud architecture
AWS
Azure
GCP
SRE Leadership
Reliability Engineering
DevOps strategy
CI/CD Modernization
MLOps Automation
End-to-end automation
Platform engineering
Kubernetes
Microservices
AI/ML management
Data Integration Tools
Automated ML Platforms
Databricks
SLIs/SLOs
Incident Governance
Cybersecurity compliance
Cloud Patching
Vulnerability Remediation
Terraform
Unix
Windows
Big Data Ecosystems
Hadoop
Hive
ADLS Gen2
Snowflake
Automation observability
Python
ELK
Grafana
Splunk
Maven
Gradle
GitHub
Bitbucket
AWS CodeCommit
JIRA
Pivotal Tracker
Cassandra
MongoDB
Oracle
SQL Server
Global Team Leadership
Coaching
Scaling Teams
Operating Models
Certification
Microsoft Certified: Azure – Designing and Implementing DevOps Solutions (AZ-400)
Six Sigma Green Belt and Lean Certification
Awarded multiple times for leading automation efforts that improved efficiency and reduced costs.