DevOps & Cloud Engineer | 8+ years driving infrastructure scalability, reliability, and automation. Led 250+ server migrations with 100% uptime, designed monitoring systems (Splunk, Grafana, Nagios) for 99.99% availability, and slashed MTTR by 75% via SLO/SLI frameworks. Expert in cloud migrations, Linux optimization, IaC (Terraform/Ansible), and CI/CD pipelines enhanced by automated testing (e.g., Sorry Cypress). Passionate about building resilient, scalable systems that accelerate delivery and minimize downtime.
● Designed and implemented automated patching solutions using Red Hat Satellite Server, ensuring 100% compliance with security updates and reducing manual intervention by 80%.
● Configured liveness probes to perform application health checks and readiness probes to prevent the service from routing traffic to unhealthy containers.
● Established metrics and monitored AWS resource health, application performance, and SSL certificate expiration at scale using New Relic.
● Deployed applications to the Kubernetes cluster using Rolling Update and Blue-Green deployment strategies.
● Used Kubernetes to orchestrate the deployment of containers across multiple nodes.
● Configured EBS snapshot automation to back up the Jenkins data disk using Amazon Data Lifecycle Manager.
● Built and maintained automated CI/CD pipelines for code deployment using Jenkins.
● Participated in on-premises to AWS cloud migration, achieving zero downtime and improving system scalability.
● Troubleshot continuous integration systems, including builds and deployments.
● Developed Terraform templates to provision AWS infrastructure.
● Optimized SLOs/SLIs, reducing MTTR by 75%, from 2 hours to 30 minutes.
● Reduced deployment failures by running smoke tests with Cypress, increasing deployment reliability by 40%.
• Led monthly patching.
• Configured and managed Red Hat Satellite Server for patch management and system updates.
• Collaborated with cross-functional teams to plan and execute the migration with zero downtime.
• Used Kickstart and templates to build physical and virtual servers and deploy them to the network.
• Troubleshot TCP/IP, DHCP, DNS, NFS, and Samba issues.
• Performed user and security management.
• Performed file system management using LVM, NAS, and SAN tools.
• Configured storage pools in RHEL 8 using the Virtual Data Optimizer (VDO).
• Provided technical support to software developers.
• Configured Apache HTTPD, NFS shares, and Samba shares.
• Configured and managed virtual servers using VMware and KVM.
• Performed security hardening on dedicated servers.
VMware, KVM, Red Hat, Ubuntu, CentOS, Grafana, Splunk, Patch Management, Bitbucket, ServiceNow, PagerDuty, Nagios, Sorry Cypress, TCP/IP, DHCP, DNS, NFS, VDO, Red Hat Satellite, Nimble Storage, F5, Cloudflare, ZooKeeper, Kafka, SMTP, FTP, SSH, SSL/TLS, NTP, SNMP, Active Directory, Jira, GitHub, Bash, Tomcat, Apache, Nginx, HTTPS, SAN, NAS