Architected and deployed Prometheus and Grafana with sub-second (1s) metric scraping to deliver real-time visibility into application and infrastructure health.
Enabled proactive incident response that sustained 99.9% application uptime, preventing customer impact and safeguarding revenue.
Designed and deployed AWS infrastructure using CloudFormation (EKS, EC2, CodePipeline) to enable scalable and automated application deployments.
Implemented AWS security best practices across stacks, ensuring compliance and safeguarding critical workloads.
Architected and developed Python automation with AWS Lambda to trigger Grafana alerts for EC2 “insufficient data” states, ensuring timely issue detection.
Built automated alert cleanup workflows to remove unused alarms, ebs, reducing and saving operational costs.
Provided production support Performed root cause analysis (RCA) on recurring issues, driving permanent fixes and improving overall system reliability.
Managed end-to-end application deployments on Kubernetes using Helm charts, streamlining release cycles and simplifying configuration management.
Architected centralized logging with Grafana + Loki,Integrated AWS Q/Bedrock–backed LLM analysis via Lambda to summarize logs and suggest RCAs, cutting MTTR and reducing repeat incidents through automated insights.
Site Reliability Engineer
Idelic
Dallas
01.2022 - 08.2022
Engineered highly available AWS infrastructure (EC2, ELB, Auto Scaling, IAM, Networking) with Terraform and CloudFormation, reducing manual setup time by 70% and preventing potential downtime costs.
Implemented CloudWatch and Datadog monitoring with automated alerts, sustaining 99.9% application uptime and avoiding customer-impacting incidents that could result in revenue loss.
Automated application deployments using Docker, ECS, Kubernetes, and Helm via Jenkins pipelines, accelerating release cycles by 50% and enabling faster time-to-market for revenue-generating features.
Developed Python, Bash, and PowerShell scripts for infrastructure automation, cutting operational costs and increasing team productivity, delivering measurable business efficiency.
DevOps Engineer
PayPal
San Jose
05.2021 - 01.2022
migration of 300+ legacy Puppet modules and Packer/Terraform code from AWS to GCP, modernizing infrastructure and reducing cloud operational costs by 25%.
Managed GCP services including Compute Engine, App Engine, Cloud Storage, Load Balancers, BigQuery, and Firewalls, ensuring high availability and secure multi-environment deployments.
Enhanced CI/CD pipelines by updating testing suites and automating deployments with Puppet, Ansible, Ruby, and Bash scripts, accelerating release cycles by 40%.
Streamlined infrastructure management across AWS and GCP through automation, reducing manual intervention, improving system reliability, and preventing potential downtime costs.
AWS System Administrator
SYNERGY
Dallas
07.2020 - 05.2021
Managed and maintained cloud infrastructure on AWS, provisioning EC2 instances, VPCs, subnets, and security groups to ensure high availability and secure operations.
Configured and monitored relational databases (RDS/MySQL/PostgreSQL), performing backups, tuning, and disaster recovery to maintain 99.5% database uptime.
Implemented automation scripts using Bash and Python to streamline routine system administration tasks, reducing manual effort by 50%.
Monitored system performance and resolved incidents proactively, improving overall reliability and minimizing downtime impact on business operations 70%.
Education
Master of Science - Information Technology
University of North Texas
Denton, TX
05-2021
Bachelor in Computer Science - Computer Science Engineering
Gitam University
India
05-2019
Skills
Kubernetes management
AWS infrastructure design
Python programming
System administration
Software design and development
Containerization technologies
Version control systems
Security best practices
CI/CD automation
Monitoring techniques
Linux
Network fundamentals
Aws Q, sagemaker, bedrock
Certification
prometheus certificate
Kubernetes And Cloud Native Associate.
Timeline
Aws Devops Engineer
Boost Mobile
09.2022 - Current
Site Reliability Engineer
Idelic
01.2022 - 08.2022
DevOps Engineer
PayPal
05.2021 - 01.2022
AWS System Administrator
SYNERGY
07.2020 - 05.2021
Master of Science - Information Technology
University of North Texas
Bachelor in Computer Science - Computer Science Engineering