Dynamic and results-driven Site Reliability Engineer with over 9 years of IT experience, specializing in Kubernetes and Infrastructure as Code (IaC). Proven track record of building and leading high-performing SRE teams for more than 5 years, driving reliability initiatives, managing incident responses, and enhancing CI/CD pipelines. Expertise in embedding operational best practices across engineering teams while fostering cross-functional collaboration and mentoring engineers. Committed to implementing Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure optimal system performance and uptime.
Overview
12
12
years of professional experience
1
1
Certification
Work History
Senior Cloud Engineer - Team Lead
Techsur Solutions
08.2023 - Current
Migrated on-premises infrastructure to AWS cloud environment, resulting in increased scalability and reduced operational costs by 40%.
Led the migration of legacy applications to Kubernetes on AWS EKS, resulting in improved performance and cost savings.
Developed and maintained infrastructure as code using Terraform, enabling consistent and repeatable deployments across multiple environments.
Provisioned highly scalable infrastructure on AWS using CDK by reducing the infrastructure provisioning time by 50%.
Led the implementation of AWS CDK project, leveraging infrastructure-as-code principles to automate the provisioning and management of AWS resources.
Created deployment configuration files for Openshift kubernetes containers.
Leveraged AWS CDK's programming flexibility (Typescript) to customize and extend the functionality of CDK constructs.
Automated monitoring and alerting processes using CloudWatch, enabling proactive identification and resolution of performance issues.
Designed and implemented disaster recovery solutions using AWS services such as AWS Backup, ensuring business continuity in case of system failures.
Architected a Zero trust network architecture leveraging AWS Transit Gateway, centralizing network connectivity and enforcing consistent security policies across multiples VPC's.
Led a team of engineers in the successful implementation of a serverless architecture using AWS Lambda and API Gateway, resulting in improved application performance and reduced maintenance efforts.
Led a cross-functional team in the design and implementation of cloud-based solutions, enhancing operational efficiency and scalability.
Developed and maintained automated deployment pipelines using CI/CD practices, significantly reducing release times and improving system reliability.
Spearheaded the migration of legacy systems to cloud infrastructure, optimizing resource utilization and reducing operational costs.
Mentored junior engineers on cloud architecture best practices, fostering professional growth and enhancing team productivity.
Collaborated with stakeholders to define cloud strategy, aligning technical solutions with business objectives to drive innovation.
Developed automated deployment processes for microservices using Kubernetes and Helm, improving deployment frequency by 70%.
Integrated the image scanning feature of ECR into CD pipeline stage to perform vulnerability assessments on container images, identifying known vulnerabilities and security risks.
Senior Site Reliability Engineer - SRE Lead
Latch
10.2022 - 08.2023
Leading the effort for AWS cost savings. Investigating the usage of RDS, ECS, and Elasticache clusters and re-sizing them appropriately resulting in 15% of cost savings in the monthly AWS spend.
Took the ownership of the re-designing CI/CD platform for the organization. Analyzed the existing CI/CD platform, documented the challenges and also the improvements/expectations that are needed from the new CI/CD solution.
Evaluating potential new CI/CD platform which meets all the success criteria defined and worked on setting up pilot project around GitHub Actions and Harness CI/CD.
Designed and implemented logging and monitoring solution with ELK stack, providing real time visibility into system performance and application logs.
Designed solution to publish application logs from pods running on EKS to central logging systems like AWS OpenSearch.
Taking data driven decisions to improve resiliency and service quality of our existing CI/CD pipelines.
Lead the migration of applications from on-prem to Openshift and AWS EKS containers.
Empowering and evangelizing software engineers across various teams to leverage Infrastructure as Code (Terraform) to deploy the infrastructure into AWS.
Mentored junior engineers in SRE best practices, fostering a culture of continuous improvement and technical excellence within the team.
Championed the adoption of Agile methodologies, improving project delivery timelines and team collaboration across multi-disciplinary projects.
Cloud Service Engineer
Securonix
06.2022 - 10.2022
Partnered with infrastructure teams on evaluation and feasibility assessments of new systems and technologies
Provided technical leadership and delivered innovative products and services to address customer specific requirements
Understood client needs and objectives by conducting proactive customer and data analysis
Implemented a monitoring and alerting system that reduced response times to critical incidents by 50%
Automated the deployment and configuration of infrastructure, reducing deployment time by 80%
Designed and implemented a system for log aggregation and analysis, reducing time to resolution for issues by 60%
Collaborated with product managers to set and maintain Service level Objectives (SLOs) and metrics representative of the customer experience
This resulted in a 10% improvement in customer satisfaction.
Cloud Operations Engineer
Clarivate Analytics
10.2017 - 05.2022
Participated in design and architect of 7-layer VPC architecture on AWS for customers
Automated whole 7-layer VPC provision and validation with a single click by integrating with Jenkins
Lead the migration of logging infrastructure from Graylog to Elasticsearch
Leveraged Kibana for visualizations
Designed IAM Roles and Policies to restrict access to various AWS resources based on the user access level and organizations security requirements
Configured AWS cloud infra as code using terraform and continuous deployment through Jenkins
Integrating third-party applications with Azure Active Directory and provisioning SSO login
Provisioning AWS Workspaces from custom images and bundles
Tested and configured AWS Workspaces (Windows virtual desktop solution) for custom application requirement
Working with VPC i.e IPv4, IPv6, Route tables, NAT instances, Internet Gateway (IGW) and Virtual private gateway (VGW)
Provided technical leadership and delivered innovative products and services to address customer specific requirements
Worked with cloud architect to generate assessments and develop and implement actionable recommendations based on results and reviews
Partnered with infrastructure teams on evaluation and feasibility assessments of new systems and technologies
Used Datadog metrics to monitor application and infrastructure performance
Automated IAM roles and policies using terraform
Maintained JIRA for tracking and updating project defects and tasks.
Cloud Engineer
Florida International University
09.2015 - 12.2016
Experience and good knowledge in AWS (Amazon Web Services) services like EC2, S3, Glacier, Elastic Load Balancer (ELB), RDS, SNS, SWF, Cloud watch, Route53 and Lambda
Building CI/CD pipelines for build and release of various java/.net based applications and automating maven/ant builds through Jenkins CI pipeline
Created and designed whole application CI/CD pipeline from code to build, deploy, test application and infrastructure in a pipeline
Maintaining infra as code in version control (GIT) and running infra validation in Jenkins CI/CD pipeline using Server spec for each deployment
Managed Ubuntu Linux and Windows virtual servers on AWS EC2 using Puppet
Launching Amazon EC2 Cloud Instances using Amazon Images (Linux/Windows) and Configuring launched instances with respect to specific applications
Configured Elastic Load Balancers with Elastic Compute Cloud Auto scaling groups
Used ELB and Auto scaling for load balancing and scaling EC2 instances up/down based on Network Traffic
Worked on cloud watch to monitor resources such as EC2 CPU memory, Amazon to design high availability applications on AWS across availability zones
Knowledge on design applications on AWS taking advantage of disaster recovery
Configured S3 versioning and lifecycle policies to and backup files and archive files in Glacier
Migrated media (images and videos) to s3 and used CloudFront to distribute content with low latency and at high data transfer rates
Used AWS CLI to automate backups of ephemeral data-stores to S3 buckets, EBS
Configured AWS Identity and Access Management (IAM) Groups and Users for improved login authentication
Created AWS Multi-Factor Authentication (MFA) for instance RDP/SSH logon, worked, with teams to lock down security groups
Installing, upgrading and configuring operating systems like Solaris, HP-UX and Linux
Supported various databases like Oracle, UDB and SQL Server on Unix/Linux servers
Experience in configuring and administering FTP, SAMBA, DNS and DHCP
Providing day-to-day user administration like adding, deleting, and modifying users and groups and solving their queries
Hands on experience in Amazon Web Services (AWS) provisioning and good knowledge of AWS services like EC2, S3, Elastic Beanstalk, ELB (Load Balancers), RDS, VPC, Direct Connect, Route53, Cloud Watch, Cloud Formation, IAM, SNS etc
Defined AWS Security Groups which acted as virtual firewalls that controlled the traffic allowed reaching one or more AWS EC2 instances
Launching and configuring of Amazon EC2 (AWS) Cloud Servers using AMI's (Linux/Ubuntu) and configuring the servers for specified applications
Experience in creating alarms and notifications for EC2 instances using Cloud Watch
Installed applications on AWS EC2 instances and configured storage on S3 buckets
Configured AWS Identity and Access Management (IAM) Groups and Users for improved login authentication
Implemented and maintained monitoring and alerting of production and corporate servers such as EC2 and storage such as S3 buckets using AWS Cloud Watch
Creating S3 buckets and managing policies for S3 buckets and Utilized S3 bucket and backup on AWS
Performed database SQL queries to address connectivity and integration activities
Setting up scalability for application servers using command line interface for Setting up and administering DNS system in AWS using Route53
Worked with all areas of Development teams to ensure the build and deployment process serves better quality for business
Coordinate/assist developers with establishing and applying appropriate branching, labeling/naming conventions using Git
Coordinate build schedules between development teams, Database Administrators and Network Operations while developing and improving build communication channels
To design high availability applications on AWS across availability zones and availability regions.
Education
Master of Science - Management Information Systems
Florida International University
Miami, FL
05.2016
Bachelor of Science - Computer Science
Karunya University
India
05.2014
Skills
Cloud infrastructure design
Access control management
Disaster recovery planning
Infrastructure as Code (Terraform)
Infrastructure Monitoring
CI/CD
Team Leadership
Cost Optimization
Certification
AWS Certified Solutions Architect
Timeline
Senior Cloud Engineer - Team Lead
Techsur Solutions
08.2023 - Current
Senior Site Reliability Engineer - SRE Lead
Latch
10.2022 - 08.2023
Cloud Service Engineer
Securonix
06.2022 - 10.2022
Cloud Operations Engineer
Clarivate Analytics
10.2017 - 05.2022
Cloud Engineer
Florida International University
09.2015 - 12.2016
System Engineer
Infosys Ltd
01.2014 - 08.2015
Bachelor of Science - Computer Science
Karunya University
Master of Science - Management Information Systems
Sr Software Engineer at P Square Toll Solutions India Pvt Ltd / Seeroo IT Solutions (P Square Solutions LLC – Contractor)Sr Software Engineer at P Square Toll Solutions India Pvt Ltd / Seeroo IT Solutions (P Square Solutions LLC – Contractor)
Cashier Team Member / Customer Service & E-Commerce Team Member at Whole Foods MarketCashier Team Member / Customer Service & E-Commerce Team Member at Whole Foods Market