Around 8 years of experience in Information Technology, I specialise in AWS and GCP Cloud DevOps, Infrastructure Engineering, Security, Site Reliability, and Software Configuration Management. My expertise lies in designing and architecting complex, scalable solutions on the AWS cloud platform, aligning with both business needs and industry best practices.
Overview
8
years of professional experience
1
year of post-secondary education
Work History
Verizon
Atlanta
Cloud SRE Engineer
02.2023 - Current
Job overview
Provided 24/7 on-call support for production environment led root cause analysis efforts during incidents
Designed and developed Terraform scripts to automate the provisioning and management of Kubernetes infrastructure across multi-cloud environments, including GCP and AWS
Utilised Go-Lang to extend Terraform functionalities, enabling more customisable and dynamic infrastructure setups
Worked on Google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud NOSQL
Stack driver monitoring and cloud deployment manager
Setup GCP Firewall rules to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency
Configured GCP GKE ISTIO to handle the Ingress and egress and Service Discovery for micro service PODS inter communication
Implemented RESTful APIs, micro services, and server less architectures, integrating seamlessly with front-end applications and third-party services
Worked on google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud SQL
Stack driver monitoring and cloud deployment manager
Migrating and Securing the GCP infrastructure using Private subnets, Security groups, NACL(VPC), WAF etc
And Experience in Migrating the Legacy application into GCP Platform
Created and maintained CI/CD pipelines using GitHub Actions, integrating self-hosted runners to handle complex build and deployment processes
Led the development of Kubernetes-based CI/CD solutions, incorporating GitOps principles to ensure consistency and security across deployments
Maintained and upgraded platform tools, ensuring consistent performance and adapting to evolving business needs
Implemented secure methodologies for managing secrets and credentials using HashiCorp Vault, significantly reducing security risks in cloud deployments
Collaborated with security teams to define and enforce security policies across cloud environments, ensuring compliance with industry standards
Managed Kubernetes workloads across multiple cloud providers (AWS, GCP) optimising resource utilisation and maintaining high availability
Developed custom integrations with tools like Slack and Jira, automating notifications and task management to improve operational efficiency
Migrated and managed critical Windows and Linux workloads onto AWS leveraging EC2, RDS, Route53, ELB and other AWS native services
Led migration of legacy Java monolith application to micro services hosted on Amazon ECS and AWS Lambda with Designed and implemented highly available Redshift cluster across multiple nodes and subnets
Provisioning, configuring, and managing AWS services such as EC2, S3, RDS, Lambda, etc
Using CloudWatch and other tools to monitor performance, optimise resource usage, and reduce costs
Configured Prometheus exporters to scrape metrics from applications, databases, and network devices
Set up alerting rules in Prometheus and Grafana to detect anomalies and performance issues in real-time
Developed and maintained Python scripts using Boto3 for automated provisioning, configuration, and management of AWS resources including EC2, S3, RDS, and Lambda, reducing manual intervention
Implemented infrastructure as code (IaC) using Terraform and Python to automate the deployment and scaling of cloud infrastructure, ensuring consistency and repeatability across environments
Created and managed Ansible play books and roles to automate server configuration, application deployment, and compliance enforcement, achieving a 50% reduction in deployment times
Configuring the AWS and GCP Firewall rules in Terraform script to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN to deliver content from GCP cache locations drastically
Implementing security best practices, managing IAM roles and policies, and ensuring compliance with industry standards
Writing and managing Ansible play-books for automated configuration of servers and services
Writing and managing Terraform configurations to provision and manage AWS infrastructure
Managing Terraform state files and ensuring they are stored securely
Using version control systems (like Git) to manage Terraform code and collaborate with team members
Setting up, configuring, and managing Kubernetes clusters, ensuring high availability and scalability
Using Kubernetes to deploy, manage, and scale applications in containers
Implementing monitoring and logging solutions for Kubernetes clusters to ensure application performance and reliability
Responding to incidents, troubleshooting issues, and performing root cause analysis
Working closely with Go lang and development teams to ensure smooth deployment and operation of applications
Continuously monitoring and optimise system performance and reliability
Designed CI/CD pipelines, automating build, test, scan, and deploy phases leveraging Code Pipeline and supporting multiple code repositories and deploy using Jenkins, GCP Services
Worked on Google cloud platform (GCP) cloud by provisioning compute engine, cloud load balancing, cloud storage, cloud SQL, Stack driver monitoring components using the Terraform GCP Foundation module
Automated Infrastructure provisioning, configuration management using Terraform, Ansible, CloudFormation
Worked closely with the Network team on VPC setup, subnet topology, security groups, and firewall rules
Worked on Key Rotations and Renewals, Vulnerabilities and managed Kubernetes nodes on Google Cloud Platform using Google Kubernetes Engine (GKE)
Configured, monitored and automated Google Cloud Services and involved in deploying the content cloud platform using Google Compute Engine, storage buckets
Created custom log metrics, monitoring and created charts, alerts using Stack driver in GCP
Integrated Cloud Armour with Stack driver Logging and Monitoring for comprehensive visibility into security events and alerting using Splunk other integration points
Created backup automation scripts in Python to backup critical databases to DB backups, infrastructure snapshots, log rotations, and instance refreshing.
R1RCM
Cloud | DevOps Engineer
03.2020 - 04.2022
Job overview
Spearheaded the creation of Infrastructure as Code (IaC) for Kubernetes environments using Terraform, enabling rapid deployment and scalability of healthcare applications
Developed Go-Lang based tools to streamline Kubernetes operations, including custom controllers and operators
Architected and implemented CI/CD pipelines with GitHub Actions, incorporating self-hosted runners to optimize build and deployment processes specific to healthcare applications
Integrated security best practices within CI/CD pipelines using HashiCorp Vault, ensuring sensitive data was securely handled during the build and deployment processes
Deployed and manage Kubernetes clusters across AWS, GCP, and Azure, ensuring consistent performance and security across all cloud platforms
Utilized GitOps workflows to manage Kubernetes deployments, enhancing the traceability and reliability of infrastructure changes
Leveraged HashiCorp Vault to securely manage and rotate credentials, minimizing the risk of security breaches in healthcare cloud environments
Configured AWS Multi Factor Authentication in IAM to implement 2 step authentication of user's access using Google Authenticator and AWS Virtual MFA
Implemented automated testing and validation processes for Terraform code using tools like Terra-test or kitchen-terraform, guaranteeing the reliability of infrastructure changes
Created detailed documentation and conducted training sessions to empower team members in effectively using Ansible for automation and configuration management
Implemented tagging strategies and standards through CloudFormation templates for asset management
Experience in Google cloud platform (GCP) cloud by provisioning compute engine, cloud load balancing, cloud storage, cloud SQL, Stack driver monitoring components using the Terraform GCP Foundation module
Specialized in provisioning the GCP Global HTTPS load balancer routes the traffic to GCP GKE cluster via Terraform modules
Experience in configuring the GCP Firewall rules in Terraform script to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically
Configured, monitored and automated Google Cloud Services and involved in deploying the content cloud platform using Google compute engine, storage buckets
Created custom log metrics, monitoring and created charts, alerts using Stack driver in GCP
Extensive knowledge of using Data dog to track and analyze infrastructure data, application health, and system performance across azure environments
Performed analytics on CloudFormation stack usage, costs and performance to identify optimization opportunities
Developed and Automated the Tests to validate the correctness of the Servers Configuration
Expressed in YAML Code and Used GIT to update the playbooks to the GIT repository
Design AWS Cloud Formation templates to create custom sized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates
Worked on creating Kubernetes cluster using KUBECTL
Worked on creating remote builds for Namespace using Terraform
Automating the Build Infrastructure for deploying services in a dockside environment using Jenkins, SonarQube,Maven, Groovy, Docker and Splunk.
Cognizant
Cloud | DevOps Engineer
04.2018 - 02.2020
Job overview
Worked on AWS Services on EC2
VPC, ELB, S3, EBS, Elastic Beanstalk, AMI, SNS, RDS and maintained the user accounts IAM, RDS, Route 53, VPC, RDB, Dynamo DB, SES, SQS and SNS services in AWS cloud
Implemented custom monitoring solutions using Python to query AWS CloudWatch metrics and set up alerts for critical events, ensuring proactive detection and resolution of issues
Developed Python-based tools for managing Kubernetes clusters, including automated deployment, scaling, and monitoring of containerized applications
Led the design and implementation of AWS cloud architectures for various clients, aligning solutions with their business objectives and technical requirements
Implemented a CI/CD pipeline with Docker, Jenkins, GitHub and AWS AMI's, whenever a new GitHub branch gets started, Jenkins, our Continuous Integration server, automatically attempts to build a new Docker container from it
Implemented CI/CD pipelines in Jenkins and GitHub Actions to automate build, test, and deploy of Java and Python applications
Reduced deployment time from hours to minutes
Designed and deployed EKS clusters, managing nodes using Auto Scaling Groups
Ensured high availability of critical workloads
Wrote Infrastructure as Code templates in Terraform to provision AWS resources like EC2, VPC, RDS, and Elastic Cache
Containerized monolithic apps using Docker and orchestrated containers using ECS and Kubernetes
Realized improved resource utilization
Set up end-to-end monitoring using Prometheus,, CloudWatch, and Sentry
Improved incident response times
Led migration of legacy systems to cloud-native technologies, reducing costs and improving scalability
Leading migration of AWS infrastructure from CloudFormation to Terraform to enable full lifecycle management and multi-region deployment capabilities
Utilized Terraform to provision core AWS services like EC2, RDS, ELB, Route53 as code for the company's high traffic e-commerce platform
Implemented secure secret management in Terraform using AWS Secrets Manager to store sensitive data like database passwords
Configured, Maintained, and customized Ticketing tool & Monitoring tool such as JIRA and Confluence as per requirements to promote team coordination and ticketing system.
Infinx Healthcare
Linux| DevOps Engineer
08.2016 - 03.2018
Job overview
Responsible for Day-to-day Build and deployments in Dev, Test, pre-production and production environments
Implemented AWS high availability using AWS Elastic Load Balancing, which performed balance across instances in multiple availability zones
Created Python scripts to extract, transform, and load (ETL) data from various sources, generating insightful reports and dashboards for stakeholders, aiding in informed decision-making
Created AWS Multi-Factor Authentication (MFA) for instance RDP/SSH logon, worked with teams to lockdown security groups
Upgraded infrastructure provisioning from CloudFormation to Terraform for the company's flagship mobile app with 10+ million users
Implemented infrastructure compliance scanning and remediation workflows using tools like AWS Config and Security Hub
Setup CI/CD pipeline for ECS deployments utilizing Code Pipeline, ECR, and Code Deploy enabling faster release cycles
Developed reusable Terraform configurations for Lambda functions following standards and promoting collaboration between teams
Reduced infrastructure costs by 30% through EC2 reserved instance purchase optimization and instance scheduling using AWS tools
Maintained 10 Linux Servers which provided security and services for other Windows machines on the network such as: DNS, NAT, Anti-Virus/Spam, and Firewall via iptables
Managed backups for Windows, Linux, PACS, and File Servers to EMC Clarion SAN device
Experience with package management tools such as Yum and RPM to install, update, and manage software packages on CentOS 7 systems
With kernel tuning and system performance optimization to ensure maximum system performance
Features such as SE Linux, firewalls, and encryption, and experience implementing security best practices
Expertise in shell scripting and automation tools such as Ansible to streamline system administration tasks
Experience with upgrading ESX 3.0 to ESX 3.5 also using VMware update manager to install patches and updates on ESX host and virtual machines
Creating and Managing Virtual Machines in Windows Azure and setting up communication with the help of Endpoints and also VM Migrations from Transitional hosts on Canada Boxes
Updating the instance counts in the production for the services of Red box in the Production Azure Subscription
Good experience in Middleware Infrastructure services & Good experience in installing, configuring and managing Apache, Tomcat, WebLogic
Knowledge of EMC2 SAN, NAS Installation, Management, Load balancing, Clustering and RAID 0, 1, 5, 10 level configurations
Managed backups for Windows, Linux, PACS, and File Servers to EMC Clarion SAN device
Designed Network Security Groups (NSGs) & Load Balancer to control inbound and outbound access to network and subnets
Proficiently implemented Data-dog for monitoring and observability, setting up agents and integrations across the entire infrastructure stack
Integrated Ansible into CI/CD pipelines, automating application deployments and infrastructure changes as part of a streamlined development process.
Management & Infrastructure Tools: Red Hat Satellite, Rancher Management Server, Web Servers & Proxies: NGINX, Squid Proxy and APIs & Gateways: APIM, Front Door
Integrated Grafana, Prometheus, and APM tools with other monitoring and incident management systems.
Created complex PromQL queries to extract meaningful insights from Prometheus metrics
Experience in setup and build GCP infrastructure using resources Shared service VPC model, Compute engine, Cloud Storage, Cloud SQL, IAM using the Terraform foundation modules.
Implemented a CI/CD pipeline using Azure DevOps (VSTS, TFS) in both cloud and on-premises with GIT, MS Build, Docker, Maven along with Jenkins's plugins.
Hands on experience on Backup and restore Azure services and in Design and configure Azure Virtual Networks (subnets, Azure network settings, DHCP address blocks, DNS settings, security policies and routing. Azure cloud services, Blob storage, Active directory, Azure Service Bus, Cosmos DB.
Expertise in Architecting and Implementing Azure Service Offering, such as Azure cloud services, storage, IIS, Azure Active Directory, Resource Manager (ARM), Storage, Blob Storage, VMs, SQL Database, Functions, Service Fabric, Azure Monitor, and Service Bus.
Expertise in using various DevOps tools like Git, Jenkins, Ansible, Gradle, Artifactory, Docker, Kubernetes, Terraform and Skilled in cloud computing platforms like AWS, AZURE and GCP.
Knowledge of Azure Site Recovery and Azure Backup Installed and Configured the Azure Backup agent and virtual machine backup, Enabled Azure Virtual machine backup from the Vault and configured the Azure Site Recovery (ASR).
I have hands-on experience with a wide range of AWS services, including EC2, VPC, ELB, S3, RDS, Lambda, and CloudFormation, among others. My background includes transitioning infrastructure provisioning from CloudFormation to Terraform, optimising AWS infrastructure costs, and deploying and managing Kubernetes clusters on EKS using Terraform.
I have led multiple initiatives focused on infrastructure-as-code, Go-Lang, CI/CD pipeline development, and the automation of cloud infrastructure using tools like Git, Jenkins, Ansible, Docker, and Kubernetes
My technical skills are complemented by strong capabilities in scripting (Python,, Groovy) and a solid understanding of both Linux/Unix and Windows systems.
I am well-versed in optimising and automating cloud operations, making me a strong candidate for an AWS Cloud Engineer role, where my ability to deliver secure, scalable, and efficient cloud solutions can contribute significantly to organisational success.
Experience in configuring the Cloud Firewall rules to allow the inbound traffic to GCP GKE cluster.
Deployed the Micro services with Spring Boot on GCP/GKE Kubernetes, Cluster configured with master nodes and worker nodes.
Experience in managed Kubernetes services such as Google Kubernetes engine (GKE), Azure Kubernetes and Strong Configured Apigee Adapter and ISTIO mesh for our applications on GKE.
Set up a GCP Firewall rules in order to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency..