Summary
Overview
Work History
Education
Skills
Additional Information
Timeline
Hi, I’m

Raju Sankula

Cloud Engineer
Atlanta,GA
Raju Sankula

Summary

Around 8 years of experience in Information Technology, I specialise in AWS and GCP Cloud DevOps, Infrastructure Engineering, Security, Site Reliability, and Software Configuration Management. My expertise lies in designing and architecting complex, scalable solutions on the AWS cloud platform, aligning with both business needs and industry best practices.

Overview

8
years of professional experience
1
year of post-secondary education

Work History

Verizon
Atlanta

Cloud SRE Engineer
02.2023 - Current

Job overview

  • Provided 24/7 on-call support for production environment led root cause analysis efforts during incidents
  • Designed and developed Terraform scripts to automate the provisioning and management of Kubernetes infrastructure across multi-cloud environments, including GCP and AWS
  • Utilised Go-Lang to extend Terraform functionalities, enabling more customisable and dynamic infrastructure setups
  • Worked on Google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud NOSQL
  • Stack driver monitoring and cloud deployment manager
  • Setup GCP Firewall rules to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency
  • Configured GCP GKE ISTIO to handle the Ingress and egress and Service Discovery for micro service PODS inter communication
  • Implemented RESTful APIs, micro services, and server less architectures, integrating seamlessly with front-end applications and third-party services
  • Worked on google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud SQL
  • Stack driver monitoring and cloud deployment manager
  • Migrating and Securing the GCP infrastructure using Private subnets, Security groups, NACL(VPC), WAF etc
  • And Experience in Migrating the Legacy application into GCP Platform
  • Created and maintained CI/CD pipelines using GitHub Actions, integrating self-hosted runners to handle complex build and deployment processes
  • Led the development of Kubernetes-based CI/CD solutions, incorporating GitOps principles to ensure consistency and security across deployments
  • Maintained and upgraded platform tools, ensuring consistent performance and adapting to evolving business needs
  • Implemented secure methodologies for managing secrets and credentials using HashiCorp Vault, significantly reducing security risks in cloud deployments
  • Collaborated with security teams to define and enforce security policies across cloud environments, ensuring compliance with industry standards
  • Managed Kubernetes workloads across multiple cloud providers (AWS, GCP) optimising resource utilisation and maintaining high availability
  • Developed custom integrations with tools like Slack and Jira, automating notifications and task management to improve operational efficiency
  • Migrated and managed critical Windows and Linux workloads onto AWS leveraging EC2, RDS, Route53, ELB and other AWS native services
  • Led migration of legacy Java monolith application to micro services hosted on Amazon ECS and AWS Lambda with Designed and implemented highly available Redshift cluster across multiple nodes and subnets
  • Provisioning, configuring, and managing AWS services such as EC2, S3, RDS, Lambda, etc
  • Using CloudWatch and other tools to monitor performance, optimise resource usage, and reduce costs
  • Configured Prometheus exporters to scrape metrics from applications, databases, and network devices
  • Set up alerting rules in Prometheus and Grafana to detect anomalies and performance issues in real-time
  • Developed and maintained Python scripts using Boto3 for automated provisioning, configuration, and management of AWS resources including EC2, S3, RDS, and Lambda, reducing manual intervention
  • Implemented infrastructure as code (IaC) using Terraform and Python to automate the deployment and scaling of cloud infrastructure, ensuring consistency and repeatability across environments
  • Created and managed Ansible play books and roles to automate server configuration, application deployment, and compliance enforcement, achieving a 50% reduction in deployment times
  • Configuring the AWS and GCP Firewall rules in Terraform script to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN to deliver content from GCP cache locations drastically
  • Implementing security best practices, managing IAM roles and policies, and ensuring compliance with industry standards
  • Writing and managing Ansible play-books for automated configuration of servers and services
  • Writing and managing Terraform configurations to provision and manage AWS infrastructure
  • Managing Terraform state files and ensuring they are stored securely
  • Using version control systems (like Git) to manage Terraform code and collaborate with team members
  • Setting up, configuring, and managing Kubernetes clusters, ensuring high availability and scalability
  • Using Kubernetes to deploy, manage, and scale applications in containers
  • Implementing monitoring and logging solutions for Kubernetes clusters to ensure application performance and reliability
  • Responding to incidents, troubleshooting issues, and performing root cause analysis
  • Working closely with Go lang and development teams to ensure smooth deployment and operation of applications
  • Continuously monitoring and optimise system performance and reliability
  • Designed CI/CD pipelines, automating build, test, scan, and deploy phases leveraging Code Pipeline and supporting multiple code repositories and deploy using Jenkins, GCP Services
  • Worked on Google cloud platform (GCP) cloud by provisioning compute engine, cloud load balancing, cloud storage, cloud SQL, Stack driver monitoring components using the Terraform GCP Foundation module
  • Automated Infrastructure provisioning, configuration management using Terraform, Ansible, CloudFormation
  • Worked closely with the Network team on VPC setup, subnet topology, security groups, and firewall rules
  • Worked on Key Rotations and Renewals, Vulnerabilities and managed Kubernetes nodes on Google Cloud Platform using Google Kubernetes Engine (GKE)
  • Configured, monitored and automated Google Cloud Services and involved in deploying the content cloud platform using Google Compute Engine, storage buckets
  • Created custom log metrics, monitoring and created charts, alerts using Stack driver in GCP
  • Integrated Cloud Armour with Stack driver Logging and Monitoring for comprehensive visibility into security events and alerting using Splunk other integration points
  • Created backup automation scripts in Python to backup critical databases to DB backups, infrastructure snapshots, log rotations, and instance refreshing.

R1RCM

Cloud | DevOps Engineer
03.2020 - 04.2022

Job overview

  • Spearheaded the creation of Infrastructure as Code (IaC) for Kubernetes environments using Terraform, enabling rapid deployment and scalability of healthcare applications
  • Developed Go-Lang based tools to streamline Kubernetes operations, including custom controllers and operators
  • Architected and implemented CI/CD pipelines with GitHub Actions, incorporating self-hosted runners to optimize build and deployment processes specific to healthcare applications
  • Integrated security best practices within CI/CD pipelines using HashiCorp Vault, ensuring sensitive data was securely handled during the build and deployment processes
  • Deployed and manage Kubernetes clusters across AWS, GCP, and Azure, ensuring consistent performance and security across all cloud platforms
  • Utilized GitOps workflows to manage Kubernetes deployments, enhancing the traceability and reliability of infrastructure changes
  • Leveraged HashiCorp Vault to securely manage and rotate credentials, minimizing the risk of security breaches in healthcare cloud environments
  • Configured AWS Multi Factor Authentication in IAM to implement 2 step authentication of user's access using Google Authenticator and AWS Virtual MFA
  • Implemented automated testing and validation processes for Terraform code using tools like Terra-test or kitchen-terraform, guaranteeing the reliability of infrastructure changes
  • Created detailed documentation and conducted training sessions to empower team members in effectively using Ansible for automation and configuration management
  • Implemented tagging strategies and standards through CloudFormation templates for asset management
  • Experience in Google cloud platform (GCP) cloud by provisioning compute engine, cloud load balancing, cloud storage, cloud SQL, Stack driver monitoring components using the Terraform GCP Foundation module
  • Specialized in provisioning the GCP Global HTTPS load balancer routes the traffic to GCP GKE cluster via Terraform modules
  • Experience in configuring the GCP Firewall rules in Terraform script to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically
  • Configured, monitored and automated Google Cloud Services and involved in deploying the content cloud platform using Google compute engine, storage buckets
  • Created custom log metrics, monitoring and created charts, alerts using Stack driver in GCP
  • Extensive knowledge of using Data dog to track and analyze infrastructure data, application health, and system performance across azure environments
  • Performed analytics on CloudFormation stack usage, costs and performance to identify optimization opportunities
  • Developed and Automated the Tests to validate the correctness of the Servers Configuration
  • Expressed in YAML Code and Used GIT to update the playbooks to the GIT repository
  • Design AWS Cloud Formation templates to create custom sized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates
  • Worked on creating Kubernetes cluster using KUBECTL
  • Worked on creating remote builds for Namespace using Terraform
  • Automating the Build Infrastructure for deploying services in a dockside environment using Jenkins, SonarQube,Maven, Groovy, Docker and Splunk.

Cognizant

Cloud | DevOps Engineer
04.2018 - 02.2020

Job overview

  • Worked on AWS Services on EC2
  • VPC, ELB, S3, EBS, Elastic Beanstalk, AMI, SNS, RDS and maintained the user accounts IAM, RDS, Route 53, VPC, RDB, Dynamo DB, SES, SQS and SNS services in AWS cloud
  • Implemented custom monitoring solutions using Python to query AWS CloudWatch metrics and set up alerts for critical events, ensuring proactive detection and resolution of issues
  • Developed Python-based tools for managing Kubernetes clusters, including automated deployment, scaling, and monitoring of containerized applications
  • Led the design and implementation of AWS cloud architectures for various clients, aligning solutions with their business objectives and technical requirements
  • Implemented a CI/CD pipeline with Docker, Jenkins, GitHub and AWS AMI's, whenever a new GitHub branch gets started, Jenkins, our Continuous Integration server, automatically attempts to build a new Docker container from it
  • Implemented CI/CD pipelines in Jenkins and GitHub Actions to automate build, test, and deploy of Java and Python applications
  • Reduced deployment time from hours to minutes
  • Designed and deployed EKS clusters, managing nodes using Auto Scaling Groups
  • Ensured high availability of critical workloads
  • Wrote Infrastructure as Code templates in Terraform to provision AWS resources like EC2, VPC, RDS, and Elastic Cache
  • Containerized monolithic apps using Docker and orchestrated containers using ECS and Kubernetes
  • Realized improved resource utilization
  • Set up end-to-end monitoring using Prometheus,, CloudWatch, and Sentry
  • Improved incident response times
  • Led migration of legacy systems to cloud-native technologies, reducing costs and improving scalability
  • Leading migration of AWS infrastructure from CloudFormation to Terraform to enable full lifecycle management and multi-region deployment capabilities
  • Utilized Terraform to provision core AWS services like EC2, RDS, ELB, Route53 as code for the company's high traffic e-commerce platform
  • Implemented secure secret management in Terraform using AWS Secrets Manager to store sensitive data like database passwords
  • Configured, Maintained, and customized Ticketing tool & Monitoring tool such as JIRA and Confluence as per requirements to promote team coordination and ticketing system.

Infinx Healthcare

Linux| DevOps Engineer
08.2016 - 03.2018

Job overview

  • Responsible for Day-to-day Build and deployments in Dev, Test, pre-production and production environments
  • Implemented AWS high availability using AWS Elastic Load Balancing, which performed balance across instances in multiple availability zones
  • Created Python scripts to extract, transform, and load (ETL) data from various sources, generating insightful reports and dashboards for stakeholders, aiding in informed decision-making
  • Created AWS Multi-Factor Authentication (MFA) for instance RDP/SSH logon, worked with teams to lockdown security groups
  • Upgraded infrastructure provisioning from CloudFormation to Terraform for the company's flagship mobile app with 10+ million users
  • Implemented infrastructure compliance scanning and remediation workflows using tools like AWS Config and Security Hub
  • Setup CI/CD pipeline for ECS deployments utilizing Code Pipeline, ECR, and Code Deploy enabling faster release cycles
  • Developed reusable Terraform configurations for Lambda functions following standards and promoting collaboration between teams
  • Reduced infrastructure costs by 30% through EC2 reserved instance purchase optimization and instance scheduling using AWS tools
  • Maintained 10 Linux Servers which provided security and services for other Windows machines on the network such as: DNS, NAT, Anti-Virus/Spam, and Firewall via iptables
  • Managed backups for Windows, Linux, PACS, and File Servers to EMC Clarion SAN device
  • Experience with package management tools such as Yum and RPM to install, update, and manage software packages on CentOS 7 systems
  • With kernel tuning and system performance optimization to ensure maximum system performance
  • Features such as SE Linux, firewalls, and encryption, and experience implementing security best practices
  • Expertise in shell scripting and automation tools such as Ansible to streamline system administration tasks
  • Experience with upgrading ESX 3.0 to ESX 3.5 also using VMware update manager to install patches and updates on ESX host and virtual machines
  • Creating and Managing Virtual Machines in Windows Azure and setting up communication with the help of Endpoints and also VM Migrations from Transitional hosts on Canada Boxes
  • Updating the instance counts in the production for the services of Red box in the Production Azure Subscription
  • Good experience in Middleware Infrastructure services & Good experience in installing, configuring and managing Apache, Tomcat, WebLogic
  • Knowledge of EMC2 SAN, NAS Installation, Management, Load balancing, Clustering and RAID 0, 1, 5, 10 level configurations
  • Managed backups for Windows, Linux, PACS, and File Servers to EMC Clarion SAN device
  • Designed Network Security Groups (NSGs) & Load Balancer to control inbound and outbound access to network and subnets
  • Proficiently implemented Data-dog for monitoring and observability, setting up agents and integrations across the entire infrastructure stack
  • Integrated Ansible into CI/CD pipelines, automating application deployments and infrastructure changes as part of a streamlined development process.

Education

Trine University
Angola, IN

Master’s from information system
2022.05 - 2023.07 (1 education.year_Label & 2 education.months_Label)

Anurag University

Bachelors from Electronic and Communication

Certified cloud : GCP Certification. Certified AZURE: Azure infrastructure certification

Skills

  • Technical Skills
  • Platform and cloud Environment:
  • Cloud Providers:
  • AWS, Azure, GCP
  • Containerization & Orchestration: Docker, Kubernetes (EKS, ECS, AKS, GKS, RKE, K3s)
  • Operating Systems: RHEL, SUSE, Android, iOS
  • Automation & DevOps Tools:
  • Automation Frameworks: Ansible Automation Platform, AWX, Chef (Cookbooks, DSL), Puppet, Terraform
  • CI/CD & Monitoring: Jenkins Pipeline, GitHub Actions, GitLab, Bitbucket, Dynatrace, ArgoCD
  • Version Control & Repositories: Git, Artifactory, Vagrant, Fleet, HashiCorp Vault
  • Project Management & Collaboration: Jira, Confluence, VersionOne, HP Service Manager
  • Programming & Scripting Languages: Ruby, Python, Bash, PowerShell, GO, HCL
  • Markup & Configuration: YAML, JSON
  • Software & Tools:
  • Management & Infrastructure Tools: Red Hat Satellite, Rancher Management Server, Web Servers & Proxies: NGINX, Squid Proxy and APIs & Gateways: APIM, Front Door
  • Hardware & Networking:
  • Networking Protocols:
  • TCP/IP, UDP, DNS, WAN, VPN, TLS, mTLS
  • Security & Access: Firewalls, SSH, Telnet, RDP, WAF, VPN Gateway, VNET Peering, NIC, NIC Teaming
  • Networking Tools & Technologies:
  • PuTTY, Cisco IOS, Wireshark, Mainframe

Additional Information

  • Integrated Grafana, Prometheus, and APM tools with other monitoring and incident management systems. Created complex PromQL queries to extract meaningful insights from Prometheus metrics Experience in setup and build GCP infrastructure using resources Shared service VPC model, Compute engine, Cloud Storage, Cloud SQL, IAM using the Terraform foundation modules. Implemented a CI/CD pipeline using Azure DevOps (VSTS, TFS) in both cloud and on-premises with GIT, MS Build, Docker, Maven along with Jenkins's plugins. Hands on experience on Backup and restore Azure services and in Design and configure Azure Virtual Networks (subnets, Azure network settings, DHCP address blocks, DNS settings, security policies and routing. Azure cloud services, Blob storage, Active directory, Azure Service Bus, Cosmos DB. Expertise in Architecting and Implementing Azure Service Offering, such as Azure cloud services, storage, IIS, Azure Active Directory, Resource Manager (ARM), Storage, Blob Storage, VMs, SQL Database, Functions, Service Fabric, Azure Monitor, and Service Bus. Expertise in using various DevOps tools like Git, Jenkins, Ansible, Gradle, Artifactory, Docker, Kubernetes, Terraform and Skilled in cloud computing platforms like AWS, AZURE and GCP. Knowledge of Azure Site Recovery and Azure Backup Installed and Configured the Azure Backup agent and virtual machine backup, Enabled Azure Virtual machine backup from the Vault and configured the Azure Site Recovery (ASR). I have hands-on experience with a wide range of AWS services, including EC2, VPC, ELB, S3, RDS, Lambda, and CloudFormation, among others. My background includes transitioning infrastructure provisioning from CloudFormation to Terraform, optimising AWS infrastructure costs, and deploying and managing Kubernetes clusters on EKS using Terraform. I have led multiple initiatives focused on infrastructure-as-code, Go-Lang, CI/CD pipeline development, and the automation of cloud infrastructure using tools like Git, Jenkins, Ansible, Docker, and Kubernetes My technical skills are complemented by strong capabilities in scripting (Python,, Groovy) and a solid understanding of both Linux/Unix and Windows systems. I am well-versed in optimising and automating cloud operations, making me a strong candidate for an AWS Cloud Engineer role, where my ability to deliver secure, scalable, and efficient cloud solutions can contribute significantly to organisational success. Experience in configuring the Cloud Firewall rules to allow the inbound traffic to GCP GKE cluster. Deployed the Micro services with Spring Boot on GCP/GKE Kubernetes, Cluster configured with master nodes and worker nodes. Experience in managed Kubernetes services such as Google Kubernetes engine (GKE), Azure Kubernetes and Strong Configured Apigee Adapter and ISTIO mesh for our applications on GKE. Set up a GCP Firewall rules in order to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency..

Timeline

Cloud SRE Engineer

Verizon
02.2023 - Current

Trine University

Master’s from information system
2022.05 - 2023.07 (1 education.year_Label & 2 education.months_Label)

Cloud | DevOps Engineer

R1RCM
03.2020 - 04.2022

Cloud | DevOps Engineer

Cognizant
04.2018 - 02.2020

Linux| DevOps Engineer

Infinx Healthcare
08.2016 - 03.2018

Anurag University

Bachelors from Electronic and Communication

Certified cloud : GCP Certification. Certified AZURE: Azure infrastructure certification
Raju SankulaCloud Engineer