Venkata Kolluri

Sr. Site Reliability Engineer
Atlanta, USA

Summary

Site Reliability Engineer with 9+ years of experience in the Information Technology industry, specializing in cloud infrastructure, CI/CD automation, and production support. Extensive experience with Google Cloud Platform (GCP) and AWS, including infrastructure automation, IAM management, and Kubernetes-based deployments. Proven expertise in build and release management, GitOps workflows, and containerization using Docker. Skilled in implementing monitoring and observability solutions with Prometheus, Grafana, OpenTelemetry, Tempo, and Promtail to enable proactive issue detection and reduce MTTR. Adept at maintaining high system availability, optimizing deployment pipelines, and collaborating with cross-functional teams to deliver scalable, reliable solutions.

Overview

10 years of professional experience
1 Certification

Work History

Site Reliability Engineer

Equifax
Atlanta, GA
11.2019 - Current
  • Architected, deployed, and managed scalable cloud infrastructure on Google Cloud Platform (GCP), leveraging GKE, Compute Engine, IAM, VPC, BigQuery, Load Balancers, Secrets, ConfigMaps, and Monitoring services.
  • Provisioned and managed Kubernetes clusters using Terraform, creating reusable common modules to support standardized multi-environment deployments (Dev, Int, UAT, Prod).
  • Developed and integrated bootstrap automation scripts (ArgoCD, Anthos Service Mesh, Twistlock) into Terraform modules for post-cluster configuration.
  • Automated infrastructure provisioning and application deployments using Jenkins CI/CD pipelines, ensuring consistent and reliable cloud deployments.
  • Implemented GitOps workflows using Anthos Config Management, enforcing Kubernetes desired state, and automating virtual service deployments.
  • Designed and maintained FedRAMP-compliant infrastructure, ensuring security, governance, and operational stability for regulated environments.
  • Orchestrated deployment, scaling, and lifecycle management of containerized workloads using Kubernetes and Docker.
  • Configured NGINX Ingress Controllers, dynamic routing, and load balancing to support high-traffic, highly available applications.
  • Enhanced observability, scalability, and resilience for microservices and API gateways through implementation of rate limiting, autoscaling, caching, backoff strategies, and circuit breakers.
  • Built end-to-end telemetry pipelines using OpenTelemetry, Google Managed Prometheus, Grafana, Tempo, and Datadog, enabling metric, log, and trace correlation, and reducing MTTR.
  • Led the migration from open-source Prometheus to Google Managed Prometheus (GMP), and integrated Grafana dashboards for real-time metrics visualization.
  • Created and managed Grafana and Datadog dashboards using Terraform, tracking performance, SLOs, SLIs, and infrastructure health.
  • Deployed Datadog agents via Helm, enabled APM metrics, and visualized application performance across Kubernetes workloads.
  • Troubleshot P1 production incidents, including Kubernetes, networking, monitoring, and distributed systems issues across L4/L7 load balancing, DNS, TLS, and HTTP(S).
  • Debugged containerized Java and Golang applications running on GKE using Linux and TCP/IP tools such as tcpdump, curl, and OpenSSL.
  • Implemented security controls using Istio, OPA, Binary Authorization, and cloud security tools, including Twistlock, C3M, and Venafi.
  • Designed and implemented backup and disaster recovery strategies for infrastructure and GKE workloads.
  • Managed networking and high-availability architectures, including VPCs, subnets, ingress, multi-zone/multi-region deployments, and database replication.
  • Developed automation and reliability tooling using Go, Python, and Shell, along with runbooks and operational documentation.
  • Supported data workflows using Cloud Composer to create and manage data pipeline jobs.
  • Participated in on-call rotations and led incident bridges and postmortems.
  • Collaborated closely with the application, networking, security, QA, and operations teams to support production releases and live events.
  • Managed source control and release strategies using Git (GitHub/Bitbucket), implementing branching, tagging, and versioning standards.
  • Drove end-to-end CI/CD automation using Jenkins, including build, test, validation, and deployment pipelines.
  • Engaged with business stakeholders to understand requirements and translate them into scalable, reliable, cloud-native solutions.

DevOps Engineer

T-Mobile Inc
Atlanta, GA
12.2018 - 11.2019
  • Developed and implemented Software Release Management strategies for various applications according to the agile process.
  • Implemented AWS Cloud platform features including EC2, VPC, EBS, AMI, SNS, RDS, CloudWatch, CloudTrail, CloudFormation, Auto Scaling, CloudFront, IAM, S3, and Route 53 to enhance cloud infrastructure.
  • Created Kubernetes clusters supporting DEV, TEST, and PROD environments.
  • Used Kubernetes to orchestrate the deployment, scaling, and management of Docker Containers.
  • Configured EC2 instances, upgraded and resized clusters, set up cluster monitoring and alerting, and managed multi-zone/multi-region availability.
  • Created VPCs and managed networking, load balancers, ports, and cluster ingress, along with database reliability, replication, and availability.
  • Managed branching, tagging, and versioning across environments using SCM tools such as Git (GitHub/Bitbucket) on Linux and Windows platforms.
  • Configured Jenkins master/slave architecture to enable parallel builds and streamline Continuous Integration (CI) and Continuous Delivery (CD) processes.
  • Involved in all projects that moved into production and worked closely with the Data Center, Development, Quality Assurance and Management teams to ensure cross communication and confirmed approvals for all production changes.
  • Developed CI/CD pipelines for Order Management applications that automated builds and deployments to the cloud.
  • Implemented infrastructure automation through Terraform for auto-provisioning, code deployments, software installation, and configuration updates.
  • Hands-on experience designing, deploying, and managing solutions across Azure Cloud Services, Azure App Services, Azure SQL, Azure Functions, Azure Storage, Azure Virtual Networks (VNet), Subnets, NSGs, Load Balancers, and Azure Active Directory.
  • Proficient in implementing Azure IaaS and PaaS solutions, ensuring high availability, resiliency, security, and scalability.
  • Expertise in migrating on-premises applications to Azure using lift-and-shift and re-architecture strategies.
  • Strong knowledge of Azure Availability Sets, Availability Zones, and VM scale sets for fault-tolerant infrastructure.
  • Configured Azure Backup solutions for efficient disaster recovery and monitoring processes.
  • Configured and managed Azure Backup, Azure Recovery Services Vault, and Azure Site Recovery (ASR) for end-to-end DR solutions.
  • Experience configuring ASR agents, replication policies, failover/failback procedures, and recovery plans.
  • Implemented Azure Monitor, Log Analytics, and Application Insights to establish proactive monitoring and alerting systems.
  • Scheduled, deployed, and managed container replicas on a node cluster using Kubernetes.
  • Automated deployments and day-to-day activities using Jenkins and shell scripts.
  • Created and monitored Splunk queries and dashboards.
  • Deployed and ran applications on Pivotal Cloud Foundry through Jenkins.
  • Configured Jenkins to run unit tests and generate graphical code coverage and test reports for all projects.
  • Proficient in Azure Resource Manager (ARM) Templates, Terraform, and Bicep to automate resource deployment at scale.
  • Implemented infrastructure pipelines for deploying VNets, Subnets, NSGs, Storage Accounts, Key Vaults, and Virtual Machines.
  • Configured end-to-end network architectures including VNets, Subnets, Route Tables, VNet Peering, VPN Gateways, Load Balancers, Private Endpoints, and DNS Zones.
  • Secured Azure workloads through Application Security Groups (ASGs), NSGs, Firewall rules, and RBAC policies.
  • Built CI/CD pipelines using Azure DevOps (Repos, Pipelines, Boards, Artifacts).
  • Implemented automated build and release pipelines for .NET, Java, and containerized applications.
  • Integrated Git, GitHub Actions, Jenkins, and Azure DevOps for automated build/test/deploy workflows.
  • Used Maven, Gradle, and Jenkins plugins for packaging and deployment.
  • Configured secure access using Azure AD, RBAC, Conditional Access, and managed identities.
  • Implemented security best practices for storage accounts, VNets, and application services.
  • Managed Key Vault for secrets, certificates, and key management.
  • Added alerts for pod restarts, unscheduled pods, and CPU utilization for HPA, and integrated them with Slack for alert notifications.
  • Used Git version control to maintain the history of source code and project documents.
  • Worked with the Agile development team to develop continuous integration/continuous delivery processes in an open-source environment.

DevOps Engineer

American Cancer Society
Atlanta, GA
05.2016 - 12.2018
  • Implemented CI/CD pipelines to automate infrastructure provisioning and software delivery.
  • Studied the application's business logic using Business Requirements Specification documents and its functionality using Functional Requirements Specification documents.
  • Maintained continuous test integration and automated builds with Jenkins, sharing build outputs with team members for timely feedback.
  • Utilized Jenkins plugins to streamline end-to-end CI/CD processes across multiple projects, enhancing deployment efficiency.
  • Used Jenkins as the continuous integration tool to automate daily processes.
  • Introduced pull request strategies for the development team.
  • Involved in Continuous Integration (CI) and Continuous Delivery (CD) process implementation using Jenkins along with Shell scripts to automate routine jobs.
  • Used SQL queries to validate data and updated the records for the various modules.
  • Proposed triggers on insert and update events to capture data, and defined constraints, rules, and defaults to ensure data and relational integrity.
  • Worked hands-on with Docker: taking container snapshots, attaching to running containers, removing images, and managing containers.
  • Installed Docker on virtual machines to facilitate application development, testing, and deployment using containerization.
  • Resolved update and merge issues in Jenkins and JIRA to maintain operational continuity.
  • Automated the front-end platform into highly scalable, consistent, repeatable infrastructure using Chef, Jenkins, and CloudFormation.

Education

Master of Science - Computer Science and Information Technology

University of Michigan
USA
04-2016

Bachelor's - Computer Science

JNTU
India

Skills

  • Cloud services expertise
  • AWS, Azure, and GCP proficiency
  • Container orchestration with Kubernetes
  • Infrastructure automation techniques
  • Performance optimization strategies
  • Team leadership skills
  • Monitoring solutions implementation
  • Data analytics with BigQuery and Dataflow
  • Configuration management tools usage
  • Security compliance knowledge (FedRAMP)
  • CI/CD pipeline development with Jenkins and Git

Certification

Google Cloud Certified Professional Cloud Engineer

Accomplishments

SRE of the Month
