Sunil Kumar Kanakappagari

Dallas, TX

Summary

  • Senior DevOps Engineer with over 9 years of experience in supporting, automating, and optimizing mission-critical deployments in AWS and Azure. Proven track record in leveraging configuration management, CI/CD, Infrastructure as Code (IaC), and container orchestration to deliver robust solutions.
  • Extensive experience with AWS and Azure, utilizing their IaaS (EC2, S3, EBS, VPC, ELB, VM, Storage) and PaaS (AWS Lambda, RDS, App Services, Azure Functions) offerings, as well as services like Route 53, CloudWatch, CloudTrail, Auto Scaling, Security Groups, Redshift, and Directory Services. Designed and deployed customized infrastructure using Terraform (IaC), creating VPCs, subnets, and NAT in AWS and Azure, and architecting secure VPC solutions with Network ACLs and Security Groups. Streamlined CI/CD pipelines using GitHub, Terraform, Ansible, Jenkins, and Azure DevOps, automating deployments and managing configurations with Ansible.
  • Expert in containerization, managing Docker images, Docker Hub, and registries, and creating and maintaining Kubernetes EKS and AKS clusters. Applied Jenkins for continuous integration and end-to-end automation, using Maven and Nexus for build management. Utilized monitoring and logging tools such as Nagios, Splunk, ELK, and CloudWatch for service management, log maintenance, event monitoring, and issue resolution.
  • Proficient in networking, including VPC, Subnet, SSH, DNS, VPN, NACL, and Security Groups. Experienced in agile methodologies with JIRA and Service Desk for bug tracking and code tracking integration. Additionally skilled in VMware Tanzu products, database management, and scripting and automation. Demonstrated leadership and communication skills, enabling effective performance both independently and within teams.

Overview

10 years of professional experience
1 Certification

Work History

Senior DevOps Engineer

Northern Trust
Chicago, Illinois
03.2022 - Current
  • Engineered an automated Linux cloud infrastructure with Puppet, managing Java versions, and creating Python plugins for metrics collection, resulting in a 40% reduction in administrative task completion time across 100+ servers
  • Built automation systems using PowerShell scripts and JSON templates, which improved Azure services remediation by 35%, and implemented Azure Functions for code uploads and monitoring changes in Blob and Table Storage, enhancing operational efficiency by 25%
  • Designed microservices onboarding tools with Python and Jenkins on AWS, improving build job maintenance by 50% and streamlining Kubernetes deployments, while also crafting Docker containers with various distributions, reducing container build times by 30%
  • Implemented Infrastructure as Code for deploying and updating production Tanzu Application Service (PCF) foundations and configured vSphere 7.0 with Tanzu, enabling Kubernetes (k8s) data domain through Concourse CI server and pipelines
  • Wrote Terraform templates for Azure infrastructure as code to build staging and production environments, ensuring no changes to existing infrastructure during incremental deployments
  • Established CI/CD pipelines with Azure DevOps, integrating Git, MSBuild, Docker, Maven, Jenkins, and AWS CloudFormation, automating components such as EC2, Security Groups, ELB, RDS, and IAM
  • Led complex support calls with vendors like JFrog and VMware Tanzu, resulting in a 30% increase in CI/CD tools performance
  • Debugged build failures and collaborated with developers and QA to resolve issues, leading to a 40% increase in build success rates. Designed and deployed pipelines through Azure Data Factory, reducing error rates by 35%
  • Designed and deployed AWS solutions using EC2, S3, EBS, ELB, auto-scaling groups, and OpsWorks, while managing and administering Apache, Tomcat, WebLogic, WebSphere, and JBoss servers
  • Implemented the ELK (Elasticsearch, Logstash, Kibana) Stack for enhanced log management, reducing downtime by 50%
  • Configured Nagios to monitor Azure VMs with Puppet automation, leading to a 30% improvement in system reliability
  • Optimized NoSQL databases like MongoDB and Cassandra, increasing scalability by 40%
  • Automated Azure services, including VMs, Functions, Load Balancer, Blob Storage, and Table Storage, using Python scripts, resulting in a 25% reduction in manual management time
  • Established a virtual network on Azure to connect all servers, enhancing network security and management, and configured CloudWatch alarms and notifications for EC2 instances, employing Lambda functions for automated snapshot backups
  • Improved Incident Management by automating ticketing and escalation workflows within ITSM tools, resulting in faster incident resolution and effective management of incident backlogs
  • Employed Kubernetes to automate deployment, scaling, and operations of application containers across clusters, achieving 99.9% uptime and scaling applications up to 50% more efficiently
  • Collaborated with the Azure product team to identify and resolve defects in the Azure IaaS offering, leading to a 40% reduction in defect resolution time
  • Implemented and configured monitoring solutions using New Relic, Grafana, and Kibana to proactively track system performance, visualize metrics, and swiftly identify and resolve issues, resulting in improved system reliability and uptime
  • Designed and implemented network architectures suitable for various cloud topologies, ensuring optimal performance and reliability
  • Applied best practices for SLO, SLA, and OLA management to maintain high service standards for cloud services and organizational needs
  • Set up Git repositories and controlled access rights for authorized developers and workers, reducing code deployment errors by 20% and ensuring 100% compliance with security policies, resulting in improved code management efficiency

DevOps Engineer

Molina Healthcare
Bothell, WA
09.2020 - 02.2022
  • Engineered microservice onboarding tools using Python and Jenkins on AWS, facilitating the seamless setup and maintenance of build jobs and Kubernetes deployments and services
  • Implemented reproducible builds of Kubernetes applications, managed Kubernetes manifest files and Helm packages, and used Kubernetes to deploy, scale, load balance, and manage Docker containers across namespace versions
  • Implemented robust solutions for deploying applications across multiple zones and regions, optimizing for redundancy and high availability
  • Utilized advanced techniques for data synchronization and consistency to support scalable and resilient cloud-based applications
  • Deployed Kubernetes-based container environments for development teams and containerized environments for releases using Docker and Kubernetes
  • Managed Docker containers and consoles for application lifecycle management, utilizing Docker Engine, Hub, Machine, Compose, and Docker Registry
  • Developed and executed strategies for horizontally and vertically scaling applications, ensuring seamless performance across sync/async environments
  • Leveraged BASE/ACID principles to maintain data integrity and consistency in multi-zone and multi-region deployments
  • Focused on containerization and immutable infrastructure using Docker and Kubernetes, including deploying Docker Swarm for Spring Boot applications and Docker Engines in virtualized platforms for containerization
  • Implemented Problem Management practices by conducting root cause analysis and updating the ITSM knowledge base, enhancing the resolution of recurring issues and minimizing operational impact
  • Constructed Docker images from scratch, customized and adjusted base images from existing environment configurations, and maintained the image repository for development teams
  • Administered Ansible playbooks with roles, using the file module to copy and remove files on remote systems, and established inventories for supporting continuous deployment
  • Applied Ansible roles in YAML to automate repetitive tasks and accelerate deployment of critical applications, including outlining tasks, variables, files, handlers, and templates, setting up Ansible files for parallel deployment, and transforming static playbooks into dynamic roles to streamline the continuous delivery
  • Oversaw the lifecycle of digital certificates, including issuance, renewal, and revocation, across various system components
  • Ensured secure communication channels and compliance with security protocols, significantly reducing the risk of service interruptions and security breaches
  • Executed Terraform for infrastructure resource management, RDS configuration, and dependency mapping, detecting network issues and collaborating with Jenkins to enhance automation
  • Managed Release Management processes by coordinating deployments, performing risk assessments, and ensuring adherence to ITIL standards to achieve smooth and controlled software releases
  • Enhanced code security by establishing and fine-tuning scanning processes in Sonatype and Veracode, ensuring comprehensive detection of vulnerabilities and improving overall code quality
  • Set up Splunk to capture and analyze logs from load balancers, web servers, and application servers, and monitored system logs, notifying the incident management system upon exceeding thresholds
  • Coordinated with developers to establish and apply appropriate branching and labeling/naming conventions using Git, and resolved conflicts related to merging source code
  • Maintained JIRA for tracking and updating project defects and tasks, ensuring successful completion of tasks in sprints
  • Implemented Operations Management Suite, employed PowerShell scripts, and crafted ARM templates for streamlined operations

DevOps Engineer

Nationwide Insurance
Columbus, OH
04.2018 - 08.2020
  • Coordinated with Development, Database Administration, QA, and IT Operations teams throughout the SDLC to prevent resource conflicts and ensure smooth project execution
  • Managed AWS IAM services by creating user/group accounts and attaching policies to control access and enhance cloud security
  • Orchestrated the deployment of fault-tolerant AWS instances, ECS clusters, and scalable infrastructure components like Elastic Load Balancers and Auto Scaling groups, ensuring high availability and performance for critical systems
  • Utilized Apigee to configure and deploy API gateways, handling certificate tasks such as issuance and renewal to ensure secure and reliable API communications
  • Defined AWS Security Groups as virtual firewalls to control incoming traffic and configured VPCs, subnets, Internet Gateways, S3 buckets, and Route 53 in the Amazon Cloud environment
  • Automated the application build and deployment process by creating Jenkins CI/CD pipelines, integrating Maven and Gradle for building WAR files, Docker for containerizing applications, and Ansible for deployment
  • Developed and implemented Bash, Shell, and PowerShell scripts to automate continuous deployment tasks and server housekeeping, enhancing overall efficiency and reducing manual errors
  • Managed service level objectives (SLOs), service level agreements (SLAs), and operational level agreements (OLAs) for cloud services, ensuring compliance and performance standards were consistently met or exceeded
  • Installed and configured monitoring tools such as Mantis, Splunk, and AWS CloudWatch to ensure system performance and reliability
  • Developed multi-branch pipelines and automated builds and deployments using Jenkins and Ansible, integrating Git, JUnit testing, and Selenium to ensure robust and error-free code releases
  • Automated the creation of immutable infrastructure components (EC2, VPC, ELB, Auto-Scaling) using Terraform (IaC), setting up both classic and application load balancers based on application team requirements
  • Built and tested .NET code using MSBuild and MSTest, and generated NuGet packages for application teams, ensuring consistent and reusable code across repositories
  • Wrote Dockerfiles to automate database deployments on Oracle Linux containers, enhancing deployment speed and consistency
  • Developed shared libraries for Jenkinsfiles, enabling reusable code across different projects and pipelines
  • Integrated build and deployment pipelines with communication tools like email, Slack, and MS Teams, ensuring real-time notifications to appropriate teams on status updates
  • Promoted builds across environments, including sandbox, dev, sat, prod, and DR, ensuring smooth transitions and minimizing deployment issues
  • Maintained and managed Jenkins, focusing on plugin management, user access, and configuration management
  • Integrated Jenkins with an AWS plugin to keep Jenkins agents online only when necessary, resulting in significant cost savings
  • Modeled and automated end-to-end CI/CD pipelines using tools such as Jenkins, Nexus Artifactory, and Git
  • Utilized Nexus Artifactory for storing and retrieving built artifacts

Software Engineer

Zensar Technologies
Chennai, India
08.2014 - 12.2017
  • Specialized in developing and deploying Java applications on Google Cloud Platform (GCP)
  • Utilized GCP services such as Compute Engine, Kubernetes Engine, Cloud Storage, and Cloud Functions to design and implement scalable, high-performance solutions
  • Identified and resolved memory leaks using tools like YourKit and Eclipse MAT, ensuring proper resource management and stable memory usage in production environments
  • Diagnosed and resolved threading issues by analyzing thread dumps with JStack and TDA, eliminating deadlocks and reducing thread contention to enhance application concurrency
  • Analyzed garbage collection logs to optimize JVM parameters, reducing pause times and improving application responsiveness using tools like GCViewer and VisualVM
  • Designed and managed distributed systems and networking infrastructures, ensuring robust and secure connectivity across development, testing, and production environments
  • Installed and managed DevOps tools such as Jenkins, GitLab, Chef, Docker, and JFrog Artifactory
  • Conducted maintenance tasks including user and group management, ensuring seamless operations
  • Utilized JIRA for project management, including bug tracking, workflow creation, dashboard report generation, and sprint planning
  • Worked closely with development teams to identify and mitigate common build failures
  • Provided detailed project updates and schedule feedback to management, ensuring clear communication and timely oversight
  • Demonstrated strong analytical, organizational, and problem-solving skills with a keen attention to detail
  • Collaborated effectively within large financial services environments, adhering to industry standards and best practices while ensuring compliance and security
  • Proven ability to work both independently and as part of a team, maintaining high standards of accuracy and reliability with minimal supervision
  • Communicated effectively across teams and departments, drafting plans, soliciting feedback, establishing roadmaps, and providing regular updates to stakeholders

Education

Bachelor of Technology - Computer Science and Engineering

Mahatma Gandhi Institute of Technology (JNTUH)
Hyderabad, Telangana
05.2014

Skills

CLOUD: AWS, Microsoft Azure, and Google Cloud Platform (GCP)

AZURE TOOLS: Azure Storage Explorer, Visual Studio, VS Code, GitHub, Docker, Kubernetes, Service Bus, ARM Templates, Azure Key Vault, VNet, NSG

CI/CD & BUILD TOOLS: Jenkins, Terraform, Azure DevOps (formerly VSTS)

VERSION CONTROL TOOLS: Git, SVN, Bitbucket, Azure Repos (formerly TFS)

CONFIGURATION MANAGEMENT: Chef, Ansible, Puppet

MONITORING TOOLS: Nagios, Splunk, ELK, CloudWatch, Azure Monitor, Grafana, and Prometheus

CONTAINER TOOLS: Kubernetes, Docker, Azure Container Instances (ACI)

SCRIPTING LANGUAGES: Bash, Shell Scripting, Python, JSON, YAML, PowerShell, Go, Ruby

PROGRAMMING: C, Java, JavaScript, HTML, CSS

DATABASES: MySQL, Redshift, MongoDB, AWS RDS, PostgreSQL, and SQL Server

APPLICATION SERVERS: IIS, JBoss, Apache Tomcat, WebSphere, WebLogic

WEB SERVERS: Apache HTTP, Nginx, Apache Tomcat

OPERATING SYSTEMS: Red Hat Enterprise Linux (RHEL) 7/6/5/4, Unix, Ubuntu 16/14/13/12, Debian, CentOS, Windows, macOS

Certification

  • AWS Certified Solutions Architect
  • Microsoft Azure Fundamentals (AZ-900)
