Summary
Overview
Work History
Education
Skills
Timeline
Generic

Vijayadurga Chunchu

Arlington Heights,USA

Summary

With approximately 6 years of experience, I have developed in-depth technical expertise in Linux administration, systems/network administration, cloud computing, and DevOps practices. I have hands-on experience with Continuous Integration (CI), Continuous Delivery (CD), and Continuous Deployment (CD) across multiple environments, including Development, Testing, Staging, and Production. My work also spans Software Configuration Management (SCM), Build, and Release Engineering, ensuring smooth transitions between environments. Additionally, I have contributed to the implementation and maintenance of AWS infrastructure, leveraging services such as EC2, S3, VPC, IAM, RDS, and CloudWatch to support and optimize application performance. Highly skilled Site Reliability Engineer with hands-on experience in designing, coding, testing and supporting next-gen database solutions in Oracle enterprise and SQL Server environments. Proficient at developing large scale software systems, maintaining high server uptimes and responding swiftly to outages or interruptions. Consistently enabled smoother deployments and monitoring of applications across different platforms by implementing automation tools. Demonstrated leadership skills while coordinating with cross-functional teams to ensure system efficiency and reliability.

Overview

6
6
years of professional experience

Work History

Cloud DevOps Engineer/SRE Engineer

HSBC
Arlington Heights, USA
01.2020 - Current
  • Developed and maintained Terraform scripts to replicate on-premises infrastructure in AWS, ensuring a seamless, repeatable process for environment creation, and enabling consistent infrastructure management across multiple AWS accounts
  • Configured AWS security groups, IAM roles, and VPC networking using Terraform to ensure secure, compliant migration of on-premises workloads to the cloud, adhering to industry best practices and regulatory standards
  • Utilized Terraform to provision AWS resources based on demand, implementing Auto Scaling and Reserved Instances to optimize costs and ensure efficient use of cloud resources during the migration from on-premises environments
  • Utilized Ansible to manage auto-scaling policies and configure cloud resources (e.g., EC2, VM instances) to automatically scale up or down based on traffic load and demand, ensuring system stability
  • Automated the entire migration process from on-prem to AWS with Terraform, integrating version-controlled infrastructure code into CI/CD pipelines to ensure smooth and continuous deployment of resources throughout the migration lifecycle
  • Created Docker images using a Docker file, worked on Docker container snapshots, removing images and managing Docker volume and Implemented Docker automation solution for Continuous Integration / Continuous Delivery model
  • Implemented SRE principles such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) to ensure application uptime and reliability
  • Created Python-based automation scripts to integrate IaC workflows with continuous integration/continuous delivery (C/CD) pipelines
  • Working on monitoring and logging the servers by using Splunk, App Insights and ELK in ways to find the load on the servers like how many people are logging into the servers, how many people are using on daily basis, what is the traffic on the website when there is a heavy load and how to overcome the heavy traffic issues
  • Deployed Prometheus for real-time monitoring of infrastructure and application metrics, integrated with Grafana dashboards for visualizing key performance indicators (KPIs)
  • Utilized Dynatrace's automatic anomaly detection and alerting capabilities to quickly detect and respond to incidents, reducing Mean Time to Detect (MTTD) and Mean Time to Recovery (MTTR)
  • Conducted performance benchmarking, optimization, and capacity planning using AppDynamics and Prometheus metrics, identifying areas for scaling and optimizing cloud resources
  • Led incident response efforts, conducted postmortem analyses, and worked with engineering teams to implement corrective actions to prevent future outages
  • Worked closely with development, security, and QA teams to integrate monitoring, performance tracking, and continuous deployment processes into the SDLC
  • Created comprehensive runbooks, troubleshooting guides, and internal documentation on AWS best practices, monitoring configurations, and incident management protocols
  • Configured Jenkins masters and slaves on cloud and administer all the instances
  • Integrated GitHub/Bitbucket with Jenkins to start automated builds on Jenkins using various triggers such as pull request, tags and commits
  • Utilized Jira, Confluence, and Slack for tracking incidents, managing projects, and facilitating team communication during outages and critical incidents
  • Implemented containerization of applications using Docker and orchestrated container deployments with Kubernetes (EKS) and AWS Fargate for scalability and resilience
  • Utilized Helm to automate the deployment, configuration, and management of containerized applications on Kubernetes, reducing manual intervention and deployment errors
  • Managed and optimized highly available database environments (MySQL, PostgreSQL, MongoDB, Cassandra, etc.) in production for high traffic applications
  • Developed and maintained CI/CD pipelines using Jenkins, GitLab CI, and AWS CodePipeline to automate build, test, and deployment processes, ensuring seamless application delivery
  • Integrated AppDynamics for real-time application performance monitoring, identifying bottlenecks, and optimizing application flow to enhance end-user experience and improve application uptime
  • Configured automated alerting mechanisms using Prometheus Alertmanager, Splunk, and AppDynamics to proactively identify and address issues, reducing mean time to recovery (MTTR)
  • Managed and optimized network infrastructure leveraging TCP/IP protocols to ensure low-latency, high-availability, and scalable services across global data centers
  • Implemented and managed HTTP-based APIs, ensuring efficient communication and response times across microservices architectures
  • Leveraged Splunk for advanced log search, parsing, and analysis to troubleshoot incidents, improve system reliability, and monitor the health of microservices
  • Environment & Tools: AWS, Terraform, Jenkins, Splunk, Grafana, GIT, Nexas, Docker, Kubernetes, Python, Shell Scripting

Linux System Administrator

BrownGreer PLC
Richmond, USA
03.2019 - 12.2019
  • Assisted in managing and maintaining Window Server environments, including regular updates, patching, and trouble shooting
  • Performed user account administration, including onboarding/offboarding and permission changes via Active Directory
  • Interacting with 300 users on site and more remotely via email/phone/ticketing systems for their computing needs
  • Monitored system performance and availability using tools SolarWinds, Nagios
  • Supported basic networking tasks like configuring IP addresses, managing DSN and DHCP settings
  • Assisted in virtualization environment setup using VMware and Hyper-V
  • Installed, configured, Administered and supported WebSphere Application Servers 6.0/6.1 on Windows and Linux environment using GUI as well as silent install
  • Adopted and Followed waterfall methodology for application development
  • Responsible for code generation and migration to different stages and making up the process smooth and in supporting the team
  • Responsible for documenting the procedures followed up in the build process
  • Responsible for release notes and build process documentation, issue log and bug reports
  • Provided frontline IT support, troubleshooting software, hardware, and networking issues for users
  • Environment: Linux (RHEL 4.x/5.x), Solari8/9/10, CentOS, VERITAS Volume Manager, Shell Scripting, Autosys, VMWARE, Apache Tomcat, Nagios, WebSphere Application Servers 6.x

Education

Master’s - COMPUTER SCIENCE

Wright State University
OH

Bachelors -

GITAM University
Visakhapatnam

Skills

  • Linux operating system
  • Infrastructure automation
  • Microservices architecture
  • Test automation
  • Virtualization technologies
  • Developer collaboration
  • Configuration management
  • Incident management
  • Linux administration
  • Disaster recovery
  • Continuous deployment
  • Application scaling
  • Capacity planning
  • Containerization technologies
  • Continuous integration
  • Scripting languages
  • API management
  • System monitoring

Timeline

Cloud DevOps Engineer/SRE Engineer

HSBC
01.2020 - Current

Linux System Administrator

BrownGreer PLC
03.2019 - 12.2019

Master’s - COMPUTER SCIENCE

Wright State University

Bachelors -

GITAM University
Vijayadurga Chunchu