Vijayadurga Chunchu

Summary

With approximately 6 years of experience, I have developed in-depth technical expertise in Linux administration, systems and network administration, cloud computing, and DevOps practices. I have hands-on experience with Continuous Integration (CI), Continuous Delivery, and Continuous Deployment (CD) across Development, Testing, Staging, and Production environments. My work also spans Software Configuration Management (SCM), build, and release engineering, ensuring smooth transitions between environments. I have contributed to the implementation and maintenance of AWS infrastructure, using services such as EC2, S3, VPC, IAM, RDS, and CloudWatch to support and optimize application performance. As a Site Reliability Engineer, I have hands-on experience designing, coding, testing, and supporting next-generation database solutions in Oracle enterprise and SQL Server environments. I am proficient at developing large-scale software systems, maintaining high server uptime, and responding swiftly to outages or interruptions. I have consistently enabled smoother deployments and application monitoring across platforms by implementing automation tools, and I have demonstrated leadership while coordinating with cross-functional teams to ensure system efficiency and reliability.

Overview

6 years of professional experience

Work History

Cloud DevOps Engineer/SRE Engineer

HSBC

Arlington Heights, USA

01.2020 - Current

Developed and maintained Terraform scripts to replicate on-premises infrastructure in AWS, ensuring a seamless, repeatable process for environment creation, and enabling consistent infrastructure management across multiple AWS accounts
Configured AWS security groups, IAM roles, and VPC networking using Terraform to ensure secure, compliant migration of on-premises workloads to the cloud, adhering to industry best practices and regulatory standards
Utilized Terraform to provision AWS resources based on demand, implementing Auto Scaling and Reserved Instances to optimize costs and ensure efficient use of cloud resources during the migration from on-premises environments
Utilized Ansible to manage auto-scaling policies and configure cloud resources (e.g., EC2, VM instances) to automatically scale up or down based on traffic load and demand, ensuring system stability
Automated the entire migration process from on-prem to AWS with Terraform, integrating version-controlled infrastructure code into CI/CD pipelines to ensure smooth and continuous deployment of resources throughout the migration lifecycle
Created Docker images from a Dockerfile, worked with Docker container snapshots, removed images, managed Docker volumes, and implemented a Docker automation solution for the Continuous Integration / Continuous Delivery model
Implemented SRE principles such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) to ensure application uptime and reliability
Created Python-based automation scripts to integrate IaC workflows with continuous integration/continuous delivery (CI/CD) pipelines
Monitored and logged server activity using Splunk, App Insights, and ELK to measure server load, including login counts, daily active users, and website traffic at peak load, and to identify ways to handle heavy traffic
Deployed Prometheus for real-time monitoring of infrastructure and application metrics, integrated with Grafana dashboards for visualizing key performance indicators (KPIs)
Utilized Dynatrace's automatic anomaly detection and alerting capabilities to quickly detect and respond to incidents, reducing Mean Time to Detect (MTTD) and Mean Time to Recovery (MTTR)
Conducted performance benchmarking, optimization, and capacity planning using AppDynamics and Prometheus metrics, identifying areas for scaling and optimizing cloud resources
Led incident response efforts, conducted postmortem analyses, and worked with engineering teams to implement corrective actions to prevent future outages
Worked closely with development, security, and QA teams to integrate monitoring, performance tracking, and continuous deployment processes into the SDLC
Created comprehensive runbooks, troubleshooting guides, and internal documentation on AWS best practices, monitoring configurations, and incident management protocols
Configured Jenkins masters and slaves in the cloud and administered all instances
Integrated GitHub/Bitbucket with Jenkins to start automated builds using various triggers such as pull requests, tags, and commits
Utilized Jira, Confluence, and Slack for tracking incidents, managing projects, and facilitating team communication during outages and critical incidents
Implemented containerization of applications using Docker and orchestrated container deployments with Kubernetes (EKS) and AWS Fargate for scalability and resilience
Utilized Helm to automate the deployment, configuration, and management of containerized applications on Kubernetes, reducing manual intervention and deployment errors
Managed and optimized highly available database environments (MySQL, PostgreSQL, MongoDB, Cassandra, etc.) in production for high traffic applications
Developed and maintained CI/CD pipelines using Jenkins, GitLab CI, and AWS CodePipeline to automate build, test, and deployment processes, ensuring seamless application delivery
Integrated AppDynamics for real-time application performance monitoring, identifying bottlenecks, and optimizing application flow to enhance end-user experience and improve application uptime
Configured automated alerting mechanisms using Prometheus Alertmanager, Splunk, and AppDynamics to proactively identify and address issues, reducing mean time to recovery (MTTR)
Managed and optimized network infrastructure leveraging TCP/IP protocols to ensure low-latency, high-availability, and scalable services across global data centers
Implemented and managed HTTP-based APIs, ensuring efficient communication and response times across microservices architectures
Leveraged Splunk for advanced log search, parsing, and analysis to troubleshoot incidents, improve system reliability, and monitor the health of microservices
Environment & Tools: AWS, Terraform, Jenkins, Splunk, Grafana, Git, Nexus, Docker, Kubernetes, Python, Shell Scripting

Linux System Administrator

BrownGreer PLC

Richmond, USA

03.2019 - 12.2019

Assisted in managing and maintaining Windows Server environments, including regular updates, patching, and troubleshooting
Performed user account administration, including onboarding/offboarding and permission changes via Active Directory
Supported 300 on-site users and additional remote users via email, phone, and ticketing systems for their computing needs
Monitored system performance and availability using SolarWinds and Nagios
Supported basic networking tasks such as configuring IP addresses and managing DNS and DHCP settings
Assisted in virtualization environment setup using VMware and Hyper-V
Installed, configured, administered, and supported WebSphere Application Servers 6.0/6.1 on Windows and Linux environments using both GUI and silent installs
Adopted and followed the Waterfall methodology for application development
Responsible for code generation and migration between stages, keeping the process smooth and supporting the team
Responsible for documenting the procedures followed in the build process
Responsible for release notes and build process documentation, issue log and bug reports
Provided frontline IT support, troubleshooting software, hardware, and networking issues for users
Environment: Linux (RHEL 4.x/5.x), Solaris 8/9/10, CentOS, VERITAS Volume Manager, Shell Scripting, Autosys, VMware, Apache Tomcat, Nagios, WebSphere Application Servers 6.x

Education

Master’s - COMPUTER SCIENCE

Wright State University

OH

Bachelor’s -

GITAM University

Visakhapatnam

Skills

Linux operating system
Infrastructure automation
Microservices architecture
Test automation
Virtualization technologies
Developer collaboration
Configuration management
Incident management
Linux administration

Disaster recovery
Continuous deployment
Application scaling
Capacity planning
Containerization technologies
Continuous integration
Scripting languages
API management
System monitoring
