Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

MADHU B

Summary

Bilingual DevOps Engineer with background designing, testing, and implementing infrastructure and applications. Talented performer with 10+ years of experience using source control tools to identify and fix bugs in code. Consistent team player with exemplary multitasking skills.

● Proficient in container-based deployments utilizing Docker, Docker images, Dockerfile, Docker Hub, ECS, and ECR.

● Skilled in configuring AWS IAM, Security Groups, and managing IAM accounts and policies to ensure compliance with security standards.

● Experienced in designing Serverless architectures using API Gateway, Lambda, and DynamoDB, including deployment from Amazon S3 buckets and creation of Lambda Deployment functions.

● Competent in Kubernetes orchestration, creating pods, config Maps, and deployments, and integrating Ansible Tower with Jenkins for streamlined code deployment.

● Proficient in Continuous Integration processes with Jenkins, Bamboo, and CI/CD automation using various DevOps tools.

● Expertise in configuring and maintaining a wide range of AWS services for high availability, fault tolerance, and Autoscaling using AWS CloudFormation.

● Skilled in data management within AWS S3 buckets, including versioning, lifecycle policies, and web hosting with Pre-Signed URLs.

● Specialized in configuration management tools such as Chef, Ansible, and Puppet for infrastructure automation.

● Hands-on experience with Azure services including Compute, Storage, SQL Azure, Network Services, and PowerShell Automation.

● Proficient in creating and managing pipelines using Azure Data Factory and monitoring applications using tools like

New Relic and AppDynamics.

● Skilled in building scalable and resilient solutions on Azure, including VM availability sets and Virtual Machine Scale Sets.

● Familiarity with cloud provisioning tools such as Terraform and CloudFormation for infrastructure deployment and management.

● Experienced in Blue/green deployment strategies using CloudFormation templates and Route53 weighted record sets.

● Proficient in Docker container management, including snapshots, image optimization, and container lifecycle management.

● Skilled in analyzing web application performance using Real User Monitoring (RUM) Metrics and optimizing Docker image creation.

● Experienced in scheduling, deploying, and managing container replicas using Kubernetes clusters.

● Proficient in creating and managing Ansible Playbooks for continuous deployment automation.

● Skilled in server monitoring and network performance analysis using Nagios, Splunk, CloudWatch, and ELK.

● Experienced in implementing Continuous IntegrationContinuous Delivery (CICD) pipelines using Jenkins, SonarQube, Nexus, Perl Scripting, and Shell scripting.

● Proficient in source code management tools such as Git, GitLab, Bitbucket, and SVN, including migration from SVN to Git.

● Skilled in issue management tools like ServiceNow, JIRA, Confluence, and Rally.

● Experienced in installing, configuring, and managing various databases including SQL Server, MySQL, NoSQL, DB2, PostgreSQL, Oracle, DynamoDB, MongoDB, and Cassandra.

● Proficient in REST API development and good understanding of API concepts and REST architectural style.

● Expertise in installation, administration, configuration, performance tuning, and troubleshooting of Linux and

Windows operating systems.

● Proficient in utilizing Splunk for log analysis, monitoring, and troubleshooting to ensure optimal system performance and security.

● Extensive experience in developing, managing, and optimizing Terraform modules for automated infrastructure provisioning and configuration management across various cloud environments.

● Deep understanding of Google Cloud Platform services, utilizing Terraform to design and manage scalable and secure infrastructures, including Compute Engine, Cloud Storage, Cloud SQL, VPCs, and IAM.

● Demonstrated ability to orchestrate complex, multi-tiered architectures on GCP with Terraform, ensuring compliance with best practices for security, scalability, and cost efficiency.

● Strong problem-solving skills in troubleshooting Terraform configurations, resolving infrastructure-related issues,and optimizing code for improved performance and reliability.

● Skilled in creating custom dashboards, alerts, and reports within Splunk to provide real-time visibility into system health and performance metrics.

● Experienced in integrating Splunk with other monitoring tools and platforms to centralize log management and streamline incident response processes.

● Provided 24/7 on-call support for production systems, ensuring rapid response and resolution of incidents to maintain system availability and reliability.

● Responded to alerts and incidents promptly, diagnosing and resolving issues related to infrastructure, deployment pipelines, and application performance.

Overview

10
10
years of professional experience
1
1
Certification

Work History

SRE/Sr. Cloud DevOps Engineer

Humana
06.2022 - Current
  • Utilized AWS API extensively to manage resources such as EC2 instances, S3 buckets, VPC configurations CloudWatch metrics, Auto-scaling groups, and SNS notifications, crafting Python scripts for seamless resource management
  • Integrated AWS DynamoDB with AWS Lambda functions for efficient data storage and backup of DynamoDB streams
  • Designed and deployed custom-sized AWS infrastructure using AWS CloudFormation templates, encompassing VPC setups, subnets, EC2 instances, ELBs, and security groups, while ensuring adherence to AWS best practices for data integrity and security
  • Demonstrated proficiency across various Azure services including Compute, SQL Azure, Storage, and Network services, alongside adept management of load balancers handling high data throughput
  • Led architecture and deployment efforts spanning bare solutions, VMWare, and Amazon Web Services
  • Delivered effective troubleshooting solutions for large-scale customer-facing issues and big-data internal tools
  • Managed diverse Linux distributions including Ubuntu, RHEL, Amazon Linux, and CentOS
  • Implemented robust version control practices utilizing TFS, CVS, SVN, Git, GitHub, Perforce, with a focus on DevOps methodologies leveraging tools like Jenkins, Maven, Ant, Chef, and others
  • Specialized in configuration management tools such as Chef, Ansible, and Puppet for infrastructure automation
  • Demonstrated hands-on expertise with Microsoft Azure Cloud services, particularly Storage Accounts and Virtual Networks
  • Successfully migrated applications from on-demand EC2 instances to ECS Fargate spot instances, significantly reducing maintenance costs and improving scalability and monitoring capabilities
  • Leveraged AWS Lambda for event-driven code execution, especially in response to S3 bucket changes and HTTP requests via AWS API Gateway
  • Implemented a performance monitoring dashboard in Grafana by integrating Prometheus for Kubernetes metrics tracking
  • Conducted application monitoring using New Relic to ensure consistent performance, providing on-call production support as needed
  • Utilized Terraform for infrastructure provisioning, including the setup of AWS resources such as EC2 instances, S3 buckets, and VPC configurations
  • Orchestrated deployment and maintenance of microservices using Docker and Kubernetes, optimizing container management and scalability
  • Configured Splunk for logging and monitoring AWS services, creating dashboards and setting alerts to monitor system health and security events
  • Applied Zero Trust security principles across AWS infrastructure by configuring AWS IAM, Security Groups, AWS Config, and CloudTrail for real-time auditing and access control
  • Configured Jenkins Pipelines for Docker image building and pushing to AWS ECR, facilitating seamless integration into ECS Fargate Tasks
  • Established monitoring tools like Prometheus and Grafana for comprehensive tracking of Kafka cluster components and other system endpoints
  • Managed Kubernetes charts using Helm, ensuring reproducible builds and releases of Kubernetes applications
  • Created and maintained a library of Helm Charts for various applications, ensuring up-to-date deployments and configurability
  • Utilized Ansible Playbooks with Python wrappers to manage AWS configurations and deployments, facilitating efficient web application deployments
  • Implemented Continuous Integration (CI) and Continuous Deployment (CD) processes using Jenkins, designing project workflows and pipelines for seamless integration and deployment
  • Installed and configured Splunk for log monitoring, creating dashboards and alerts to facilitate proactive system management
  • Configured network and server monitoring using ELK Stack, along with PagerDuty for notifications alerts, ensuring timely responses to system issues
  • Utilized JIRA for project management and ServiceNow for change order creation and management in production deployments

DevOps Engineer

American Airlines
10.2020 - 05.2022
  • Played a pivotal role in installing, configuring, and administering Red Hat Linux (versions 4.x, 5.x, and 6.1) and Windows servers, leveraging Kickstart and Jump Start Servers
  • Provided comprehensive support for various applications running on these platforms
  • Managed multiple applications across diverse environments including DEV, QA, UAT, PRE-PROD, and PROD for various releases
  • Developed instance strategies and meticulously crafted Release Calendars to ensure seamless deployment processes
  • Implemented Helm to effortlessly install Prometheus and Grafana, facilitating robust monitoring of application performance within the Kubernetes cluster
  • Conducted Infrastructure Monitoring adeptly utilizing industry-leading tools like Prometheus and Grafana, offering valuable insights into cluster performance and promptly addressing arising issues
  • Developed tailored dashboards in Amazon CloudWatch utilizing advanced scripting techniques, enabling holistic monitoring of EC2 performance metrics such as CPU utilization, memory usage, and disk usage
  • Leveraged Splunk for enhanced application performance monitoring, ensuring vital visibility and actionable insights
  • Established AWS CloudWatch alarms to monitor server performance metrics including CPU utilization and disk usage, ensuring proactive detection of potential issues
  • Orchestrated streaming of AWS CloudWatch logs to Splunk through AWS Lambda triggers, enabling real-time analysis and visualization of events
  • Set up and maintained the ELK (Elasticsearch, Logstash, Kibana) platform, employing regular expressions to parse unstructured logs into structured JSON format
  • Conducted log aggregation and analysis using Elasticsearch, Logstash, and Kibana (ELK stack) to identify patterns, troubleshoot issues, and optimize system performance
  • Managed performance dashboards in Kibana to track application metrics and performance efficiently
  • Automated NGINX/MySQL setup and implemented monitoring mechanisms for streamlined operations
  • Actively resolved issues related to Ansible environments, troubleshooting, and comparing run lists, ensuring seamless configuration management
  • Assisted System Administrators in troubleshooting network-related issues, including IP networking problems with firewalls, DNS, and load balancers
  • Collaborated with the team to identify and resolve issues, ensuring smooth functioning of network infrastructure
  • Implemented security controls following Zero Trust architecture, leveraging AWS IAM, VPC Network ACLs, and AWS Shield for DDoS protection
  • Automated security assessments using AWS Security Hub and GuardDuty, identifying vulnerabilities and misconfigurations in real-time
  • Leveraged Terraform for provisioning AWS cloud infrastructure, contributing to the creation of AWS Batch policies and maximizing the potential of AWS Batch features
  • Managed S3 buckets, implemented policies, and utilized S3 and Glacier for archival storage and backup on AWS
  • Contributed to the development of a Maven-based build environment and collaborated on the seamless integration of Kafka services
  • Employed Kubernetes for efficient container management on Amazon Web Services (AWS), leveraging Ingress for network management and load balancing
  • Demonstrated proficiency in versioning, branching, and managing Jenkins pipelines for continuous integration and deployment objectives
  • Utilized ServiceNow for Agile project management, implementing iterative and incremental development methodologies and ensuring seamless task visibility and collaboration within cross-functional teams
  • Provided 24x7 on-call and weekend support for production computing environments, ensuring uninterrupted operations
  • Installed and managed Red Hat Linux and Windows servers, enhancing application performance across environments
  • Developed monitoring dashboards in CloudWatch and Grafana, improving visibility of EC2 performance metrics

Sr. DevOps Engineer

United Health Group
10.2018 - 09.2020
  • Orchestrated the deployment of Kubernetes clusters on AWS using Amazon Elastic Kubernetes Service (EKS) and AWS Command Line Interface (CLI)
  • Leveraged Kubernetes and Docker for the CI/CD system's runtime environment, streamlining building, testing, and integration
  • Configured and maintained multiple systems on AWS using Ansible playbooks within an Atlassian continuous build and deploy environment, encompassing Jira, Confluence, Bitbucket (formerly Stash), Jenkins, and AWS CodeCommit
  • Automated building and deployment of Microservices using AWS CodeCommit and Jenkins
  • Developed automated tests using Selenium WebDriver in C# and integrated them with Jenkins for build automation, with NUnit as the testing framework
  • Worked with AWS Virtual Private Network (VPN), Virtual Private Cloud (VPC), AWS Network Security (Security Groups, Network ACLs), and Firewall configurations
  • Leveraged PowerShell scripting for automating deployments and provisioning of Amazon Elastic Compute Cloud (Amazon EC2) instances on AWS
  • Managed artifacts in binary repositories using JFrog Artifactory and configured AWS CodePipeline with the Jenkins Artifactory plugin to push new artifacts
  • Utilized Terraform templates to automate the deployment of Infrastructure as code (IAC) services using Terraform modules
  • Deployed virtual machine scale sets in production environments
  • Collaborated closely with Development and Testing teams, providing process design, management, and support for source code control, compilation, change management, and production release management on AWS
  • Developed an automation system utilizing AWS Lambda and PowerShell scripts with JSON templates for remediating AWS services
  • Developed Splunk queries and dashboards on AWS to analyze and optimize application performance and capacity, enabling data-driven decision-making
  • Utilized Jira as a defect tracking system, configuring workflows, customizations, and plugins for bug/issue tracking
  • Coordinated with managers and developers to gather requirements and resolve code conflicts during deployments to different AWS environments
  • Provided 24x7 on-call production support ensuring the availability and smooth operation of all environments
  • Worked on cache purging, whitelisting, and adding new digital properties to existing SSL certificates on AWS, ensuring secure and efficient data transmission

System/Linux Administrator

Amphoras
02.2016 - 08.2018
  • Company Overview: India
  • I administered the Subversion Version Control System (VCS), managing user access to repositories and ensuring smooth operations
  • I proposed and implemented industry best branching strategies, creating branches for parallel development in a fast-paced Agile environment
  • I integrated Subversion with Jira and implemented gated check-ins using pre-commit hooks and Jira commit plugins with post-commit hooks
  • I planned and executed the migration from Bugzilla and Hudson CI to the Atlassian suite, including JIRA, Confluence, and Bamboo
  • I installed and administered the Atlassian toolset (JIRA, Confluence, Fisheye, Crucible, and Bamboo), and upgraded FishEye from an HSQLDB to a MySQL database
  • I created Maven POM files to automate the build process, integrating them with third-party tools like Sonar and Nexus for enhanced automation
  • I installed and administered Nexus, creating roles and privileges to restrict access, and managed dependencies during the build process from an internal Nexus repository
  • I troubleshot Java build issues, integrated Ant scripts for code quality reporting, and installed Bamboo to support the Continuous Integration (CI) process for Java builds
  • I deployed static code to Apache web servers, deployed application WARs and EARs to WebLogic, and configured JNDI, Data Sources, and JDBC for backend Oracle database connectivity
  • I collaborated with project managers to design release plans, coordinated module deployments across test and production environments, and worked closely with developers to identify and address build failures
  • India

Linux Administrator

ESPS
10.2014 - 01.2016
  • Company Overview: India
  • Performed System administration activities using NFS, NIS, DHCP, FTP, Send mail, and Telnet for Linux
  • Installed and configured Apache web servers on various Linux and UNIX Servers
  • Administered packages using RPM/YUM on Red Hat Linux and maintaining patching on Solaris servers
  • Worked on recovering root password
  • Configuring and Implemented RAID levels (RAID0, RAID1, RAID5), Logical Volume Management (LVM), and Load Balancing
  • Managed systems backup, scheduling jobs, enabling Cron jobs, enabling system logging, and network logging of servers for maintenance
  • Installed and configured Oracle, Solaris, and Linux servers using JUMPSTART & KICKSTART installation & periodic path upgrading using live upgrade
  • Worked on LAN/WAN, firewalls, & routing for Internet & Intranet connectivity using different protocols like TCP/IP, DHCP, HTTP/s, FTP, SMTP & SSH
  • Configured Login management centrally using OPENLDAP
  • Managed Patches Configuration, Version Control, Service Pack, and reviewed connectivity issues regarding security problems and did network management TCP/IP, NIS, DNS, NFS, VLAN
  • Administered Linux servers for several functions including managing Apache/Tomcat Server, mail server, MYSQL databases in both development and production
  • Monitored health of infrastructures servers and mission-critical using Nagios
  • India

Education

Bachelor of Engineer - computer science

Sathyabama University

Skills

  • Operating systems: Windows, Linux, RedHat, CentOS, Ubuntu
  • Cloud Services: Amazon Web Services (AWS), Microsoft Azure
  • Version tool: Bit Bucket, GIT, GitHub
  • Build Tools: Ant, Maven, Gradle
  • Continuous Integration: Jenkins, Bamboo, Azure pipelines, TeamCity, GitHub Actions
  • Configuration management: Puppet, Chef, Ansible
  • Containerization/orchestration: Docker, Docker swarm, Kubernetes, EC2 Container Service, Azure Container Service, ECR, ECS, IAAS, PAAS
  • Automation Tools: Terraform, Cloud Formation Templates, Azure Resource Manager (ARM Templates)
  • Web/APP Servers: Apache Tomcat, Nginx, Web Sphere, WebLogic, JBoss
  • Databases: MongoDB, Cassandra DB, DynamoDB, Aurora, Cosmos DB, SQL Server, MySQL
  • Virtualization: Oracle Virtual box, VMware ESX/ ESXi, Windows Hyper-V
  • Programming & Scripting Languages: Python, Perl, HTML, JavaScript, Angular JS, PowerShell, Bash, Shell, Groovy, JSON, XML, YAML, Helm, c, Net
  • Artifactories: Nexus, JFrog, Archiva, Artifactory
  • Monitoring Tool: Grafana, Datadog, Cloud Watch, Splunk, Dynatrace, New Relic, Prometheus, Nagios, Splunk, ELK Stack
  • Networking:TCP/IP, HTTP/HTTPS, DNS, NFS, LAN, FTP, SSH, UDP, SMTP, SFTP, Route53
  • Ticketing Tool: Jira, Service Now, IBM ClearQuest, Rally, Bugzilla, Redmine

Certification

  • AWS SysOps Administrator
  • AWS Certified Cloud Practitioner

Timeline

SRE/Sr. Cloud DevOps Engineer

Humana
06.2022 - Current

DevOps Engineer

American Airlines
10.2020 - 05.2022

Sr. DevOps Engineer

United Health Group
10.2018 - 09.2020

System/Linux Administrator

Amphoras
02.2016 - 08.2018

Linux Administrator

ESPS
10.2014 - 01.2016

Bachelor of Engineer - computer science

Sathyabama University
MADHU B