Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Dhiraj Kumar

Pittsburgh,PA

Summary

Certified AWS and Kubernetes professional with around 10+ years of experience in the IT industry in various roles as Lead DevOps, Cloud Engineer, and Build & Release Engineer in Linux Administration, Configuration Management, Continuous Integration, Continuous Deployment, and Cloud Solution Architecture Implementation. Utilized all AWS services such as VPC, VPC Peering, VPN, IAM, EC2, ELB, Auto Scaling, Code Build, Code Deploy, Code Pipeline, CloudWatch, CloudTrail, SNS, SQS, Lambda, Route53, EBS, Load Balancer, etc. Implement and maintain security protocols across AWS services, such as IAM roles, security groups, and VPC settings, ensuring that the organization's data is secure from threats. Developed and maintained automated deployment pipelines using AWS Code Pipeline and Code Build. Creating new build jobs, integration testing jobs, and deploying jobs in Jenkins to automate the process. Using AWS container services like ECS, ECR, and EKS, maintaining an end-to-end large number of containerization clusters. Designing, developing, and maintaining Kubernetes clusters and applications. Architecting Docker Management Platform, Leveraged Custom Docker Images as Containerized Apps within the Docker Engine as Multi Stack Application. Knowledge of creating Terraform templates to handle cross-cloud dependencies and managing the orchestration to build large-scale multi-cloud infrastructures. Expertise in various version control tools like GitHub, Gitlab, SVN, and Bitbucket. Experience pushing, pulling, and checking out repositories, merging branches, and designing multi-branch strategies. Experience designing solutions that are high quality, high performance, high availability, and highly scalable. Experience in Microsoft Azure platform as Azure Administrator and DevOps Engineer. Experience with Various services in Azure like Compute, Storage Account, Virtual networks, Load balancer, Active Directory, VM scale set, availability zone, availability set, etc. Managed AWS infrastructure as code (laC) using Terraform. Provisioned high availability and high scalability EC2 instances and EKS using Terraform and CloudFormation. Experience with Monitoring tools like Elasticsearch, Logstash, and Kibana (ELK) for log analytics, full-text search, and application monitoring in integration with AWS Lambda. Hands-on experience using CloudFormation templates for reliable provisioning of initial infrastructures (stacks) such as IAM roles & policies, S3 buckets, and bucket backups on AWS. Experience in system operations, auto-provisioning, code deployments, software installation, and configuration updates using Ansible playbooks. Experience working on Docker Hub, creating Docker images, and handling multiple images primarily for middleware installations and domain configuration. Extensively use Build tools like ANT, Maven, Gradle & MS Build to build the CICD pipelines. Proficient in Power Shell, JSON, YAML, Groovy, Bash/Shell, and Python. Certification: AWS Solution Architect Certified Kubernetes Administrator (CKA)

Overview

10
10
years of professional experience
1
1
Certification

Work History

Lead DevOps Engineer

Cisco
Remote
10.2022 - Current
  • My primary responsibility is design and building AWS infrastructure using resources such as VPC, EC2, S3, Subnet, Route Table, IAM, Security Group, Load Balancer (ELB), RDS, DynamoDB, EBS, Autoscaling (ASG), Route53, SNS, SQS, CloudWatch, and Cloud Trail
  • Provisioned high availability and highly scalable CP/DP cluster on AWS cloud using Infrastructure as code (IaC) Terraform
  • Creating snapshots and Amazon machine images (AMIs) of the instances for backup and creating clone instances
  • Assisted developers with cloud resource provisioning, configuration, and troubleshooting, ensuring a smooth deployment process
  • Docker file to build Docker images from the Java jar files
  • Integrated Kubernetes with continuous integration tool Jenkins to manage and automate the deployments
  • Expertise in setting up Upstream & Downstream Jobs, Jenkins server, and build jobs to provide continuous automated builds based on polling the GIT SCM during the day and periodically scheduling the builds overnight to support development
  • Using AWS container services like ECS, ECR, and EKS, maintaining an end-to-end large number of containerization clusters
  • Configuring different Artifacts (ECR) to make an image and deploying a Docker image to install the application on a Kubernetes (k8s) cluster
  • Maintained and troubleshot container and Kubernetes cluster problems
  • Great working experience in optimizing build process time and Storage management of Jfrog Artifactory
  • Making sure of a high availability and high scalability environment to reduce or eliminate planned and unplanned downtime for critical applications by incorporating redundant components and other technologies to address single points of failure in a system, database, or application
  • Implemented zero downtime PROD deployments on the docker service for rolling updates using a blue-green deployment strategy
  • Created custom Splunk dashboards, visualizations, and key performance metrics
  • Implemented complex search queries and alerting
  • Experience in implementing alert rules and managing Prometheus & Grafana for monitoring systems and application metrics
  • Contributed to the creation and maintenance of comprehensive documentation, including runbooks, playbooks, and knowledge base articles on confluence, to enable efficient issue resolution and knowledge sharing across teams
  • Proficient in cluster setup and management of Container orchestration service using AWS Elastic Container Service (EKS)
  • Experience in deploying microservices apps on Kubernetes and troubleshooting and optimizing Kubernetes(k8s) clusters for performance
  • Designed and implemented container orchestration using Kubernetes cluster for multiple projects, increasing efficiency by 30%
  • Led root cause analysis (RCA) investigations and implemented preventive measures to minimize the recurrence of incidents, promoting a culture of continuous improvement and learning
  • Expertise in writing Python scripts for lambda functions using Boto3
  • Proficient in writing Python and Shell scripts to automate the build process
  • Implemented and maintained the monitoring and alerting of production and corporate servers/storage using AWS CloudWatch and Splunk
  • Deployed pods using Replication Controllers by interacting with the Kubernetes API server, defining through declarative YAML files
  • Demonstrating expertise in monitoring, logging, and disaster recovery planning will further showcase the ability to ensure high availability, high scalability, and business continuity
  • Implemented Docker-Maven-plugin in Maven pom to build Docker images for all microservices and later used
  • Installed and implemented Ansible configuration management system
  • Used Ansible to manage web applications, environment configuration files, users, mount points, and packages
  • Created playbooks in YAML to automate the development processes
  • Using Jira we track the bugs & User Stories
  • Implemented Infrastructure automation through Ansible for auto-provisioning, code deployments, software installation, and configuration updates
  • Actively participated in service management processes, such as change management, incident management, and problem management, adhering to ITIL frameworks and industry standards
  • Skills: Git, GitLab, Jenkins, ServiceNow, GitHub, Ansible, Docker, Kubernetes, YAML, Groovy, Azure DevOps, EKS, ECR, ECS, AWS CLI, VPC, EC2, S3, RDS, DynamoDB, IAM, EBS, Route53, SNS, SES, SQS, CloudWatch, Cloud Trail, Security Groups, IAM, AWS infrastructure as code (laC), Python, shell scripting, Maven, Apache Airflow, Microservices, Splunk, Network Security.

DevOps Technical Lead

Toyota
Dallas, TX
11.2017 - 09.2022
  • Responsible for implementing DevOps automation for build and deployments over 8+ teams
  • Developed and managed build and release processes and configurations to support daily software development, testing, production builds, and deployment operations
  • Worked on GitHub Actions as a CI/CD (Continuous Integration and Continuous Deployment) to automate Code Integration and code build and applications code deployment to development, staging, and production environments using Jenkins
  • Written several Docker files to create Docker images and upload images to AWS ECR (Docker container registry), Wrote Kubernetes configuration scripts to deploy and expose the application using Helm chart
  • Implemented AWS solutions using EC2, S3, RDS, EBS, Elastic Load Balancer, and Auto scaling groups along with setting up private networks and sub-networks using the virtual private cloud (VPC) and creating security groups to associate with the networks
  • Identifying Security vulnerabilities, Patching, and Server upgrades
  • Closely working with multiple technical teams, architects, security teams, managers, and business users to review and document workloads, and business requirements
  • Advise business and technology delivery leadership on how to translate the client's requirements into executable technology solutions
  • Responsible for source code management using SVN and GIT, GitHub, and GitOps tools to handle Java, .NET, and C++
  • Participated in ongoing design, implementation, and maintenance of systems and tools to implement the CICD Pipeline
  • Configured PowerShell scripts to automate the Azure cloud system creation including end-to-end infrastructure, VMs, storage, and firewall rules
  • Worked with PowerShell to automate SQL tasks, environment management-related tasks, and procedures on Windows machines
  • Experience with software development tools like Postman, Azure DevOps, GitHub, and Visual Studio Code
  • Experience in deploying Azure Web Services on both Linux and Windows Containers
  • Experience creating Azure DevOps Pipelines in ADO
  • Developed & maintained, and scaled infrastructure for Dev, System test, and Production environments, also developed Shell and Python scripts for automation
  • Provisioning Cloud platforms such as Virtual Networks Multi-site, Cross-site, VMs, Azure Active Directory, Load Balancers, Azure SQL, Cloud Services, etc., and Created Azure Backup vault and protecting required VMs to take the VM backups & Disk Backup in Azure
  • Helm charts creation for managing Kubernetes environment and created reproducible builds of the Kubernetes applications, managed Kubernetes manifest files and managed releases of Helm packages and used Kubernetes and Docker for the runtime environment for the CICD system to build, test, and deploy
  • Deployed and managed services such as virtual machines, Networking, App services, storage, identity, databases on Microsoft Azure and AWS Operational experience with network security appliances with a clear understanding of the architecture behind secure networks
  • Extensively worked in Deploying Microservices on AKS Cluster and Building CI/CD Pipelines
  • Worked with Docker container snapshots, attaching to a running container, removing images, managing directory structures, managing containers, and dealing with Docker Registry
  • Creation and management of fully automated CI/CD pipelines on Azure DevOps using YAML and classic pipelines
  • Continuous deployment and setup of services on Azure such as Containerization service AKS, Azure site recovery and backups for disaster recovery (DR)
  • In charge of creating change requests in ServiceNow/REMEDY/Smart IT tools for production deployments
  • Act as a subject matter expert on development and operations tools along with tracking and managing the Bi-weekly / monthly/quarterly releases
  • Build out an SDLC for the specification to support fast code development and the ability to update the specification file without causing downtime or impacting customers
  • Proficiency in GitOps and the software development life cycle in cloud-native and Kubernetes environments
  • Excellent problem-solving and teamwork skills Great experience with different application and server monitoring tools such as Dynatrace, Grafana, AWS ELK, and bug tracking tools JIRA and Rally.

Cloud Engineer

Capgemini/First Data Corporation
India
05.2016 - 10.2017
  • Design infrastructure as code (laC) Terraform templates and modules to create custom VPC, subnets, Internet Gateway, Route Tables, NAT Gateway, NACLs, Security Groups, EC2 instances and Elastic Load Balancer as per design document
  • Create a high availability cluster using a Load balancer (ELB) and high scalability using Auto-Scaling (ASG)
  • Worked on multiple AWS services such as VPC, VPC Peering, VPN, IAM, EC2, ELB, Auto Scaling, Code Build, Code Deploy, Code Pipeline, CloudWatch, CloudTrail, SNS, SQS, Lambda, Route53, EBS, etc
  • Experience in AWS, which included managing applications in the cloud and creating EC2 instances to integrate with various AWS services
  • Amazon IAM service to enable permission and resources to users
  • Managed role and permission of user with the help of AWS IAM
  • Experienced in Network Security and architecting cloud solutions for various cloud providers
  • Highly proficient in cloud orchestration, Security, Identity & access management, application delivery, data protection, and Network Security
  • Included Security group, Network ACLs, Internet gateway, and elastic IP to ensure a safe area for the organization in the AWS public cloud
  • AWS cloud formation template to create customized size VPC, Subnet, and NAT gateway and ensuring successful deployment of the web application
  • Terraform template to build S3 buckets and their properties (versioning, encryption, object lock), bucket policies as well as lifecycle rules to upload files to S3 Glacier
  • Responsible for monitoring the AWS resources using Cloud Watch
  • Setup/Managing Linux Servers on Amazon (EC2, ELB, Security Groups, Subnet, and IAM)
  • Configuring and Networking of Virtual Private Cloud (VPC)
  • Utilized AWS Elastic Load Balancer (ELB) with EC2 Autoscaling groups to distribute traffic evenly, enhancing application responsiveness and minimizing downtime
  • Linux Operating system Expertise in setting up Upstream & Downstream Jenkins jobs using Groovy scripting
  • Created and managed CICD pipelines, webhooks & workflows using Jenkins, Maven, and Git to improve the efficiency and release bug-free / highly effective applications to the end users and did Code coverage analysis using Coverity
  • Good leadership and teamwork skills - Works collaboratively in an agile environment with DevOps application
  • Create a set of recommendations and Documents for how development teams should consume the specification file to reduce the risk of outages
  • Perform, design, implement, and upgrade information systems to meet the business and user needs
  • Involved in writing Docker files, creating Images, and pushing them to Elastic Container Registry (ECR)
  • Worked on Containerization systems like Docker and orchestrated container applications on Kubernetes, EC2 Container Service (ECS) Kubernetes cluster management was taken care of along with maintaining the PODs
  • Writing Ansible playbooks for installing WebLogic/tomcat applications and deployment of WAR, JAR, and EAR files across all environments
  • Excellent Communication skills.

Build and Release/ Systems Engineer

Syntel/Amex
India
03.2014 - 03.2016
  • Implemented continuous integration & continuous delivery pipelines using Jenkins to automate code build, code test, and code deployment processes for applications, reducing manual effort
  • Administered and Engineered Jenkins for managing weekly Build, Test, and Deploy chain
  • Automated Weekly releases with ANT/Maven scripting for Compiling Java Code, Debugging, and Placing Builds into the Maven Repository
  • Automated Tomcat deployments using Jenkins
  • Administrated JIRA, GIT, Bitbucket, Jenkins, Jfrog Artifactory, SonarQube, Ansible Tower and XL Release
  • Integrated Bitbucket for version control, enabling efficient collaboration and code review among development teams
  • Used Jenkins as a CI tool and installed Maven plugins and Ansible as a configuration management and deployment tool
  • Maintained GitHub repositories for code automation and configuration in continuous integration & continuous deployment Process to build and deploy the applications by utilizing tools such as Maven with plugins in the Jenkins server
  • Wrote Ansible Playbooks to automate the installation of Middleware Infrastructure like Apache Tomcat, JDK, and configuration tasks for new environments
  • Responsible for implementing AWS solutions and setting up the cloud infrastructure with different services, like EC2, S3, VPC, ELB, AMI, EBS, RDS, Auto Scaling, Route53, Subnets, NACL, CloudFront, Cloud Formation, Cloud Watch, Cloud Trail, SQS and SNS
  • Set up Ansible Tower, which provides an easy-to-use dashboard and role-based access control, so that it's easier to allow individual teams access to use Ansible for their deployments
  • Installing, Configuring, and managing in Ansible Centralized Server and creating the playbooks to support various middleware application servers, and involved in configuring the Ansible tower as a configuration management tool to automate repetitive tasks
  • Docker installation was taken care of in various Linux servers to support the Docker Swarm
  • Write shell/python scripts which include build path, build directories, and process to copy deployment packages and run them in deployment directories
  • Developed and configured Bash and Power shell scripts as required for automation
  • Well-used and experienced in deploying code through web application servers like Web Sphere, WebLogic, and Apache Tomcat
  • Migrated applications from SVN, and TFS to GIT; troubleshoot and resolved code merging issues by coordinating with application developers
  • Worked on JIRA for project tracking and updating defects and related tasks which smoothly handled the project
  • Experience working within an Agile methodology team environment
  • Strong understanding of the SDLC and experience in how developers develop, release, and operate systems
  • Possess high working qualities with good interpersonal skills, high motivation, fast learning, and problem-solving skills to provide the best solutions
  • Responsible for all aspects of the software configuration management process including code compilation, packaging, deployment, release methodology, and application configurations
  • Design and implement a platform specification file that conforms to developer and platform infrastructure requirements
  • Collaboration with various teams to resolve issues which include Business, Development, QA, and Support teams.

Education

Bachelor in engineering - Computer Science and Engineering

RGTU, Bhopal (India)

Skills

  • Cloud Services: Amazon Web Services, Microsoft Azure, Google Cloud Platform
  • Configuration Management Tools: Puppet, Ansible, Ansible Tower
  • CI/CD & Build Tools: Jenkins, GitLab, Maven, Ant
  • Container Tools: Kubernetes (K8s), Docker, Docker Swarm, Azure Kubernetes Service (AKS), Google Kubernetes Engine (GKE), Elastic Kubernetes Service (EKS), Amazon Elastic Cloud Service, Azure Container Service
  • Virtualization: VMWare, vSphere
  • Automation Tools: ANT, Terraform, Maven, Jenkins
  • Web Servers: Apache HTTP, Nginx, Apache Tomcat
  • Database System: MySQL, AWS RDS, MongoDB, DynamoDB
  • Scripting languages: Python, Bash shell, Power shell, YAML
  • Networking/Protocol: TCP/IP, DNS, NFS, ICMP, SMTP, DHCP, UDP, NIS, LAN, FTP
  • Version Control Tools: GitHub, Bitbucket, SVN, Gitlab
  • Monitoring Tools: Dynatrace, Splunk, ELK, PagerDuty, Prometheus, CloudWatch, Cloud Trail, Grafana

Certification

  • AWS Solution Architect
  • Certified Kubernetes Administrator (CKA)

Timeline

Lead DevOps Engineer

Cisco
10.2022 - Current

DevOps Technical Lead

Toyota
11.2017 - 09.2022

Cloud Engineer

Capgemini/First Data Corporation
05.2016 - 10.2017

Build and Release/ Systems Engineer

Syntel/Amex
03.2014 - 03.2016

Bachelor in engineering - Computer Science and Engineering

RGTU, Bhopal (India)
Dhiraj Kumar