Summary
Overview
Work History
Education
Skills
Websites
Certification
Timeline
Generic

Abhishek Singh

Irving,TX

Summary

Seasoned software professional with over 15 years of experience in development, operation support, design, and team leadership. Successfully managed software development projects and built high-performing teams. Skilled in infrastructure management, deployment, and releases using Azure, Kubernetes, Docker, and Jenkins. Expertise in implementing Continuous Integration and Continuous Deployment pipelines with Jenkins, Ansible, Terraform, and Docker containers for seamless automation. Proficient in leading collaboration efforts with software engineering teams to deploy and operate cloud infrastructure, while developing Infrastructure as Code on Azure cloud for efficient environment creation and resource management.

Overview

17
17
years of professional experience
1
1
Certification

Work History

IT - Technical Lead Cloud Platform | Azure Development & Solution Architecting

NT
08.2024 - Current
  • Azure Migration on prem to Azure cloud
  • Created and configured Azure VMs(Azure Virtual Machines) to meet application requirements
  • Working on Azure environment setup with different teams.
  • Supporting team to troubleshoot deployment issues like Azure, docker, and Kubernetes-related issues.
  • Building Kubernetes clusters for Kubernetes clients, doing regular updates for those clients also doing OS patches for those clusters.
  • Namespace creation for onboarding in Kubernetes
  • Managing Roles and permission for clients
  • Participate in User Requirement workshops.
  • Utilized IAC Terraform for automated infrastructure deployments (AWS ELB, AWS EC2, AWS Security Groups, AWS VPC, AWS Key Pairs, AWS IAM). Manage and replicate infrastructure and instance configurations using Terraform.
  • Using Python for automating cloud infrastructure provisioning and management.
  • Continuous Integration/Continuous Deployment (CI/CD): Integrate Ansible with CI/CD pipelines to automate the testing and deployment processes.
  • CI/CD Pipeline Management also Pipeline Implementation: Build and maintain CI/CD pipelines to automate code integration, testing, and deployment.
  • Python Developer:
  • Developed and maintained scalable backend applications using Python,integrating RESTful APIs and database models.
  • Automated infrastructure tasks and deployment processes using Python scripts, improving release efficiency
  • Implemented unit and integration tests using pytest and CI/CD integration to ensure code quality and rapid delivery.
  • Worked closely with DevOps teams to integrate Python-based microservices into containerized environments using Docker and Kubernetes.
  • Terraform Developer:
  • Designed and implemented Infrastructure as Code (IaC) using Terraform to automate provisioning of cloud resources across AWS/Azure.
  • Created and managed reusable Terraform modules for scalable infrastructure components like VPCs, EC2 instances, AKS/EKS clusters.
  • Integrated Terraform with CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins) for automated deployments and environment management.
  • Managed Terraform state files using remote backends like Azure Blob Storage or AWS S3 with state locking via DynamoDB/blob lease.
  • Enforced compliance and security standards by implementing Terraform validations, input variables, and policy-as-code with Sentinel or OPA.

Sr. DevOps Engineer

Verizon
11.2023 - 05.2024
  • Design the architecture for the rehydration process.
  • In my current and previous roles, I have demonstrated exceptional managerial and technical skills. My experience includes:
  • Leading over 10 teams, mentoring them through the submission of proofs of concept (POCs) to various clients, resulting in the acquisition of new projects.
  • Performing Scrum roles such as task and story assignments, ensuring smooth project execution.
  • Regularly communicating with clients to provide project updates and managing outages effectively.
  • Ensure the restored environment aligns with best practices and performance requirements.
  • Plan for scalability, security, and compliance.
  • Document the architecture and the rehydration process.
  • Develop and execute test plans to validate the restored environment.
  • Identify and document any issues or discrepancies.
  • Work with developers and engineers to resolve any problems found during testing.
  • Provide ongoing support during and after the rehydration process.
  • Monitor the restored environment for issues.
  • Respond to user reports and troubleshoot any problems.
  • Maintain documentation and knowledge bases for future rehydration processes
  • Implement the rehydration process using AWS services (e.g., S3, lambda function, step function EC2).
  • Automate the rehydration process using scripts and AWS tools (e.g., AWS CLI, CloudFormation, IAC Terraform).
  • Monitor the progress of data rehydration and resolve any technical issues.
  • As a Python DevOps, my roles and responsibilities were, writing Script for AWS EC2 Instance Management.
  • Used Python for automating cloud infrastructure provisioning and management.
  • Jobs and Duties in OpenShift Kubernetes
  • Platform Administration:
  • Cluster Management: Installing, configuring, and managing OpenShift clusters, including handling cluster upgrades and scaling.

Sr. DevOps Engineer

AT&T
01.2022 - 11.2023
  • Gather new requirements for different modules, drawing out a plan for releases of the enhancements for the application requested by the Business and application teams for Azure migration of an application from on-premise and planning work for subsequent releases.
  • Migrating multiple legacy applications to Azure Cloud using IAC Terraform and Kubernetes by following organization DevOps practices.
  • Implementation of Continuous Improvement of monitoring/alerting tools and process
  • Planning of Business Continuity and Disaster Recovery plans including providing High availability for all applications
  • Creating, validating, and reviewing solutions and effort estimates of converting existing workloads from classic to ARM-based Azure Cloud Environment.
  • To analyze existing applications on-premises, determine, and plan with proper recommendations for migration and modernization of applications to Azure.
  • Develop Infrastructure as a Code to create environments and resources on Azure cloud and run automation scripts to reproduce multiple environments.
  • Jobs and Duties in OpenShift Kubernetes
  • Platform Administration:
  • Cluster Management: Installing, configuring, and managing OpenShift clusters, including handling cluster upgrades and scaling.
  • Monitoring and Logging: Implementing monitoring solutions (e.g., Prometheus, Grafana) and centralized logging to maintain system health and performance.
  • DevOps Integration:
  • CI/CD Pipelines: Integrating OpenShift with Continuous Integration/Continuous Deployment (CI/CD) tools (e.g., Jenkins, Tekton) to automate the build, test, and deployment processes.
  • Lead automating various aspects of the build and deployment process starting from ground zero for streamlining operations and processes.
  • Implemented and configured Kibana for log aggregation, monitoring, and visualization of system and application logs.
  • Using Jenkins and Pipelines to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, created Pods, and managed using Kubernetes.
  • Automating, Configuring, and Deploying Instances on Azure environments and in Data centers and migrating on-premise to Windows Azure using Azure Site Recovery and Azure backups.
  • Responsible for implementing containerized applications on Azure Kubernetes by using Azure Kubernetes Service (AKS), Kubernetes Cluster, which are responsible for cluster management, Virtual Network to deploy agent nodes, Ingress API Gateway, MySQL Databases, and Cosmo DB for stateless storage of external data, and set up reverse proxy Nginx in the cluster.
  • Defined and monitored SLOs and SLAs to ensure services meet expected performance and reliability standards.
  • Created several IAC Terraform scripts to build the infrastructure in a cloud environment and configured it using Chef Cookbooks.
  • Using Azure Terraform to deploy the infrastructure necessary to create development, test, and production environments for a software development project.
  • Analyzed log data and metric trends using Kibana and Grafana to identify anomalies, performance bottlenecks, and potential security threats.
  • Utilized advanced querying and filtering capabilities of Kibana and Grafana to extract actionable insights from large datasets.
  • Implemented unit and integration tests using pytest and CI/CD integration to ensure code quality and rapid delivery.
  • Worked closely with DevOps teams to integrate Python-based microservices into containerized environments using Docker and Kubernetes
  • Preparing and analyzing the weekly performance report of the product for different global markets and sharing the same with the business management team.
  • Regular interaction with the offshore team manager/leader as well as Microsoft Technical Managers, System Administration experts, and Quality Assurance using the proprietary tools of HCL.
  • Interacting with onsite and offshore teams in resolving their problems.
  • Led the implementation and configuration of ELK (Elasticsearch, Logstash, Kibana) stack for log aggregation, analysis, and visualization.
  • Resolving deployment issues and coordination with Operations for deploying services in production.
  • Preparing and sending out the weekly status report to the client and HCL managers.
  • Being DevOps Engineer with IAM jobs were, Integrate IAM solutions with DevOps tools and workflows, Manage secrets, keys, and certificates used in the DevOps pipeline, Implement and manage multi-factor authentication (MFA) for DevOps processes.
  • Being Site Reliability Engineering jobs and duties were Monitoring(Implement monitoring solutions to ensure the availability and reliability of services. Use tools like Prometheus, Grafana).
  • Incident Response:Develop and manage incident response processes, including alerting, incident handling, and post-mortem analysis.
  • CI/CD Pipeline Management also Pipeline Implementation: Build and maintain CI/CD pipelines to automate code integration, testing, and deployment.
  • Release Management: Manage the release process, ensuring smooth and reliable deployment of new features and updates.
  • Experienced in Developing scripts in Python to support continuous integration and continuous deployment processes.
  • Written, Python scripts to automate repetitive tasks, such as EC2 rehydration.
  • Implemented and managed infrastructure using tools like Terraform or AWS CloudFormation, scripting logic in Python.
  • Ansible roles and responsibilities
  • 1:Infrastructure Automation
  • Provisioning: Automate the provisioning of servers and infrastructure components.
  • Configuration Management: Use Ansible to ensure that systems are configured according to the desired state.
  • Resource Management: Manage and optimize the allocation of resources.
  • 2. Application Deployment
  • Automated Deployment: Create playbooks and roles to automate the deployment of applications.
  • Continuous Integration/Continuous Deployment (CI/CD): Integrate Ansible with CI/CD pipelines to automate the testing and deployment processes.
  • Rollback Strategies: Develop and implement rollback procedures in case of deployment failures.

Sr. DevOps Engineer

AT&T
08.2013 - 01.2022
  • Working on Azure environment setup with different teams.
  • Supporting team to troubleshoot deployment issues like Azure, docker, and Kubernetes-related issues.
  • Building Kubernetes clusters for Kubernetes clients, doing regular updates for those clients also doing OS patches for those clusters.
  • Namespace creation for onboarding in Kubernetes
  • Managing Roles and permission for clients
  • Participate in User Requirement workshops.
  • Developed and maintained scalable backend applications using Python,integrating RESTful APIs and database models.
  • Automated infrastructure tasks and deployment processes using Python scripts, improving release efficiency
  • Utilized IAC Terraform for automated infrastructure deployments (AWS ELB, AWS EC2, AWS Security Groups, AWS VPC, AWS Key Pairs, AWS IAM). Manage and replicate infrastructure and instance configurations using Terraform.
  • Implementing monitoring and logging solutions using ELK Stack, enabling proactive identification and resolution of system issues.
  • Jobs and Duties in OpenShift Kubernetes
  • Platform Administration:
  • Cluster Management: Installing, configuring, and managing OpenShift clusters, including handling cluster upgrades and scaling.
  • Monitoring and Logging: Implementing monitoring solutions (e.g., Prometheus, Grafana) and centralized logging to maintain system health and performance.
  • DevOps Integration:
  • CI/CD Pipelines: Integrating OpenShift with Continuous Integration/Continuous Deployment (CI/CD) tools (e.g., Jenkins, Tekton) to automate the build, test, and deployment processes.
  • Experience in deploying the Kubernetes environment on AWS and managing the rolling deployments of the patched docker images onto the cluster by tweaking the Pod definition.
  • Created various Terraform scripts to deploy Kubernetes, Dev, QA, and Pre-prod environments in AWS Cloud. Manage and replicate infrastructure and instance configurations using Terraform.
  • Elicit and understanding of Business and Technical Requirements.
  • Analyse and document Business and Technical Requirements.
  • Configured alerting rules and thresholds in Kibana and Grafana to proactively monitor system health and notify stakeholders of critical events.
  • Managed centralized log management using ELK stack, enabling real-time log monitoring, search, and analysis for troubleshooting and performance optimization.
  • Created Kibana dashboards and visualizations to provide insights into system and application logs, identifying trends, anomalies, and security incidents.
  • Automated the creation and maintenance of Kibana and Grafana dashboards using templating and scripting techniques to streamline monitoring setup across environments.
  • Creating and maintaining Infrastructure for Applications in AZURE using Terraform. Building and Automating CICD pipelines for different applications using Jenkins & Azure Pipelines.
  • Worked on a variety of services involving public cloud AWS to provision and configure such as EC2, EKS, ECR, VPC, CloudWatch, CloudTrail, Route53, Lambda, etc., and automated the services using Terraform.
  • Middleware application failure analysis and troubleshooting correlates implemented changes and service exceptions and take necessary action to resolve an issue.
  • Experience in working with Docker service rolling updates and involved in the implementation of BLUE GREEN DEPLOYMENT to attain zero downtime.
  • Optimized Kibana and Grafana configurations to improve query performance, reduce resource consumption, and enhance overall system efficiency.
  • Implemented Jenkins Pipeline as a Code for various Java repositories and implemented Sonar scanner using Docker file for code quality check and integrated them into Jenkins to achieve Continuous Integration.
  • Deployed application which is containerized using Docker onto a Kubernetes cluster which is managed by Amazon Elastic Container Service for Kubernetes (EKS).
  • Automating deployment, monitoring, and management processes using PowerShell DSC and Azure Automation
  • Define and manage roles and permissions for secure access to Azure resources.
  • Used(Role Based Access Control)RBAC to Manage and store unstructured data, such as text or binary data
  • Used and developed ARM templates to define and deploy Azure resources consistently.
  • Resource Management: Manage and organize resources using ARM, including deployments, updates, and deletions.
  • Created and configured Azure VMs(Azure Virtual Machines) to meet application requirements
  • Managed and stored unstructured data, such as text or binary data
  • Used Azure Active Directory to Implement and manage access controls using AAD, including single sign-on (SSO) and multi-factor authentication (MFA).
  • Interaction with a client right from the requirements stage to delivery of the application.
  • Understanding of the change requests that are raised by the client after the development starts.
  • Documentation of the change requests that are raised by the client from the point of initial alert or notification through resolution or escalation. Assesses problem impacts on service and notifies appropriate groups per Guidelines.
  • Be an interface between the offshore delivery team and the customer (AT&T).
  • Perform problem analysis/resolution of application processing exceptions from the point of initial alert or notification through resolution or escalation. Assesses problem impacts on service and notifies appropriate groups per Guidelines.
  • Documentation of alert resolution and middleware application issues to reduce the MTTR (Mean time to resolve)
  • Manage BMC-related components to provide accurate and timely event notification of application and system-monitored components.
  • Being Site Reliability Engineering jobs and duties were Monitoring(Implement monitoring solutions to ensure the availability and reliability of services. Use tools like Prometheus, Grafana).
  • Incident Response:Develop and manage incident response processes, including alerting, incident handling, and post-mortem analysis.
  • CI/CD Pipeline Management also Pipeline Implementation: Build and maintain CI/CD pipelines to automate code integration, testing, and deployment.
  • Release Management: Manage the release process, ensuring smooth and reliable deployment of new features and updates.
  • Ansible roles and responsibilities
  • 1:Infrastructure Automation
  • Provisioning: Automate the provisioning of servers and infrastructure components.
  • Configuration Management: Use Ansible to ensure that systems are configured according to the desired state.
  • Resource Management: Manage and optimize the allocation of resources.
  • 2. Application Deployment
  • Automated Deployment: Create playbooks and roles to automate the deployment of applications.
  • Continuous Integration/Continuous Deployment (CI/CD): Integrate Ansible with CI/CD pipelines to automate the testing and deployment processes.
  • Rollback Strategies: Develop and implement rollback procedures in case of deployment failures.
  • Environment & Tools: Azure, Terraform, Kubernetes, Grafana, Chef, Ansible, Git, Dynatrace, Kibana,Linux/Unix (Red Hat) and Windows.

Associate Operation

EXL (Client: Allstate)
10.2012 - 08.2013
  • Ensured that the services were delivered to the client within the specified time frame.
  • Supporting SQL/JAVA/UNIX applications on 24 x 7 basis.
  • The core role is to keep the applications up and available 24/7.
  • Developed packages, stored procedures, and functions using PL/SQL
  • Raising outage calls in case of degradation of service
  • Conducted performance tuning and optimization of infrastructure components and application services to improve scalability, reliability, and cost efficiency.
  • Carrying out the Deployments & Sanity testing of the applications on every new release.
  • Monitor performance and troubleshoot alerts.
  • Engaged with vendors and evaluated third-party solutions, used GRM tools, considering factors such as scalability, cost-effectiveness, and alignment with organizational requirements.
  • User account management and administration.
  • Administered server monitoring, batch jobs, and interfaces by firing SQL queries in TOAD.
  • Environment & Tools: AWS, Azure, Terraform, Kubernetes, Grafana, Prometheus, Docker, Chef, Jenkins, GIT, Splunk, Shell Script, Linux/Unix (Red Hat, Centos).

Sr Analyst Linux Administration

Igate (Client: Rockwell )
08.2008 - 11.2011
  • Administered server monitoring, batch jobs, and interfaces by firing SQL queries in TOAD.
  • Supporting SQL/JAVA/UNIX applications on 24 x 7 basis.
  • The core role is to keep the applications up and available 24/5.
  • Raising outage calls in case of degradation of service.
  • Administered data flows from IFS to other legacy applications and vice versa.
  • Carrying out the Deployments & Sanity testing of the applications on every new release.
  • Monitor performance and troubleshoot alerts.
  • Created RFC and OCR as per request and modification in process.
  • Ensure key deliverables meet the timelines (SLA Management).

Education

BSc -

Purvanchal University
01.2004

MCA - undefined

GIMT, Galgotia, Uttar pradesh technical university
01.2008

Skills

  • Version Control Tools, CI Tools & Build Tools Git, SVN, GitHub, Atlassian Bitbucket, GitLab, Jenkins, Atlassian Bamboo, Apache ANT, Maven
  • Languages Shell, Python, PowerShell
  • Databases Oracle, MySQL, MariaDB, and MSSQL
  • Artefact Repository Management Tools Nexus, JFrog
  • Configuration Management Tools Chef, Ansible, Ansible Tower
  • Containerization/Orchestration Docker, Kubernetes, ECS, ECR, EKS, AKS, ACR
  • Cloud Providers AWS, Azure
  • Bug Tracking Tools JIRA, Kanban Board
  • Monitoring Tools Dynatrace,NAGIOS, Splunk, Zabbix, Cloud Watch, and Grafana, kibana, Azure Monitor
  • Operating Systems/Virtualization Tools Linux, Windows Servers, Ubuntu, Virtual Box, VMware, CentOS, RHEL, and Hyper-V
  • Message brokers Kafka , RabbitMQ
  • Application Servers/ Middleware Apache Tomcat, WebSphere
  • Build and Release Engineering/ DevOps Gitlab, MAKE, Jenkins, Cruise Control, Ansible, Vagrant, AWS, Nagios, Rally, Hudson, Ant, Maven, Gradle, Docker
  • Azure Services Azure Virtual Machines (VMs), Azure Load Balancer or Azure Application Gateway, Azure Storage Account, Azure Virtual Network, Azure Database for MySQL/PostgreSQL, Azure Active Directory (AAD), Azure Resource Manager (ARM), Azure Monitor, Azure Activity Log and Azure Policy, Azure Service Bus, Azure ExpressRoute, Azure Logic Apps, Azure Function Apps, Azure API Management, Azure Service Bus, Azure Event Hubs
  • AWS Services AWS EC2 (Elastic Compute Cloud), AWS ELB (Elastic Load Balancer), AWS S3 (Simple Storage Service), AWS Glacier, AWS VPC (Virtual Private Cloud), AWS RDS (Relational Database Service), AWS IAM (Identity and Access Management), AWS CloudFormation, AWS CloudWatch, AWS CloudTrail, Azure Blob Storage, AWS SNS (Simple Notification Service), AWS SQS (Simple Queue Service), AWS SWF (Simple Workflow Service), Amazon Direct Connect

Certification

  • Certified in Kubernetes Administrator (CKA)
  • AZ-400 - Designing and Implementing Microsoft DevOps Solution
  • AZ-400 - Microsoft Certified: DevOps Engineer Expert https://learn.microsoft.com/api/credentials/share/en-us/ABHISHEKSINGH-3562/B64DDE7AAE66A591?sharingId=AA38EC66C7A34E57
  • AZ-104 Microsoft Certified: Azure Administrator Associate https://learn.microsoft.com/api/credentials/share/en-us/ABHISHEKSINGH-3562/44AD92EA7020FD113?sharingId=AA38EC66C7A34E57
  • AI-900 Microsoft Certified: Azure AI Fundamentals https://learn.microsoft.com/api/credentials/share/en-us/ABHISHEKSINGH-3562/2C85AAC1E5464FA77?sharingId=AA38EC66C7A34E57
  • AZ 900 - Microsoft Azure Fundamentals
  • Completion online Cloud Computing, Agile and Scrum, Terraform, MS Azure DevOps, Python(Udemy Certified)
  • ITIL Foundation Certified

Timeline

IT - Technical Lead Cloud Platform | Azure Development & Solution Architecting

NT
08.2024 - Current

Sr. DevOps Engineer

Verizon
11.2023 - 05.2024

Sr. DevOps Engineer

AT&T
01.2022 - 11.2023

Sr. DevOps Engineer

AT&T
08.2013 - 01.2022

Associate Operation

EXL (Client: Allstate)
10.2012 - 08.2013

Sr Analyst Linux Administration

Igate (Client: Rockwell )
08.2008 - 11.2011

MCA - undefined

GIMT, Galgotia, Uttar pradesh technical university

BSc -

Purvanchal University
Abhishek Singh