
Karthik Kotha

Phoenix, AZ

Summary

Over 13 years of experience in Linux administration, DevOps, and systems engineering, with a proven track record in site reliability. Expertise in establishing and managing CI/CD workflows, container orchestration using Kubernetes, and automating processes to enhance system efficiency. Recognized for strong problem-solving abilities and commitment to system stability, achieving high uptime and seamless deployments in cloud environments.

Overview

13 years of professional experience
1 Certification

Work History

Sr Systems Engineer

Bank of America
Phoenix, AZ
04.2024 - Current
  • Administered Linux (RHEL, CentOS) and IBM AIX (Power9/Power10) systems across hybrid infrastructure including VMware, AWS, and Azure platforms.
  • Performed advanced system troubleshooting and performance tuning across on-prem Linux/AIX servers and cloud-hosted virtual machines.
  • Managed AIX LPARs and VIOS configurations on IBM Power systems, ensuring optimal utilization and high availability.
  • Automated OS provisioning and configuration management across cloud and on-prem environments using Ansible and shell scripts (Bash/KSH).
  • Migrated workloads between on-prem and cloud environments (Azure/AWS), including lift-and-shift and hybrid P2V/V2V migrations.
  • Deployed and maintained Linux virtual machines in Azure (using Azure CLI, ARM templates) and AWS (using EC2, CloudFormation).
  • Configured and supported secure, scalable cloud infrastructure leveraging Azure VNets, NSGs, and AWS VPCs, Security Groups, and IAM policies.
  • Maintained VMware vSphere infrastructure, including provisioning VMs, managing vCenter Server, VCSA, PSC, and ESXi hosts.
  • Supported firmware updates, patch management, and system health monitoring for Dell, Intel, Cisco UCS, and IBM Power hardware.
  • Implemented infrastructure-as-code practices for Linux/AIX systems using Ansible and cloud-native tools (Azure Resource Manager, AWS CloudFormation).
  • Implemented high-availability and disaster recovery (DR) strategies using VMware HA/DRS, Azure Availability Sets/Zones, and AWS Auto Scaling Groups.
  • Managed OS and application patching using YUM, RPM (Linux), and NIM/SUMA (AIX) across hybrid environments.
  • Conducted system hardening and enforced security baselines across Linux, AIX, and cloud-hosted instances based on CIS benchmarks.
  • Participated in 24x7 on-call support rotation, handling incident response, root cause analysis, and proactive remediation in both on-prem and cloud ecosystems.
  • Monitored infrastructure health and performance using native and third-party tools (CloudWatch, Azure Monitor, Nagios, errpt, top, vmstat, sar).

Site Reliability Engineer

Walmart
Phoenix, AZ
11.2023 - 03.2024
  • Managed and supported scalable cloud infrastructure on AWS using VPC, EC2, S3, RDS, DynamoDB, Route 53, IAM, Lambda, ELB, Auto Scaling, SNS, SQS, Redshift, ECS, ECR, and EKS.
  • Provisioned and managed cloud resources across multiple environments using Infrastructure-as-Code tools (CloudFormation and Terraform).
  • Developed AWS Lambda functions to process incoming event data, aggregate results, and store output in DynamoDB and S3 for analytics and archiving.
  • Implemented monitoring and observability using CloudWatch, CloudTrail, and custom dashboards to proactively detect, alert, and resolve performance and availability issues.
  • Administered IBM MQ and AWS SQS/SNS messaging systems in Linux environments to ensure reliable asynchronous communication between distributed services.
  • Designed and deployed content delivery and caching strategies using CloudFront, optimizing performance and availability for global users.
  • Automated provisioning and configuration of Red Hat Linux systems using Ansible, integrated with Terraform for infrastructure deployment and Vagrant/Oracle VM for local dev environments.
  • Built and deployed containerized microservices using Docker, including writing Dockerfiles, managing images in ECR, and integrating Docker with Jenkins CI/CD pipelines.
  • Worked with Docker registries (private, Artifactory, ECR) to version, store, and deploy application containers securely and efficiently.
  • Orchestrated CI/CD pipelines using Jenkins, GitLab CI, and Bitbucket Pipelines, integrating with Nexus, JIRA, and container registries to streamline build and release cycles.
  • Configured S3 buckets with custom policies and lifecycle rules for hot/cold storage management, using Glacier for archival and cost optimization.
  • Designed high-availability architectures using Auto Scaling Groups, Elastic Load Balancers (ELB), and multi-AZ RDS for fault tolerance and scalability.
  • Applied IAM policies and roles to secure access to AWS services, using fine-grained permissions to enforce least privilege and compliance.
  • Implemented automation and configuration drift detection across environments using AWS Config, Terraform plan/apply, and Git-based infrastructure management.
  • Led environment provisioning, application deployment, and incident response for production, staging, and development stacks using DevOps best practices and SRE principles.

Sr System Engineer

CVS Healthcare
Phoenix, AZ
11.2015 - 12.2021
  • Deployed and configured Red Hat Enterprise Linux (RHEL) on Dell PowerEdge servers using Kickstart templates, ensuring consistent OS provisioning across physical and virtual environments.
  • Automated RHEL provisioning and configuration management with Ansible, including package installation, system hardening, and service configuration.
  • Designed and maintained LVM (Logical Volume Manager) configurations on RHEL for efficient storage management and system scalability.
  • Provisioned and managed AWS EC2 instances, implementing secure and highly available cloud infrastructure with automated deployments.
  • Utilized AWS CloudWatch Logs and Events to centralize monitoring and trigger automated remediation actions, improving operational efficiency.
  • Executed seamless AWS workload migrations using AWS MGN, coordinating closely with AWS support to ensure zero data loss and minimal downtime.
  • Automated the provisioning of Linux servers on Azure via Jenkins pipelines, supporting CI/CD workflows and improving deployment consistency.
  • Integrated Azure Active Directory (AD) for role-based access control and secure authentication across Azure-hosted services.
  • Built and maintained HA clusters using Veritas Cluster Server, HACMP, and GPFS for both AIX and RHEL platforms to ensure continuous availability.
  • Led AIX OS and storage migrations, including upgrades from AIX 7.1 to 7.2 and data transfers via EMC storage, with minimal downtime through careful planning and validation.

Systems Administrator

State Farm
Phoenix, AZ
08.2012 - 10.2015
  • Configured and maintained Red Hat and Veritas Clusters to ensure high availability of mission-critical applications, including integration with Oracle RAC and DB2 databases.
  • Monitored and managed over 16,000 UNIX/Linux servers across test, development, and production environments, using tools like Puppet, Nagios, and BMC for automation and performance optimization.
  • Collaborated with cross-functional teams (VMware, SAN, hardware) to perform hot/cold migrations, SAN storage provisioning, and hardware replacements with zero downtime, following ITIL and CMRB protocols.

System Administrator

Capital One
Richmond, VA
01.2012 - 05.2012
  • Installed, configured, and updated Red Hat Linux (5.x, AS 3/4.x) and Solaris (8, 9, 10) systems using Kickstart, Jumpstart, and Live Upgrade, including creation of Zones and Containers in Solaris 10.
  • Collaborated with Oracle DBA teams to optimize OS performance and install Oracle 9i/10g on both Solaris and Red Hat environments.
  • Configured Apache 2.x with Name-Based and IP-Based Virtual Hosts and managed SSL certificate generation for secure web hosting.

Education

Master of Science - Electrical Engineering

Gannon University
Erie, PA
05.2011

Skills

  • Operating Systems: Linux (Red Hat, SUSE), SLES (8–11), HP-UX (11.x), IBM AIX (4.3–7.x), Windows (NT, 2003, 2008)
  • Server Hardware: HP ProLiant (DL 480 G4–G7, BL 460C G8), HP Blade Enclosures (C7000), Dell PowerEdge (2800, R310, R410, R710), Intel Servers
  • Cloud and Containerization: AWS, Azure, Google Cloud Platform (GCP), Kubernetes (K8s), Docker
  • Monitoring Tools: Nagios, Prometheus, Grafana, New Relic, Datadog, Splunk, AppDynamics, Dynatrace, SolarWinds, ELK (Elasticsearch, Logstash, Kibana), BigPanda, vROps, ThousandEyes, Foglight
  • DevOps & CI/CD: Chef, Puppet, Ansible, Jenkins, Bamboo, Git, SVN, Vagrant
  • Build & Artifact Tools: Maven, Gradle, Artifactory, Nexus, Sonar
  • Networking & Protocols: Cisco & Extreme Networks, Brocade Switches; Protocols: TCP/IP, UDP, RIP, OSPF, EIGRP, IGRP, SNMP, SMTP, TELNET; Network Services: DNS, DHCP, NIS, NFS, WAN, LAN, FTP/TFTP
  • Web/Application Servers: Apache (2.x–3.x), Tomcat, WebLogic (8–10), WebSphere (4.0, 5.0), IIS (6.0, 7.0)
  • Programming & Scripting: UNIX Shell, Perl, Python, Ruby, PHP, C, VB, HTML
  • Backup & Storage Management: Veritas Volume Manager, Tivoli, NetBackup, RAID, EMC Storage, Double-Take, BMC BladeLogic

Certification

  • Certified Kubernetes Administrator (CKA)

https://www.credly.com/badges/6a7ae4f2-9be5-48db-b6ba-a4d875b5e96c/linked_in?t=rxdue7
