Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Hari Krishna Karanam

Summary

Results-driven IT professional with over 18 years of experience as a Subject Matter Expert and Lead Engineer in System Validation, DevOps, and MLOps, specializing in AWS, CI/CD pipelines, and cloud automation. Proven expertise in designing and implementing scalable cloud-based applications across PaaS, IaaS, and SaaS environments, along with strong command of configuration management using Ansible and Kubernetes orchestration. Recognized for enhancing code quality through the integration of CI/CD tools with security checks, achieving a 35% reduction in code smells across microservices. Committed to fostering collaboration and driving successful project outcomes in fast-paced environments while delivering innovative solutions that meet organizational goals.

Overview

19
19
years of professional experience
1
1
Certification

Work History

SME DevOps Engineer

Comcast
Sunnyvale, CA
07.2024 - Current
  • Designed and implemented an end-to-end MLOps workflow, covering model training, versioning, deployment, serving, scaling and lifecycle management.
  • Containerized ML workloads using Docker, ensuring reproducible builds and immutable model artifacts across environments.
  • Deployed ML models on Kubernetes clusters, implementing Deployments, Services and Ingress for scalable and resilient inference.
  • Integrated external model artifacts storage (s3 and HTTP-based endpoints) for decoupled model deployments in Kubernetes.
  • Understood and applied SageMaker Domain, User Profiles and IAM execution roles, aligning ML access control with enterprise security practices.
  • Designed IAM-based role separation for Data Scientists, ML Engineers and MLOps Engineers within SageMaker environments.
  • Deployed and managed AWS EMR on Amazon EKS to run containerized Spark workloads, improving scalability and resource utilization.
  • Designed and executed big data processing pipelines using EMR Spark on Kubernetes, handling large datasets.
  • Developed Python/Scala Spark scripts for complex transformations, aggregations, and machine learning pipelines running on EMR on EKS.
  • Knowledge on ETL pipelines using AWS Glue to extract, transform, and load large datasets efficiently from DB.
  • Developed a strong understanding of real-world MLOps responsibilities, including model reliability deployment safety, infrastructure automation and Operational scalability.
  • Worked with different scripting languages such as PowerShell, Ruby, Groovy, Python as a part of writing scripts for the automation tasks like release processes, applications migration, and monitoring application health during migration etc.
  • Managed EKS cluster node pools with multi-type HW benches jobs.
  • Configured AWS Identity Access Management (IAM) Group and users for improved login authentication.
  • Involved heavily in setting up the CI/CD pipeline using Jenkins, Maven, Nexus, GitHub, Ansible, Terraform and AWS.
  • Worked on Docker container snapshots, attaching to a running container, removing images, managing director structures, and managing containers in AWS EKS.
  • Used Terraform for managing the infrastructure through the terminal sessions and executing scripts in creating alarms and notifications for EC2 instances using AWS Cloud Watch. Wrote Ansible Playbooks for various applications and deploying them in AWS using Terraform.
  • Worked on designing and implementing CI/CD pipelines for sea shark using Jenkins and Gitlab.
  • Implemented automation for deployments by using YAML scripts for massive builds and releases.
  • Experienced in working with Agile Methodology and was responsible for monitoring and managing the development.
  • Configured VMs availability sets using AWS to provide resiliency for IaaS based solution and scale sets using Resource Manager to manage network traffic.
  • Worked on service mesh like Istio for advanced service discovery of microservices running in the cluster.
  • Worked with developers in mitigating issues with pipelines on Jenkins and ArgoCD.
  • Involved in Jira as defect tracking system and configure various workflows, customizations, and plugins for Jira bug/issue tracker integrated Jenkins with Jira, GitHub.
  • Working to set up CI/CD pipeline integrating different tools with Jenkins to build and run Terraform jobs in order to create infrastructure.
  • Good understanding of Kubernetes Clusters platform in managing Docker containers and upgrading versions.
  • Created and Maintained Docker files in Source Code Repositories, building images and ran containers for applications, handled multiple Docker images primarily for middleware installations and domain configurations.
  • Build and maintain Docker images for applications.
  • Created homebrew alarms and trigger points in Jenkins to monitor services running in dev, stage, and production environments.
  • Used Kubernetes to orchestrate the deployment, scaling, and management of Docker Containers.
  • Wrote Ansible playbooks from scratch in YAML. Installing, setting up & Troubleshooting Ansible, created and automated platform environment setup.
  • Used Ansible as configuration management tools to automate repetitive tasks, quickly deploy mission-critical applications and proactively manage changes.

DevOps Engineer

Renault
Banglore, India
03.2018 - 06.2024
  • Worked on creating a Continuous Delivery CI pipeline with jenkin Pipeline to automate builds with AKS.
  • Designed and built custom VNET setups, maintained the environment for rapid deployment, setup High Availability Cloud architectures in AWS. Architect.
  • Maintained the best-in-class Azure Cost Optimization, Azure Cloud Security, Operational Excellence, Performance, and Reliability for Azure Cloud architecture and deployments.
  • Integrated CI/CD pipelines successfully, resulting in a 40% reduction in deployment times and an increased release frequency.
  • Experience with AWS services like Amazon Vms and storage .AKS.
  • Configured groovy script, shared Jenkins libraries and multi-branch pipeline to handle automated deployment for Dev, QA, UAT and PROD environment.
  • Migrating and maintaining build and test environments into the Cloud Infrastructure.
  • Working with developers to fine-tune process for newly introduced Atlassian tools in organization like JIRA, and Git-Stash repository.
  • Experience in working on Docker, Jenkins, for continuous integration and for End-to-End automation for all Build and Deployments.
  • Configured AWS Identity Access Management (IAM) Group and users for improved login authentication.
  • Involved heavily in setting up the CI/CD pipeline using Jenkins, Maven, Nexus, GitHub, Ansible, Terraform and Azure.
  • Automated various infrastructure activities like Continuous Deployment, Application Server setup, Stack monitoring using Ansible playbooks and has Integrated Ansible with Jenkins.
  • Collaborated with development & support teams to service a continuous delivery environment with the use of build platform tool DOCKER.
  • Involved with Docker and Kubernetes on multiple cloud providers, from helping developers build and containerize their application (CI/CD) to deploying either on public or private cloud.
  • Migrated on-premises workloads to Azure using hybrid/multi-cloud architectures, ensuring secure data transfer and minimal downtime.
  • Development of automation of Kubernetes clusters with Ansible, writing playbooks.

Lead engineer

Qualcomm
Hyderabad, India
05.2015 - 03.2018
  • Leading the validation team of end-to-end system catering related to power estimation consumed by devices, identification of hot spots and their optimization.
  • Debugging the issues and proposing the solutions to the stakeholders.
  • Worked with integration teams for setting up local automation development using python.
  • Delivery ownership, root cause analysis of issues in earlier deliverables of firmware and embedded releases.
  • Develop and execute manual and automated test plans for embedded systems in mobile and wireless products.
  • Conduct functional, performance, and stress testing on embedded firmware running on Snapdragon processors.
  • Identify, log, and track bugs using JIRA, providing detailed steps for reproduction and ensuring timely resolution.
  • Collaborate with cross-functional teams (hardware engineers, firmware developers) to identify potential improvements and resolve integration issues.
  • Perform unit testing using CMock and Unity frameworks for embedded software.
  • Use tools like Lauterbach and JTAG debuggers for on-chip debugging and system-level testing.
  • Identified software bugs and system anomalies, providing comprehensive bug reports with reproduction steps.

Lead engineer

Google Ind PVT
Hyderabad, India
12.2012 - 05.2015
  • Taking owner ship for some module in regression testing.
  • Gathering requirements for GCP printers for Logo certification.
  • Creating Virtual Machines Using Proxmox Application and Validating Printer Functionalities.
  • Gathering requirements for third party UI applications for GCP Logo certification.
  • Working on Selenium for automation of different web Based application (Chrome, Firefox) using python.
  • Android OS Testing for different Platforms with Google Apps (G+, G drive and Photos ).
  • IOS Testing for different Platforms on Apple devices with Google Apps (G+, G drive and Photos ).
  • Chrome OS Testing for different Platforms.
  • Created test plans, decided testing strategies.
  • Written test cases for - functional and integration testing.
  • Resolving issues for any software issues of virtual machines.
  • Defect tracking, Root cause analysis and documenting test reports, bug reports and release notes.
  • Worked on troubleshooting various Linux server and network support issues using log files and tools like ping, telnet and traceroute for application teams.

Sr. System validation engineer

Intel
Banglore, India
07.2011 - 12.2012
  • Graphics Driver Testing for Intel clover view tablet platform.
  • Involved in Test Planning, Test cases writing, Test execution and Reporting.
  • WHQL:
  • Involved and responsible for WHQL certification for the driver.
  • Installing and Configuring the WHQL Client-Server environment (Using WLK 1.4, 1.5).
  • Hardware validation and qualification of server platforms, with deep expertise in BMC , PCIe, CPU, and GPU subsystems.
  • Performing firmware deployment and upgrade testing (BMC, BIOS, NIC, GPU) across multi environments with rollback and recovery validation.
  • Setting up DTM controller, DTM studio and DTM Client on Server and Client (Test systems) respectively.
  • Scheduling Jobs, Status monitoring, Job results, DTM Test Logs DTM Job logs analysis.
  • General Reporting in terms of generating CPK file and troubleshooting.

Sr. Embedded Systems Test Engineer

NvidiaPvt Ltd
Pune, India
08.2010 - 07.2011
  • Involved in Test Plan creation and test cases preparation for SDK Power Management, Sasken Inc Media player (Audio, Video, and Graphics), Devnotification(USB, DataCD, and Data DVD, SD card, Blueray and iPod) and iPod (Audio, Video and Images).
  • Testing includes System Testing, Sanity Testing and Regression Testing for iPod and Media player.
  • Validation of different Audio, video and image file formats.
  • Responsible for building test software, Sanity testing, Regression testing and System testing for each Delivery.
  • Supporting to the Development team for reproducing the all issues raised by testing team.
  • Documenting test reports, bug reports and release notes.
  • Communicating with the Development team for change in requirements, update on defects.
  • Executed compliance testing of PCIe devices for throughput, error injection, and interoperability in large-scale HPC systems.
  • Interacting with hardware architects, platform engineers, and software teams to deliver integration validation for heterogeneous compute environments.

Embedded Systems Test Engineer

AMD Pvt Ltd
Hyderabad, India
01.2007 - 07.2010
  • Conducted Regression testing to ensure backward compatibility with previous GPU driver releases.
  • Monitored and analyzed test results to identify performance bottlenecks and system anomalies; filed and tracked defect reports through closure.
  • Utilized Perforce for source control and version tracking of asset builds and build changes.
  • Implemented continuous test optimization and coverage improvement, dynamically adjusting ASIC configurations and test cases based on project needs.
  • Created and maintained OS images using the latest software updates for automated validation environments.
  • Partnered with IT and development teams to resolve infrastructure and automation framework issues ensuring seamless test execution.
  • Prepared post-mortem reports and root-cause analysis for test failures.
  • Developed and executed release test plans, test matrices, and test batches for GPU and display driver validation projects.
  • Built and configured automated test environments, including OS image creation and server setup aligned with automation frameworks.
  • Performed functional, regression, compatibility, and system testing of GPU display drivers across multiple operating systems (Windows XP, Vista, Windows 7).
  • Automated manual test cases using Perl scripting and AMD’s proprietary Testbot automation framework, improving test efficiency and coverage.

Education

Masters of Technology - Electronics and Communications Engineering

Jawaharlal Nehru Technological University
Hyderabad, India
04-2006

Bachelors of Engineering - Electronics and Communications Engineering

Madras University
Chennai, India
04-2003

Skills

  • Cloud Platforms: AWS/AZURE
  • IaC &: Terraform, Ansible
  • CI/CD Tools: Jenkins, GitHub Actions
  • Containers & Orchestration: Docker, Kubernetes, Helm
  • Version Control: Git, Perforce
  • Scripting: Bash, Python, PowerShell, YAML
  • Monitoring & Logging: Prometheus, Grafana, ELK Stack
  • Ticketing Tools: ServiceNow, JIRA
  • Databases: MySQL

Certification

AWS cloud Practitioner, HCP Terraform associate

Timeline

SME DevOps Engineer

Comcast
07.2024 - Current

DevOps Engineer

Renault
03.2018 - 06.2024

Lead engineer

Qualcomm
05.2015 - 03.2018

Lead engineer

Google Ind PVT
12.2012 - 05.2015

Sr. System validation engineer

Intel
07.2011 - 12.2012

Sr. Embedded Systems Test Engineer

NvidiaPvt Ltd
08.2010 - 07.2011

Embedded Systems Test Engineer

AMD Pvt Ltd
01.2007 - 07.2010

Bachelors of Engineering - Electronics and Communications Engineering

Madras University

Masters of Technology - Electronics and Communications Engineering

Jawaharlal Nehru Technological University
Hari Krishna Karanam