Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic
Akhil Sai Kumar Jami

Akhil Sai Kumar Jami

1877 McKelvey Hill Dr , St Louis,MO

Summary

Microsoft Certified SRE with 7+ years of IT Experience in Working on Handling End-to-End Infrastructure DevOps , Cloud , Software Development , Automation , With Cloud and Coding skills. Hands-on experience in AWS CLOUD . Designing, implementing, Testing and maintaining robust CI/CD pipelines, leveraging a wide range of technologies to optimize so ware development processes. Proficient at orchestrating container technologies, like Docker and Kubernetes , Ansible & Terraform automating infrastructure deployment, and ensuring software quality through testing. . Professional in various programming languages Python , Java and skilled in utilizing monitoring and ticketing systems for effective project management following agile and scrum methodologies.

Overview

8
8
years of professional experience

Work History

Sr. Site Reliability Engineer

Adis InfoTech
01.2024 - Current
  • Worked on full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Hands on Experience on Cloud Migrations from On-Prem Infrastructure.
  • Experience in Implementing Organization DevOps and SRE strategy in various environments of Linux and Windows servers along with adopting cloud strategies based on Amazon Web Services
  • Worked with Amazon AWS Cloud Administration which includes services like: EC2, S3, EBS, VPC, ELB, AMI, SNS, RDS, IAM, Route 53, Auto scaling, Cloud Front, Cloud Watch, Cloud Trail, Cloud Formation.
  • Provisioning EC2 instances and having knowledge on all resource areas of EC2 like Instances, Dedicated hosts, volumes, Key pairs, Elastic IP's, Snapshots, Load Balancers and Security Groups.
  • Connected user requests to infrastructure running in AWS such as Amazon EC2 instances, Elastic Load Balancing load balancers, or Amazon S3 buckets and outside AWS using Amazon Route 53.
  • Worked on AWS Cloud environment with various Managed APIʼs and migrated various applications to AWS.
  • Implemented and maintained the monitoring and alerting of production and corporate servers/storage using AWS Cloud watch.
  • Established end-to-end CI/CD pipelines using AWS Code Pipeline, Code Build, and Code Deploy, reducing deployment times and improving release reliability
  • Worked on AWS ECS service and Setup clusters and deployed different tomcat-based applications on ECS and configured with Auto Scaling Groups and Load balancer for high availability.
  • Orchestrated containerized applications using Kubernetes on AWS, managing deployments, services, and ensuring optimal resource utilization
  • Experience with AWS S3 services creating buckets, configuring buckets with permissions, logging, versioning and tagging.
  • AWS data backup (Snapshot, AMI creation) techniques, along with data security within AWS.
  • Implemented and enforced security practices in AWS environments, conducting regular security audits and ensuring compliance with industry standards.
  • Developed scripts and automation workflows using languages such as Python, Bash, or PowerShell to streamline operational tasks and improve overall system efficiency
  • Deployed and managed Kubernetes clusters on various platforms, including AWS EKS and on-premises environments
  • Utilized Terraform to automate the provisioning and management of AWS resources for Kubernetes clusters and Created and maintained Helm charts for packaging and deploying applications on AWS EKS.
  • Worked on centralized logging using AWS Cloud Watch Logs and Elasticsearch for effective troubleshooting and auto-scaling strategies for AWS EKS clusters to dynamically adjust resources based on demand
  • Implemented robust monitoring solutions using AWS Cloud Watch to capture and analyze metrics, logs, and events.
  • Integrated AWS Cloud Watch Alarms with AWS Lambda for automated response to critical incidents and Generated and published custom metrics to Cloud Watch using the AWS SDKs and Cloud Watch API
  • Designed and implemented event-driven architectures using AWS Lambda to respond to changes in S3, DynamoDB streams, and other events.

Sr. Site Reliability Engineer

Infosys Ltd
09.2023 - 01.2024
  • Migrating the on-premise data center applications to AWS Amazon Cloud Service, Infrastructure Design and Architecture.
  • Worked on IAM to create custom users and groups and policies
  • Implementation EC2 Server setup and deployment, build, maintenance, and configuration of various AWS resources like, EC2, EBS, Elastic Load Balancers, S3, VPC, EKS and Security Groups that are utilized for different environments like dev, testing, Production
  • Implemented Jenkins CI/CD Pipeline flow for different projects by creating multiple stages like build, integration, test, stage and production
  • Creation of subnets and Route tables, Internet gateway, virtual gateway customer gateway, create VPN connection, creating VPC peering between many VPCʼs.
  • Scanning of newly built servers for security and configuring them as per compliance standards.
  • Worked with strategic plan and guided in cost cutting within AWS for projects.

Sr. Site Reliability Engineer

Verizon
03.2023 - 09.2023
  • Worked with Multiple SRE & DevOps Technologies on On prem Servers and built pipelines for automation using Jenkins
  • Re-Wrote and Handled Python , SQL projects to make perform more efficient way and written ci scripts using Gitlab runner
  • Building Continuous Integration/Deployment pipelines for multiple projects using Jenkins and GIT
  • Working with Jfrog artifactory and Container orchestration tools like Docker and Kubernetes and it's pods
  • Handled projects collaborating with multiple teams and added Code quality scans using SonarQube
  • Working on Type Script , Java , Python Development projects as per requirements for new implementations
  • Writing Ansible / bash scripts as per requirement for automating the tasks on Linux / Ubuntu Servers
  • Implementing Container Security scans like Fortify and sysdig as per Organization Standards
  • Involved in presentations to the management and multiple teams for Automated pipelines built which can used across organization .

Sr Site Reliability Engineer

Scientific Games
09.2019 - 05.2021
  • Expert level experience with AWS DevOps tools, technologies and APIs associated with IAM, Cloud Formation, Cloud Watch, AIMs, SNS, EC2, EBS, S3, RDS, VPC, ELB, IAM, Route 53, Security Groups, Lambda etc
  • Responsible for Continuous Integration (CI) and Continuous Delivery (CD) Process implementation using Jenkins with Maven (SDLC) and MultiJob, MultiBranch Pipelines for different application and Infrastructure deployments
  • Worked on AWS Cloud environment with various Managed APIʼs and migrated various applications to AWS
  • Hands-on Experience in Jenkins Administration like set-up Security, create Jobs, configure Notifications, Distributed Builds, Install Plugins, Backup, and CLI and also Maintained Jenkins masters for different applications supported several quarterly and project releases in parallel
  • Implemented and maintained the monitoring and alerting of production and corporate servers/storage using AWS Cloud watch
  • Implementation of Continuous Delivery CD framework using Ansible, and Jenkins in Linux environment
  • Working on AWS ECS service and Setup clusters and deployed different tomcat-based applications on ECS and configured with Auto Scaling Groups and Load balancer for high availability.
  • Experience with AWS S3 services creating buckets, configuring buckets with permissions, logging, versioning and tagging
  • Worked with ELK (ElasticSearch, Logstash, and Kibana) Stack Setup for Log Monitoring and Elastic Curator to maintain the ElasticSearch Snapshots
  • Developed and tested installation scripts for automated deployment
  • Implemented rapid-provisioning and lifecycle management for Red Hat Linux using custom Ruby/Bash scripts, Puppet, and Amazon EC2
  • Deploy and monitor scalable infrastructure on Amazon web services (AWS) and configuration management using puppet
  • Setting up of private networks and Sub-networks using Virtual Private Cloud (VPC) and creating security groups to associate with the networks
  • Created AWS Route53 to route traffic between different regions
  • Configured AWS IAM and Security Group in Public and Private Subnets in VPC
  • Designed, Installed and Implemented Ansible configuration management systems.
  • Used Ansible to manage Web applications, Environments configuration Files, Users, Mount points and Packages
  • Maintained high availability clustered and standalone server environments and refined automation components with bash scripts and Ansible
  • Authored Python Scripts for AWS Automation using Python SDK Boto3, for tasks like EBS Snapshots backups, Applying S3 IAM Permission Policies and Roles, Managing S3 LifeCycle Management Policies
  • Implementing AWS Lambda function as event handlers to connect logs to S3 buckets
  • Worked with multiple Containerized applications and microservices Orchestration using Docker and Kubernetes
  • Setting up different S3 buckets with KMS encryption and attach different policies to setup restricted access
  • AWS data backup (Snapshot, AMI creation) techniques, along with data security within AWS
  • Worked with On-Premises Kubernetes HA Master/Worker Nodes Setup, for different Application Requirements Interface with IT application owners and the business in order to provide technical solutions to meet user needs
  • Responsible for writing the Ansible playbooks which is the entry point for Ansible provisioning, where the automation is defined through tasks using YAML format Run Ansible Scripts to provision Dev servers
  • Authored/Deployed Application Helm Charts and performed Rolling Updates for Micro Services using Blue/Green Deployments with Kubernetes Orchestration Tuning weekly basis to meet Nagios and CloudWatch performance requirements
  • Installed and administered the JIRA on Linux environments
  • Created and modified JIRA workflows including project workflows, field configurations, notification schemes, etc
  • Worked with Ansible/Puppet for Configuration Management to deliver automation for Infrastructure and Application deployment and provisioning related tasks
  • Involved in Configuration Automation and Centralized Management with Ansible and Implemented playbooks to manage all existing servers and automate the build/configuration of new servers
  • Worked with AWS CloudFormation and Terraform templates to deploy AWS infrastructure using Ansible
  • Expertise in Configuration of Ansible Tower, which provides a dashboard and role, based access control so that it is easier to allow individual teams access to use Ansible for their deployments
  • Worked with Monitoring and Log Monitoring based tools like Nagios, ELK/EFK stack, Grafana Dashboards, Custom Plugins Distribution, Host and ServiceGroup Configuration using Templates with Ansible.

Site Reliability Engineer

Bitstat Technologies
06.2016 - 09.2019
  • Managed , Built and Involved in Handling End- End Infrastructure , automated the tasks , Projects across the Organization
  • Hands-on experience handling Linux Administration , Network Operations and patching activities following the ITIL processes
  • Contributed to the adoption of Docker containers for application packaging, leading to enhanced portability and consistency across environments utilizing Kubernetes Services
  • Written bash , PowerShell scripts for automation at the Server level .Supported the implementation of Infrastructure as Code (IaC) using Terraform, reducing provisioning time and ensuring infrastructure consistency
  • Experience in writing Java , Python , Bash , Powershell automation Scripts and Development for Reducing the Human Intervention and efficiency
  • Led the integration of Git for version control, streamlining code collaboration and enhancing codebase management
  • Orchestrated Docker and Kubernetes to optimize application deployment and scaling, resulting in improved system reliability and resource utilization
  • Implemented Kubernetes orchestration , using YAML manifests to define pods, deployments, services, ingress resources
  • And Deployed Docker containers encapsulating microservices and orchestrated them using Kubernetes Controllers
  • Worked with Kubernetes concepts like Horizontal Pod Autoscaler and Readiness/Liveness Probes which optimized resource usage and application health monitoring
  • Automated infrastructure provisioning using Terraform and Ansible, reducing manual intervention and minimizing configuration errors
  • Orchestrated Infrastructure as Code (IAC) adoption using Terraform's HashiCorp Configuration Language (HCL) and Ansible's YAML playbooks
  • Written Terraform configurations for provisioning Infrastructure, while Ansible playbooks to handle server configurations and application deployment which helped in collaboration and consistent configurations across servers
  • Conducted code quality assessments using SonarQube and Fortify, ensuring compliance with industry standards which helped creation of more secure and robust applications
  • Integrated of SonarQube for static code analysis scans and made it triggered during the CI/CD pipeline using SonarScanner which identified code vulnerabilities , duplications
  • Collaborated with development teams to create and maintain automated testing scripts, improving software quality and reducing manual testing efforts
  • Developed many comprehensive scripts on regular basis in Bash, PowerShell, and Python to automate routine tasks, enhancing team efficiency by 80%
  • Leveraged Java programming skills to develop and built custom tools ( Journel ) in our organization for monitoring and management and to optimize application performance.
  • Assisted in the monitoring and alerting setup using Prometheus and Grafana, enabling proactive identification of performance bottlenecks
  • Managed and prioritized project tasks and issues within Jira, ensuring effective communication and progress tracking
  • Handling , Developing and Maintaining end to end Infrastructure and Working with cross-functional teams
  • Collaborated with cross-functional teams to design and implement efficient CI/CD pipelines using Jenkins, reducing deployment time by 80%
  • Engineered CI/CD pipelines Automation using Jenkins , employing Groovy-based Jenkins Pipeline DSL , In Which The pipeline does source code retrieval from Git repositories, triggering builds, running unit tests, and orchestrating Docker image builds and Kubernetes deployments
  • Performed Fortify for DAST on running applications to identify security , runtime vulnerabilities like XSS and injection attacks
  • Implemented Prometheus and Grafana for real-time monitoring and analysis of system metrics, enabling proactive issue identification and resolution
  • Worked on multiple ticketing tools like Service Now and Jira and monitoring tools like Zabbix , Splunk
  • Configured Prometheus exporters in various languages (Python, Java) for data collection Created Graphana dashboards using the Grafana web UI for visualization and alerting and using data fetched from Prometheus
  • Managed project workflows and issues through ServiceNow and Jira, ensuring seamless collaboration and effective project tracking.

Education

Master of Science - Applied Computer Science

Northwest Missouri State University
Maryville, MO
01-2023

Skills

  • DevOps Tools & Technologies
  • Cloud - AWS & AZURE
  • Coding - Python , Java
  • Scripting - Bash , PowerShell
  • Project Management
  • Agile & Scrum

Accomplishments


  • Reduced deployment times by 50% through automated CI/CD pipeline development and effective use of container orchestration.
  • Achieved 30% cost reduction by optimizing GCP resources and implementing autoscaling, saving thousands annually on infrastructure costs.
  • Enhanced operational uptime by 99.9% through robust monitoring, logging, and proactive incident management practices.
  • Architected and implemented a serverless, multi-region disaster recovery solution using AWS Cloud Functions and Cloud Spanner, reducing recovery time objective (RTO) by 85% and achieving 99.999% availability for critical
    systems.
  • Led a cross-functional team of 15 engineers in migrating 200+ microservices to Cloud, resulting in a 40% reduction in operational costs and a 30% improvement in application performance.
  • Designed and implemented a cloud-native CI/CD pipeline using Cloud Build, Artifact Registry, and Cloud Deploy, increasing deployment frequency by 300% and reducing mean time to recovery (MTTR) by 70%.
  • Spearheaded the implementation of a comprehensive observability stack using Cloud Monitoring, Cloud Logging and Cloud Trace, resulting in a 50% reduction in incident resolution time and a 25% increase in system uptime.
  • Orchestrated the migration of legacy applications to containerized microservices on AWS , reducing infrastructure costs by 35% and improving scalability to handle a 5x increase in user traffic.

Timeline

Sr. Site Reliability Engineer

Adis InfoTech
01.2024 - Current

Sr. Site Reliability Engineer

Infosys Ltd
09.2023 - 01.2024

Sr. Site Reliability Engineer

Verizon
03.2023 - 09.2023

Sr Site Reliability Engineer

Scientific Games
09.2019 - 05.2021

Site Reliability Engineer

Bitstat Technologies
06.2016 - 09.2019

Master of Science - Applied Computer Science

Northwest Missouri State University
Akhil Sai Kumar Jami