Summary
Overview
Work History
Education
Skills
Certification
Timeline
Work Availability
Quote
Generic
Ajay Saraf

Ajay Saraf

USA

Summary

Accomplished Sr DevOps and SRE Engineer at CDK Global, specializing in AWS infrastructure and automation. Expert in Terraform and Ansible, I significantly enhanced provisioning efficiency, reducing setup time by 90%. A collaborative problem-solver, I thrive in agile environments, driving continuous improvement and operational excellence in cloud-native applications.

Overview

13
13
years of professional experience
1
1
Certificate

Work History

Sr DevOps and SRE Engineer

CDK Global
Portland, OR
11.2016 - Current
  • Experience in infrastructure development and operations, involved in designing and deploying almost all the AWS stack, such as EC2, ECS, EKS, Lambda, EBS, S3, VPC, RDS, ELB, autoscaling, SQS, SNS, Route 53, VPN, etc.
  • Provision and maintain infrastructure in the cloud and VMware using IAC tools, such as Terraform and AWS CloudFormation.
  • Experience in migration and modernization of applications using Ansible on AWS Cloud and VMware platforms.
  • Creating and managing AWS AMIs using Terraform Packer.
  • Designed and implemented provisioning process that reduced Standalone application provision process on VMware from 12 hours to 1hour using puppet, Ansible and terraform (reduced approx. 90% of provisioning time)
  • Experience in writing Puppet modules to load packages and maintain the configuration as part of the system provisioning process.
  • Extensive use of Ansible automation in multiple projects, such as migrations, OS upgrades, patching, and applications like Postgres and PHP upgrades.
  • Designed end to end fail over mechanism using ansible during upgrades and patching process.
  • Worked on AWS Cloud and VMware for developing cloud-native applications, CI/CD workflows for build and release management using Atlassian Bamboo, Gitlab, and Jenkins.
  • Set up CI/CD pipelines so that each commit a developer makes goes through the standard software development lifecycle and gets tested well enough before it can make it to production.
  • Experience in creating RPM packages using Atlassian Bamboo.
  • Designed and implemented the Node.js dashboard application builds and deployment process.
  • Experience in building microservices using Docker containers.
  • Collaborated with development support teams to set up a continuous delivery environment with the use of Docker.
  • Deployed a Kubernetes cluster on cloud environments with a master/minion architecture, and wrote YAML files to create many services, such as pods, deployments, auto-scaling, load balancers, labels, health checks, and namespaces.
  • Worked on K8s services like Ingress, Load Balancer, Node Port, Cluster IP, etc.
  • Creating snapshots and Amazon Machine Images (AMIs) of the instances for backup and creating clone instances.
  • Experience in backing up and restoring VMware instances and databases using the Rubrik backup application.
  • Designed and implemented Rubrik backup solution to legacy applications, replicated same with AWS snapshot process (reduced backup failures from 30% to less than 2% )
  • Created and managed RDS databases (PostgreSQL) using Terraform as part of application modernization.
  • Designed a data replication process between the legacy PICK database and AWS RDS (PostgreSQL) using SQS and Lambda.
  • Implemented NLP search functionality on legacy systems using AWS Lambda and Elastic Search.
  • Designed and implemented a high-availability application logging architecture using td-agent (Fluentd) and Kibana/Splunk/New Relic.
  • Reduced loss of logs from 30% to less than 1% using high availability architecture.
  • Experience in configuring AWS services and app logs to New Relic to monitor and analyze application logs.
  • Experience in configuring alerts using CloudWatch.
  • Experience in writing REST API scripts using Python REST and PHP cURL modules.
  • Experience in writing Bash, shell, and Expect scripts as part of Puppet and Ansible automation.
  • Hands-on experience with GraphQL using Python.
  • Hands-on experience in creating APIs using PHP and Python.
  • Experience in working on networking, such as AWS VPC, subnets, load balancing, and AWS auto-scaling.
  • Created an application load balancer to manage ECS Fargate instances to validate JWT tokens.
  • Having experience in creating SSL/TLS certificates using the AWS Certificate Manager.
  • Creating S3 buckets, managing policies for S3 buckets, and utilizing S3 buckets and Glacier for storage and backup on AWS.
  • Created and managed the AWS Glacier service for archiving backups, as per the client agreement policy.
  • Hands on experience in Kafka architecture for Application modernization.
  • Extensive working experience in an agile environment and a full understanding of the SDLC and processes.
  • Environment: AWS Cloud, Terraform, GitHub, Docker, Kubernetes, Ansible, Puppet, New Relic, Splunk, td-agent, fluentd, Rubrik.

DevOps Engineer

Recondo healthcare
Denver, CO
11.2015 - 10.2016
  • Managing the AWS infrastructure of the company, which we use daily, includes providing access using IAM, creating and managing EC2 instances, and uploading, downloading, and managing S3 buckets.
  • Worked on configuring GitHub repos, creating new release branches as per the release checklist, and providing them to the developer for every major release.
  • Monitoring and fixing the infrastructure and application-related issues of all environments using Nagios (production and non-production environments).
  • Setting up build configurations and fixing the build issues in Jenkins.
  • Set up and manage monitoring and logging solutions using AWS CloudWatch, providing real-time insights into resource performance and health.
  • Configure and manage virtual networks, subnets, and load balancers to ensure secure and efficient network traffic management.
  • Maintaining servers by managing packages using Puppet.
  • Environment: AWS, Puppet, Boto Python, Nagios, Shell Scripting.

Site Reliability Engineer

Qualcomm(CSR)
San Diego, CA
10.2014 - 11.2015
  • Launching SGEE and SHAP servers with auto-deployment using the Jenkins tool.
  • Configuration of the AWS server on which the application runs, manual deployment, working on auto deployment, monitoring, troubleshooting issues, and creating automation scripts to launch the setup with one click.
  • Launching AWS EC2 and Apache Cloud Stack instances.
  • Copying files, whitelisting IPs using Puppet Master.
  • Monitoring Apache CloudStack servers using Nagios.
  • Performance testing using JMeter and monitoring Java processes using JMX and JConsole.
  • Writing Boto-Python scripts to automate AWS provisioning and deployment processes.
  • Environment: AWS, AWS CLI, Puppet, Python, Boto, Zabbix, Nagios, Jenkins, JMeter.

System engineer (Linux Admin and production support)

Apalya Technologies
Hyd, IND
12.2011 - 10.2014
  • Monitoring servers' health checkup with the Nagios tool.
  • Installing and configuring applications like Wowza Media Server, Darwin, VLC, and FFmpeg transcoder tools.
  • Installing and maintaining JBoss Middleware applications.
  • Configured Nagios monitoring, which monitors all servers' health status.
  • Configuration of servers like FTP, SAMBA, NFS, TELNET, SSH, YUM, and NTP.
  • Implementing backup scripts for log backup of the application, streaming logs to the backup disk using TAR, SCP, and bash scripts.
  • I worked on FFmpeg and developed code for better streaming.
  • Capturing network interface traffic using tcpdump, lsof, and netstat.
  • Environment: SUSE Linux, CentOS, Nagios, Wowza, JBoss.

Education

Bachelor's Degree - Electronics and Computer Science Technology

JNTU
Hyderabad
08-2010

Skills

  • Cloud platforms: AWS, Azure, VMware
  • Containerization: Docker, Kubernetes, AWS Fargate
  • CI/CD: Jenkins, Bamboo, GitLab, and Harness
  • Automation and IaC: Ansible, Puppet, Terraform, Packer
  • Source control: Git, SVN, and BitBucket
  • Monitoring and Logging: New Relic, Splunk, Nagios, Grafana, Fluentd, and Kafka
  • Scripting: Python, PHP, Bash, Expect
  • Databases: Mongo, PostgreSQL, MySQL
  • Storage and Backups: Rubrik, NetBackup, AWS Snapshots, AWS storage services

Certification

  • AWS Certified Solution Architect – Associate (SAA-C03)
  • Hashicorp Certified Terraform Associate (003)
  • Azure Designing and Implementing DevOps Certification (AZ-400)
  • AWS AI Practitioner certification

Timeline

Sr DevOps and SRE Engineer

CDK Global
11.2016 - Current

DevOps Engineer

Recondo healthcare
11.2015 - 10.2016

Site Reliability Engineer

Qualcomm(CSR)
10.2014 - 11.2015

System engineer (Linux Admin and production support)

Apalya Technologies
12.2011 - 10.2014

Bachelor's Degree - Electronics and Computer Science Technology

JNTU

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Quote

Every problem is a gift—without problems we would not grow.
Tony Robbins
Ajay Saraf
Want your own profile? Create for free at Resume-Now.com