SHYLENDRA CH

Summary

Accomplished Sr. Site Reliability Engineer at PayPal Inc., specializing in AWS and CI/CD tools. Enhanced system uptime by 40% through innovative automation and monitoring solutions. Proven ability to collaborate effectively with cross-functional teams, driving significant improvements in incident response and operational efficiency. Expert in Docker and Kubernetes for scalable application deployment.

Overview

11 years of professional experience

Work History

Sr. Site Reliability Engineer

PayPal Inc.
San Jose
03.2020 - Current
  • Designed and implemented highly available, scalable, and fault-tolerant systems, reducing downtime by 40%
  • Developed self-healing automation to minimize manual intervention, improving incident response time by 60%
  • Created and optimized SLIs, SLOs, and error budgets, ensuring 99.98% service uptime and reducing SLA breaches by 35%
  • Built monitoring, logging, and alerting solutions with Splunk, Grafana, ELK, and Datadog, cutting false alerts by 50%
  • Designed and implemented data collection pipelines, leveraging Kafka to enable real-time and batch data processing
  • Developed automated data analysis workflows to extract insights from MongoDB, optimizing query performance and storage efficiency
  • Created interactive dashboards using BI tools to visualize key metrics and trends, enhancing decision-making for stakeholders
  • Collaborated with development teams to identify data-driven scenarios, ensuring seamless integration between applications and data pipelines
  • Developed and maintained Java/J2EE-based microservices to support data ingestion, transformation, and reporting functionalities
  • Implemented event-driven architecture using Kafka to streamline data streaming and message processing across distributed systems
  • Optimized MongoDB schema design and indexing strategies, improving query performance and scalability
  • Automated data validation and quality checks to ensure high integrity in collected datasets
  • Worked closely with cross-functional teams to bridge gaps between software development and data engineering, improving system reliability
  • Integrated Kafka consumers and producers within Java applications to handle high-throughput messaging and distributed event streaming

Sr. Cloud / AWS Engineer

T-Mobile
Seattle
09.2018 - 02.2020
  • Created AWS CloudFormation templates to automate the provisioning of VPCs, subnets, EC2 instances, security groups, and ELB configurations
  • Ensured compliance with tagging standards for EC2 and other AWS services (CloudFront, CloudWatch, RDS, S3, Route 53, SNS, SQS, and CloudTrail) to streamline cost management and ownership tracking
  • Managed deployments with AWS Elastic Beanstalk for scaling Java, Node.js, Python, and Docker-based applications on Apache servers
  • Monitored applications through automated health checks, fast routing, and failover mechanisms, ensuring 100% target uptime across regions
  • Calculated and tracked SLIs (Service Level Indicators) such as availability, latency, durability, and coverage to align with defined SLAs (Service Level Agreements)
  • Conducted Chaos Engineering experiments in production using Chaos Monkey during maintenance windows to identify system weaknesses and ensure resilience
  • Developed and deployed containerized microservices using Docker and orchestrated them on Kubernetes clusters
  • Managed Kubernetes clusters through Helm, created reproducible builds, and automated deployments using Jenkins integrated with Docker plugins
  • Configured replication controllers and scaling policies for Kubernetes to ensure high availability and load distribution across multiple pods and minions
  • Designed a CI/CD roadmap for the project, integrating tools such as Jenkins, Ansible, Chef, and Maven to streamline deployments and reduce lead time
  • Automated deployment pipelines with Jenkins and Ansible Tower, writing playbooks to automate repetitive tasks and managing Linux configurations
  • Incorporated security scans in CI/CD pipelines using SonarQube Quality Gates and API automation for scanning reports using Python
  • Set up Nexus repositories and Maven builds, configuring SCM polling and webhook triggers for continuous deployment
  • Utilized AWS CLI to automate backup routines to S3, create nightly AMI snapshots, and deploy a 10-node Elasticsearch cluster in AWS
  • Managed cloud deployments through CloudFormation and Terraform, ensuring predictable and consistent resource provisioning
  • Developed automation scripts in Python and Bash for AWS resource management, integrating with the AWS SDK for EC2 and S3 operations
  • Used Apache Airflow to orchestrate, schedule, and automate ETL pipelines, managing complex data transformations
  • Automated workflows for Hadoop jobs using Apache Oozie, streamlining big data processing and reporting tasks
  • Deployed Docker-based containers to enhance the scheduling and scalability of Airflow workflows
  • Performed security assessments on REST APIs and web applications, focusing on OWASP vulnerabilities
  • Automated API testing and reporting processes using Python, ensuring compliance with security standards
  • Deployed and maintained virtualized Linux environments on AWS and Rackspace Cloud
  • Automated OS installations, upgrades, and patching using PXE, DHCP, and Kickstart/Jumpstart scripts on Red Hat Linux (RHEL 5.x, 6.x, 7.x)
  • Developed shell and Python scripts for Tomcat/TC Server deployments, automating build and release processes with Maven

Cloud Admin / DevOps Engineer

Pacific Dental Services
CA
10.2017 - 09.2018
  • Designed and deployed a robust cloud infrastructure on AWS, implementing services such as EC2 for scalable computing, RDS for managed relational databases, and VPC for network isolation and security
  • Utilized AWS IAM to create fine-grained access policies, ensuring secure permissions across users and applications, and implemented AWS Route 53 for efficient DNS management, enhancing application availability and performance
  • Implemented AWS Direct Connect to establish secure, high-throughput connections between on-premises environments and AWS, facilitating seamless data transfers and improving latency for critical applications
  • Managed storage solutions through AWS S3 for data storage and AWS Glacier for cost-effective long-term data archiving, optimizing both data accessibility and budget management
  • Employed AWS CloudWatch for monitoring resources, setting up alerts and dashboards to track key performance metrics, ensuring the infrastructure remained responsive and efficient
  • Developed comprehensive CloudFormation templates to automate the provisioning of AWS resources, significantly reducing manual setup time and minimizing human error in the deployment process
  • Utilized the AWS Serverless Application Model (SAM) to streamline the deployment of RESTful APIs via API Gateway, which integrated seamlessly with AWS Lambda functions for backend processing, enhancing application scalability and resilience
  • Created and maintained Terraform scripts for building, modifying, and versioning AWS infrastructure, defining reusable modules that covered Compute, Network, Operations, and User management to expedite the setup of multiple environments
  • Configured AWS Lambda functions to automate routine tasks, enhancing operational efficiency by reducing manual workloads and providing serverless processing capabilities
  • Architected applications for high availability using AWS services, incorporating auto-scaling groups and load balancers (ELB) to handle varying traffic loads, ensuring that applications remained available during peak usage periods
  • Conducted regular performance audits and optimizations, utilizing New Relic for real-time application performance monitoring and gaining insights into bottlenecks and inefficiencies, leading to significant improvements in response times and user experience
  • Implemented CI/CD practices that emphasized reliability, integrating automated testing and continuous integration processes to enhance code quality and deployment speed
  • Designed and implemented an ELK stack (Elasticsearch, Logstash, Kibana) for centralized log management, enabling the aggregation, analysis, and visualization of logs across multiple AWS services and applications, which facilitated better troubleshooting and incident response
  • Established monitoring solutions with Nagios, Splunk, and Zenoss, configuring alerts and dashboards to track system health and performance, ensuring rapid identification and resolution of issues
  • Developed custom scripts for log parsing and reporting, enhancing the ability to detect anomalies and security incidents in real-time
  • Leveraged Kubernetes and OpenShift to orchestrate containerized applications, developing multi-regional deployment strategies to ensure high availability and performance across diverse geographic locations
  • Integrated Jenkins with Docker using the CloudBees Docker Pipeline plugin to streamline the CI/CD process for microservices, enabling automated builds, tests, and deployments directly to Docker Registries and Kubernetes clusters
  • Created and maintained Docker images and Docker Swarm configurations, optimizing the deployment process for microservices and ensuring seamless rollouts and updates across containerized environments
  • Utilized Chef for configuration management, developing roles, cookbooks, and recipes that automated the provisioning and configuration of servers, ensuring consistency across development, staging, and production environments
  • Established a robust configuration management process that included bootstrapping Chef nodes, managing infrastructure as code, and enforcing compliance through automated configurations
  • Enhanced deployment efficiency by automating the build infrastructure using Jenkins, integrating SonarQube for automated code quality checks and vulnerability assessments during the CI/CD pipeline
  • Spearheaded the implementation of a CI/CD framework utilizing Jenkins, SonarQube, Maven, and Nexus, enhancing the speed and reliability of software releases
  • Set up Jenkins jobs for automated polling of source code repositories, facilitating continuous builds and deployments, which resulted in reduced lead times for feature releases and bug fixes
  • Automated testing processes as part of the CI/CD pipeline, ensuring that code quality was maintained and vulnerabilities were identified early in the development lifecycle
  • Configured JIRA as the primary defect tracking system, customizing workflows to align with Agile methodologies and ensuring that the development team could efficiently track, manage, and resolve issues
  • Collaborated with cross-functional teams to define processes for issue resolution and to enhance team productivity through improved communication and visibility of project statuses

AWS DevOps Engineer

Capital One
McLean
01.2017 - 10.2017
  • Installed and upgraded packages, patches, and service packs; handled configuration management and version control; troubleshot connectivity issues and reviewed security constraints
  • Performed AWS Cloud administration and managed EC2 instances, CloudFormation, VPC, RDS, SQS, ELB, Auto Scaling, S3 (versioning and AMI creation), SES, and SNS services
  • Handled migration of on-premises applications to the cloud and created the cloud resources needed to enable it
  • Configured an AWS VPC and database subnet group to isolate resources within the Amazon RDS MySQL DB cluster
  • Created S3 buckets in AWS for file storage and enabled versioning
  • Created AWS security groups to act as virtual firewalls controlling traffic
  • Installed, configured, and administered the Jenkins continuous integration tool on Linux servers, adding and updating plugins such as Git, Ant, Maven, Checkstyle, Deploy to Container, and Build Pipeline
  • Configured Bitbucket with Jenkins and automated the build process through SCM polling
  • Created post-commit and pre-push hooks using Python in Bitbucket repositories
  • Resolved merge issues while rebasing and reintegrating branches
  • Integrated Docker container-based test infrastructure into the Jenkins CI test flow and set up a build environment integrated with Git and Jira to trigger builds using webhooks and slave machines
  • Configured Jenkins as a common build engine, maintaining over 100 jobs for 10 application teams across 4-5 parallel releases
  • Installed and configured Nexus Firewall to block unwanted components from entering the CI/CD pipeline
  • Worked with networking concepts including the OSI layers and protocols such as TCP/IP, NIS, DNS, NFS, FTP, DHCP, HTTP, HTTPS, SFTP, and SMTP
  • Integrated a Java- and Angular-based application with the ELK Elasticsearch tier via Spring 4 RESTful controllers communicating with a custom Java utility wrapping the Jest API
  • Expert-level skills in application development using Java, Spring Framework, Hibernate, Struts, JSP, JSF, EJB, JPA, Servlets, JDBC, JEE-compliant application servers, and client/server

Configuration Manager/ Build Engineer

3i Infotech
Hyderabad
06.2014 - 12.2015
  • Worked as a Build & Release engineer for a team that involved multiple development teams with parallel releases
  • Handled software configuration management, automating the CI/CD pipeline using Maven, Jenkins, and Git
  • Applied SCM concepts such as branching, merging, and tagging in Git
  • Automated build and release process including monitoring changes between releases
  • Developed Jenkins scripts to provide infrastructure as a service
  • Configured new applications and software updates as required, including upgrades, installations, validations, and new server setup
  • Administered and maintained build and release processes using source code management tools, build and integration tools, and automated testing tools
  • Supported and developed tools for integration, automated testing, and release management
  • Verified that the methods used to create and recreate software builds were consistent and repeatable
  • Released code to testing regions and staging areas per the published schedule
  • Used JIRA for change control & ticketing
  • Automated the ClearCase-based release management process, including monitoring changes between releases
  • Developed basic Shell/Bash/Perl scripts for automation
  • Handled code reviews and merged pull requests
  • Diagnosed and resolved issues relating to local and wide area network performance
  • Used JIRA to handle DCRs (Defect Change Requests) and MRs (Maintenance Requests)
  • Wrote playbooks for WebLogic, JDK, Jenkins, and Tomcat deployment automation
  • Resolved merge issues during build and release by meeting with developers and managers
  • Formulated and executed design standards for DNS servers
  • Worked closely with software developers and DevOps to debug software and system problems
  • Maintained and coordinated environment configuration, controls, code integrity, and code conflict resolution
  • Implemented Maven builds to automate JAR and WAR packaging
  • Took weekly backups of the repositories and managed the repositories

Education

Master’s - Computer Information Systems

Virginia International University
Virginia
01.2017

Bachelor of Engineering -

JNTU
Hyderabad
01.2014

Skills

  • AWS
  • GCP
  • CI/CD Tools: Jenkins
  • AWS CodePipeline
  • GCP pipeline
  • Docker
  • Kubernetes
  • ECS
  • OpenShift
  • Google Kubernetes Engine (GKE)
  • Chef
  • Ansible
  • Git
  • GitHub
  • SVN
  • Tomcat
  • WebSphere
  • WebLogic
  • Nginx
  • Splunk
  • ELK
  • Dynatrace
  • Datadog
  • CloudWatch
  • Kafka
  • Bug Tracking Tools
  • Shell
  • YAML
  • MySQL
  • DynamoDB
  • MongoDB
  • Cassandra
  • Nexus
  • JFrog
  • Ubuntu
  • Windows
  • DNS
  • DHCP
  • OSPF
  • UDP
