Nithish Dantosh

Sr. Site Reliability Engineer
Austin, TX

Summary

Sr. Site Reliability Engineer with around 9 years of experience in DevOps and site reliability engineering, specializing in Kubernetes and CI/CD processes to enhance infrastructure automation. Proficient in cloud technologies and dedicated to driving innovation, creating robust and scalable environments that support advanced software solutions. Committed to leveraging expertise in MLOps and Terraform to streamline operations and elevate system reliability.

Overview

11 years of professional experience
3 Certifications

Work History

Sr. Site Reliability Engineer

Apple
09.2021 - Current
  • Managed Kubernetes charts with Helm, creating reproducible application builds, templatizing manifests, customizing deployments through dynamic configuration parameters, and overseeing the release management of Helm packages.
  • Led the setup and deployment of scalable, high-availability Rubix clusters by orchestrating containerized applications using Kubernetes manifests and Helm, optimizing performance and resource efficiency.
  • Led the Rio migration effort by aligning existing Jenkins CI jobs with Rio’s modern architecture, optimizing build configurations, and reducing maintenance overhead.
  • Designed and implemented Spinnaker pipelines for multi-cloud deployments, seamlessly integrated with Kubernetes and Helm to support zero-downtime rollouts and progressive delivery. Achieved a 40% increase in deployment speed and reliability by replacing legacy systems with Spinnaker.
  • Managed Shield VIP configurations, weight-based traffic routing, certificate renewals, and automation using Ansible.
  • Established Prometheus and Grafana for monitoring and alerting, enhancing system performance visibility and enabling proactive issue resolution.
  • Conducted capacity planning and performance tuning, including analyzing infrastructure capacity and choosing appropriate garbage collection policies so that applications could handle a million transactions per day.
  • Ensured that systems and applications adhere to security and compliance standards (HIPAA, NIST).
  • Developed custom scripts/tools as needed to automate routine tasks, increasing overall team productivity and efficiency.
  • Conducted root-cause analyses after major incidents to identify areas for process improvement or technical enhancement opportunities.
  • Participated in the on-call rotation to triage production-impacting events and see to their resolution.

Sr. Cloud Platform Engineer

Cox Communications
09.2020 - 09.2021
  • Automated AWS resource provisioning across 10+ environments using CloudFormation, reducing deployment times by 30% and ensuring consistent, repeatable infrastructure.
  • Developed CloudFormation templates to provision Lambda functions, AWS Glue, Transfer Family users, S3, KMS keys, EC2, security groups, and other resources.
  • Increased security compliance by implementing best practices for IaC, IAM roles, policies and security groups, passing multiple security audits with no critical findings.
  • Collaborated with cross-functional teams to design and implement secure network architectures with private and public subnets.
  • Improved team responsiveness and system reliability by setting up automated monitoring and alerting with AWS CloudWatch, leading to a 30% reduction in downtime.
  • Configured AWS security group rules to allow or deny traffic to and from VM instances, and used the AWS CDN (content delivery network) to deliver content from AWS cache locations, improving latency and user experience.
  • Developed multiple Ansible playbooks to automate day-to-day activities such as software upgrades, server patching, and cleanup.
  • Designed and developed full-lifecycle CI/CD pipelines for Ant-based projects using Jenkins.
  • Configured Grafana for application monitoring and created multiple dashboards to maintain system stability.

AWS/DevOps Engineer

McDonald's
08.2019 - 09.2020
  • Automated day-to-day tasks using Shell scripting and Groovy in Jenkins, reducing manual intervention.
  • Participated in release/environment meetings to identify and mitigate risks.
  • Administered Jenkins, proposed and implemented new branching strategies, and prepared CI/CD flows.
  • Automated build-release processes using Jenkins, SonarQube, Ansible, and AWS.
  • Developed CI/CD systems with Jenkins on Kubernetes container environments (EKS).
  • Created Kubernetes charts using Helm and managed Kubernetes manifests.
  • Set up IAM user integrations with LDAP access for Git and the AWS console.
  • Set up monitoring and alerting for production servers using AWS CloudWatch.
  • Utilized Terraform for infrastructure as code (IaC) to set up new environments.
  • Improved application monitoring and reduced incident detection time by 30% by setting up New Relic monitoring and creating dashboards with NRQL.
  • Troubleshot VPCs, NATs, bastions, and connectivity issues.
  • Clustered RabbitMQ for message queuing and attached policies for high availability.
  • Participated in 24/7 on-call rotation.

DevOps Engineer

CVS Health
01.2018 - 08.2019
  • Developed CI/CD pipelines from scratch for Spring RESTful API applications.
  • Optimized microservices deployment in an agile environment, leading to faster delivery cycles and improved product stability.
  • Configured Jenkins shared library with Groovy, optimizing build process and ensuring seamless integration across teams.
  • Implemented SonarQube for code review and scanning in CI/CD pipelines and Vault for secrets management.
  • Administered Artifactory servers, including installation, upgrades, and performance tuning.
  • Implemented Ansible for configuration management to deploy builds in Dev, QA, and Production environments.
  • Developed a POC solution with Spark team for SQL script analysis and PySpark solutions.
  • Worked on the Consul catalog sync service in Kubernetes to provide service discovery for applications deployed in Kubernetes.
  • Prototyped a CI/CD system with GitLab, utilizing Docker for build, test, and deploy.
  • Monitored server and application health using New Relic and created dashboards using NRQL.
  • Worked on the RabbitMQ to IBM MQ message broker migration.
  • Engineered a microservice deployment strategy in an agile setting, enhancing delivery lifecycles and fostering team collaboration.

Build and Release Engineer

Genex Technologies PVT LTD
05.2014 - 08.2015
  • Served as Release Engineer for a group spanning several development teams and multiple simultaneous software releases, and implemented a continuous integration process.
  • Participated in weekly release meetings with technology stakeholders to identify and mitigate potential risks associated with the releases.
  • Coordinated with Development, Quality Assurance, and Management teams to ensure cross-communication and confirm approval of all production changes.
  • Built and deployed Java/J2EE and .NET applications to a web application server in an Agile, continuous integration environment, and automated the whole process.
  • Created and maintained the Shell deployment scripts for WebLogic and web application servers.
  • Edited existing Maven files to fix errors or reflect changes in project requirements.
  • Propagated the JIRA issue solution from the baseline to other build lines automatically by applying SCM standards and implementing the system back end to cherry-pick the changes.
  • Used SonarQube to help maintain the source code quality.
  • Performed integration, JUnit, and code quality tests as a part of the build process.

Education

MBA

New England College
01.2021

Master's - Computer Science

University of Central Missouri
01.2017

Bachelor's - EEE

Jawaharlal Nehru Technological University
01.2014

Skills

Certification

AWS Certified Developer Associate, AWS
