
SreeDurga Mohanraj

Site Reliability Engineer
Leander, TX

Summary

  • Over 6 years of dedicated experience in the IT industry, specializing in supporting, automating, and optimizing DevOps processes, including CI/CD, configuration management, and environment management, with a strong focus on Site Reliability Engineering (SRE) and cloud services, particularly GCP.
  • Proficient in full lifecycle management, adept at cost monitoring and reduction, change management, incident management, and problem management.
  • Skilled in defining Service Level Objectives (SLOs), VALET metrics, capacity planning, application reliability reviews, deployment, and support across various services.
  • Hands-on experience with GCP Cloud Monitoring, Error Reporting, Cloud Logging, alert analysis, BigQuery, network security, Cloud Armor, and SSL certificates.
  • Proficient in creating custom metrics and dashboards in Stackdriver using Google API developer tools.
  • Expertise in deployment utilizing Vulcan config and extensive knowledge of Chef images.
  • Implemented secure processing of traffic to and from backend services using Apigee Edge Microgateway.
  • Automated IAM provisioning, Vault, and firewall rules using Terraform.
  • Developed dashboards, accounts, custom reports, and analytic snapshots tailored to company reporting and decision-making needs.
  • Proficient in conducting load tests using NeoLoad and destructive tests using Rehub.
  • Well-versed in large-scale, multi-data center, cloud-hosted web administration.
  • Experienced in a range of DevOps tools, primarily Git and Docker.
  • Skilled in utilizing development and deployment tools such as Eclipse IDE and Force.com IDE.
  • Solid understanding of core Java concepts.
  • Demonstrated excellence in troubleshooting, with the ability to dive into all aspects of the stack to identify and resolve issues.
  • Proficient in change management and incident analysis using ServiceNow.
  • Strong work ethic, self-motivated, quick to learn, and team-oriented; consistently deliver value-added services to clients through experienced insights and effective communication.

Overview

7 years of professional experience
1 Certification

Work History

Site Reliability Engineer

The Home Depot
03.2020 - Current
  • Collaborated with SREs, Developers, and Business Partners to establish Service Level Objectives (SLOs) for Loyalty services.
  • Spearheaded the implementation of the Wheel of Misfortune (WOM) practice document in collaboration with Google, standardizing practices across the organization.
  • Partnered with vendors and stakeholders for successful launches of ProXtra and Path to Pro Campaigns.
  • Worked closely with development teams for key initiatives such as the Annualization process and Tandem sync process.
  • Defined SLOs and oversaw factors including transaction volume, capacity planning, application and system availability, latency, and incidents for each application.
  • Accountable for monitoring volume, availability, latency, errors, and ticket SLOs for both external and internal services.
  • Facilitated the onboarding of critical new services into the Loyalty platform, ensuring robust monitoring, alerts, and custom dashboards were in place.
  • Utilized Stackdriver, SQL, and BigQuery for logging and monitoring, enabling efficient incident analysis.
  • Conducted load testing and engaged in capacity planning for holiday preparedness and onboarding of new services.
  • Leveraged Terraform for IAM provisioning, Service Account creations, and firewall management.
  • Led efforts to containerize services using COS images and onboard consumers into Apigee.
  • Participated in capacity planning exercises and ensured smooth deployments in LLCs and PROD environments, with thorough documentation of pre- and post-deployment steps.
  • Developed a Python script to automate instance scaling based on peak traffic using Cloud Scheduler and Cloud Functions.
  • Presented updates and insights in weekly KPI meetings, addressing incidents and other key areas within the Loyalty domain.
  • Engaged in appropriate sizing and tuning, OS patching, and resolution of security vulnerabilities for JVMs.
  • Led incident response efforts, ensuring quick resolution and comprehensive post-incident communications.
  • Conducted regular Application Resiliency and Recovery (ARR) exercises to enhance production issue handling.
  • Utilized Grafana to visualize performance and user data, aiding in informed decision-making.
  • Actively participated in on-call rotations, addressing and resolving issues and alerts promptly.
  • Possess extensive experience in Unix systems administration, contributing to robust system management and maintenance.

Salesforce Developer

USAA
08.2018 - 09.2019

  • Developed an Audit application using Apex controllers and Visualforce pages.
  • Wrote SOQL and SOSL queries for fetching and inserting data.
  • Created Apex triggers for functional needs in the application.
  • Created and implemented both standard and custom reports and dashboards.
  • Developed a daily scheduled batch job that sends email notifications to users as part of their notification tasks.
  • Contributed to Salesforce.com administration (configuration changes), development (customizations), and deployment activities.

Salesforce Administrator

Spokes Of Hope
05.2017 - 06.2018

  • Salesforce Administrator and support analyst at the non-profit organization Spokes of Hope.
  • More available on request.

Education

Skills

Google Cloud Platform, Apigee Edge Microgateway, Terraform, Chef, NeoLoad, Reload, Rehub, Grafana, Salesforce CRM, HTML, CSS, JavaScript, Agile, SCRUM, Eclipse IDE Plug-in, ServiceNow, Apex Data Loader, Python, Apex, Core Java, J2EE, Visualforce, Spanner, Firestore, MySQL, GCP, Linux, GCP CLI

Certification

Salesforce.com Certified Administrator (ADM-201)
