Proactive site reliability engineer with expertise in system monitoring, performance optimization, and incident response training. Achievements include successful automation of workflows and consolidation of monitoring tools, leading to enhanced operational efficiency and significant cost reductions.
Overview
4
4
years of professional experience
2
2
Certifications
Work History
Site Reliability Engineer 2
Checkr
, CA
01.2024 - 03.2026
Led the creation and distribution of incident response training for engineers and non-technical contributors, resulting in improved incident response
Oversaw migration from PagerDuty to Incident.io, automating workflows and creating alerting rules.
Migrated AWS resources and Datadog configurations into unified environment to enhance system observability.
Created Terraform modules to standardize Datadog monitor creation among engineering teams.
Evaluated synthetic test metrics and log usage, achieving reductions in Datadog monitoring expenses.
Centralized external Datadog organizations into single environment, improving management efficiency and visibility.
Site Reliability Engineer
Checkr
San Francisco, California
08.2022 - 01.2024
Monitored system performance and reliability across Checkr's infrastructure.
Implemented automated monitoring solutions to enhance operational efficiency.
Created and updated documentation for systems and processes to ensure team alignment and knowledge transfer.
Enhanced application performance visibility through Datadog dashboards and custom metrics, facilitating proactive monitoring.
Served as on-call SRE and incident commander, leading resolution of critical production incidents
Refined monitoring and alerting quality, leading to earlier detection of system issues and reduced incident response times.
Software Engineer
Lob
01.2022 - 06.2022
Streamlined developer onboarding by reducing integration friction and simplifying setup processes
Developed automated tests to validate integrations, increasing system reliability and stability
Built demo applications and created documentation to support SDK integrations, enhancing developer adoption