Summary
Overview
Work History
Education
Skills
Timeline
Generic

Kevin Thomas

Portland,OR

Summary

Site Reliability Engineer with a strong background in designing and implementing robust infrastructure through Terraform. Values collaboration, documentation, and cost efficiency. Effectively managed incident responses and optimized cloud resources for over 200 ECS services.

Overview

3
3
years of professional experience

Work History

SRE Engineer III

RealPage
Richardson, Texas
09.2023 - Current
  • Began role as Site Reliability Engineer I, focusing on foundational support.
  • Built SRE team from ground up, reducing Sev1 incidents from 13+ annually to fewer than 3 since 2024.
  • Removal of New Relic and implementation of Datadog within python and javascript based applications and Terraform based Infrastructure.
  • Datadog bill management and reduction of over 5k+/month.
  • Reduction of AWS Bill by ~15k/month through log management, scaling insights, and proper container design.
  • Designed and implemented deployment strategy to replicate lower level environment using terraform and github actions, enabling developers, QA, and product teams to spin up entire copies with one button for over 250 ECS services, serverless, EKS, SNS, SQS, AMQP, PostgresDB, and ALBs.
  • Developed live documentation for all products overseen, providing quick reference for developers and alert responders, including deep dives into known issues and key infrastructure connections.
  • Datadog alerting; created over 300 alerting systemside to help maintain health, including implementing synthetics and guidebooks for alerting response and context.
  • Terraform led alerting for AWS and Datadog, including implementing auto restarts and scaling to handle loads/temporary interruptions of service.

DevOps Engineer I

RealPage
Richardson, Texas
09.2022 - 09.2023
  • Started as Intern before being hired full time.
  • Terraform Standardization; using modulation to containerize all ECS services in a replicable manner for all 140+ past ECS services, and designed to easily integrate future ECS services.
  • AWS Firewall Implementation; Terraform defined with policies and rule groups, developing network skills that would help quickly alleviate several Sev1s while as an SRE.
  • Pipeline transition; Transitioned deployment pipeline for Jenkins into Github Actions using Terraform workflows.
  • AMI modification; built knowledge of containerization through template, working with developers for specific guidelines.

Education

Bachelor of Science - Computer Science

Portland State University
Portland, Oregon, OR
05-2023

Skills

Infrastructure as Code with Terraform

Python and Javascript

AWS Cloud Computing

AMI based containerization

Kubernetes

Datadog and Langsmith Alerting

Incident Management and Oncall scheduling with pagerduty

Timeline

SRE Engineer III

RealPage
09.2023 - Current

DevOps Engineer I

RealPage
09.2022 - 09.2023

Bachelor of Science - Computer Science

Portland State University
Kevin Thomas