Summary
Overview
Work History
Education
Skills
Software
Timeline
Generic

CODY STEVENS

Bozeman,MT

Summary

  • 20+ years of experience delivering Linux production environments in cloud scale, high demand, mission critical conditions
  • Expert Systems Engineer with operational proficiency with many BigData tools
  • Strong programming and scripting ability in Python, Unix Shell
  • Excellent ability to quickly identify and remediate problems in a fast-paced environment
  • Exceptional individual and team work ethic, sees problems through to the end while collaborating with multiple teams

Overview

20
20
years of professional experience
2
2
years of post-secondary education

Work History

Staff Monitoring Engineer

Datto
Norwalk, CT
11.2019 - Current
  • Built and deployed multi-team/multi-datacenter monitoring solution allowing teams to self-service their own alerting
  • Maintained multiple systems syncing into Redis cluster for fast alert configuration lookup
  • Built flexible solution allowing end user to target various alerting paths
  • Maintained CI/CD pipeline in GitLab for automated configuration deployments
  • Wrote custom python modules to support internal alerting requirements
  • Set up and maintained Elastalert instances and wrote custom enhancements to support internal teams.
  • Designed, setup, configured, and supported streaming pipelines using Kafka, Logstash, Telegraf, Elasticsearch/OpenSearch
  • Assisted with deploying Victoriametrics as backend store for all systems and applications metrics.
  • Administered and maintained legacy Bosun alerting solution until it was replaced with Victoriametrics
  • Reduced recurring Datadog bill by $30k monthly by moving to internally hosted monitoring solution
  • Supported multiple internal teams alerting requirements, working with individuals to help them craft vmalert queries, elastalert rules and Api calls for their alerting
  • Maintained ansible playbooks for multiple services
  • Worked towards deprecating existing Zabbix deployment with Victoriametrics and Prometheus.
  • Maintained documentation around using internal monitoring solution as well as provided user support

Principal Systems Administrator

Oracle
Bozeman, MT
02.2012 - 11.2019
  • Designed, setup, configured, and supported streaming pipelines using Kafka, Spark, HBase, Yarn, and Zeppelin.
  • Member of DevOps focused team in OCI (Oracle Cloud Infrastructure) and OCI-C (Classic) deploying cloud instances using terraform, docker and kubernetes into multiple high-scale, geographically dispersed production systems.
  • Worked across multiple teams both locally and with remote members globally to architect solutions using best technology for each situation.
  • Researched and evaluated competing technologies to determine which fit needs and requirements of each project best.
  • Maintain and support data streaming and web applications in multiple production environments around the globe.
  • Deployed and administered Oracle Big Data Appliance from both command line and Cloudera Manager.
  • Collaborated with a global team to move a logging framework from batch oriented to a streaming solution, which enabled near real-time search capabilities.
  • Continued ownership of original RightNow Technologies Hadoop cluster after acquisition from Oracle.

Hosting Systems Administrator

RightNow Technologies
Bozeman, MT
05.2009 - 02.2012
  • Built and deployed a production Hadoop cluster currently still in use with over 1 petabyte of searchable data.
  • Designed and deployed an ETL pipeline for data flow into a Hadoop cluster before such tools were widely available that is still in use 10 years later.
  • Deployed and maintained graphite framework for realtime system and application metric visualization from proof-of-concept through to a multi-datacenter solution. This became crucial to allow the global operations center to monitor current production health as well as provide forensic data after an event to provide a root cause.
  • Day to day administration of RHEL/CentOs linux environments in high-scale, geographically dispersed production systems consisting of thousands of servers and customers.

Developer

Kobie Marketing
Saint Petersburg, FL
04.2007 - 05.2009
  • Developed scripts as needed for automation of various tasks.
  • Performed Quality Assurance for code to be released into a production environment.
  • Configured JIRA, subversion, apache webservers for internal and external customers.

System Administrator

Zoot Enterprises, Inc
Bozeman, MT
02.2004 - 05.2007
  • Automated many tasks including system maintenance, file sends, archiving, and system monitoring, using a variety of languages and tools.
  • Configured systems for a production environment to ensure 24/7 operation.
  • Documented response procedures for system monitoring pages and emails.
  • Deployed customer specific code into a production environment.
  • Responded to system issues escalated by the support team during off hours.

Education

Bachelor of Science - Computer Science

Montana State University

Associate of Applied Science -

Montana State University-Billings College of Technology
09.1997 - 05.1999

Skills

Systems: Linux, AIX, Windowsundefined

Software

Kafka
Hadoop
Docker
Hbase
Cassandra
Terraform
Kubernetes
Ansible
Puppet
Kong
Yarn
Mapreduce
Zookeeper

Timeline

Staff Monitoring Engineer

Datto
11.2019 - Current

Principal Systems Administrator

Oracle
02.2012 - 11.2019

Hosting Systems Administrator

RightNow Technologies
05.2009 - 02.2012

Developer

Kobie Marketing
04.2007 - 05.2009

System Administrator

Zoot Enterprises, Inc
02.2004 - 05.2007

Associate of Applied Science -

Montana State University-Billings College of Technology
09.1997 - 05.1999

Bachelor of Science - Computer Science

Montana State University
CODY STEVENS