20+ years of experience delivering Linux production environments in cloud scale, high demand, mission critical conditions
Expert Systems Engineer with operational proficiency with many BigData tools
Strong programming and scripting ability in Python, Unix Shell
Excellent ability to quickly identify and remediate problems in a fast-paced environment
Exceptional individual and team work ethic, sees problems through to the end while collaborating with multiple teams
Overview
20
20
years of professional experience
2
2
years of post-secondary education
Work History
Staff Monitoring Engineer
Datto
Norwalk, CT
11.2019 - Current
Built and deployed multi-team/multi-datacenter monitoring solution allowing teams to self-service their own alerting
Maintained multiple systems syncing into Redis cluster for fast alert configuration lookup
Built flexible solution allowing end user to target various alerting paths
Maintained CI/CD pipeline in GitLab for automated configuration deployments
Wrote custom python modules to support internal alerting requirements
Set up and maintained Elastalert instances and wrote custom enhancements to support internal teams.
Designed, setup, configured, and supported streaming pipelines using Kafka, Logstash, Telegraf, Elasticsearch/OpenSearch
Assisted with deploying Victoriametrics as backend store for all systems and applications metrics.
Administered and maintained legacy Bosun alerting solution until it was replaced with Victoriametrics
Reduced recurring Datadog bill by $30k monthly by moving to internally hosted monitoring solution
Supported multiple internal teams alerting requirements, working with individuals to help them craft vmalert queries, elastalert rules and Api calls for their alerting
Maintained ansible playbooks for multiple services
Worked towards deprecating existing Zabbix deployment with Victoriametrics and Prometheus.
Maintained documentation around using internal monitoring solution as well as provided user support
Principal Systems Administrator
Oracle
Bozeman, MT
02.2012 - 11.2019
Designed, setup, configured, and supported streaming pipelines using Kafka, Spark, HBase, Yarn, and Zeppelin.
Member of DevOps focused team in OCI (Oracle Cloud Infrastructure) and OCI-C (Classic) deploying cloud instances using terraform, docker and kubernetes into multiple high-scale, geographically dispersed production systems.
Worked across multiple teams both locally and with remote members globally to architect solutions using best technology for each situation.
Researched and evaluated competing technologies to determine which fit needs and requirements of each project best.
Maintain and support data streaming and web applications in multiple production environments around the globe.
Deployed and administered Oracle Big Data Appliance from both command line and Cloudera Manager.
Collaborated with a global team to move a logging framework from batch oriented to a streaming solution, which enabled near real-time search capabilities.
Continued ownership of original RightNow Technologies Hadoop cluster after acquisition from Oracle.
Hosting Systems Administrator
RightNow Technologies
Bozeman, MT
05.2009 - 02.2012
Built and deployed a production Hadoop cluster currently still in use with over 1 petabyte of searchable data.
Designed and deployed an ETL pipeline for data flow into a Hadoop cluster before such tools were widely available that is still in use 10 years later.
Deployed and maintained graphite framework for realtime system and application metric visualization from proof-of-concept through to a multi-datacenter solution. This became crucial to allow the global operations center to monitor current production health as well as provide forensic data after an event to provide a root cause.
Day to day administration of RHEL/CentOs linux environments in high-scale, geographically dispersed production systems consisting of thousands of servers and customers.
Developer
Kobie Marketing
Saint Petersburg, FL
04.2007 - 05.2009
Developed scripts as needed for automation of various tasks.
Performed Quality Assurance for code to be released into a production environment.
Configured JIRA, subversion, apache webservers for internal and external customers.
System Administrator
Zoot Enterprises, Inc
Bozeman, MT
02.2004 - 05.2007
Automated many tasks including system maintenance, file sends, archiving, and system monitoring, using a variety of languages and tools.
Configured systems for a production environment to ensure 24/7 operation.
Documented response procedures for system monitoring pages and emails.
Deployed customer specific code into a production environment.
Responded to system issues escalated by the support team during off hours.
Education
Bachelor of Science - Computer Science
Montana State University
Associate of Applied Science -
Montana State University-Billings College of Technology
09.1997 - 05.1999
Skills
Systems: Linux, AIX, Windowsundefined
Software
Kafka
Hadoop
Docker
Hbase
Cassandra
Terraform
Kubernetes
Ansible
Puppet
Kong
Yarn
Mapreduce
Zookeeper
Timeline
Staff Monitoring Engineer
Datto
11.2019 - Current
Principal Systems Administrator
Oracle
02.2012 - 11.2019
Hosting Systems Administrator
RightNow Technologies
05.2009 - 02.2012
Developer
Kobie Marketing
04.2007 - 05.2009
System Administrator
Zoot Enterprises, Inc
02.2004 - 05.2007
Associate of Applied Science -
Montana State University-Billings College of Technology
09.1997 - 05.1999
Bachelor of Science - Computer Science
Montana State University
Similar Profiles
Bettina FiliusBettina Filius
Production Control Coordinator at DattoProduction Control Coordinator at Datto