Daniel Shafer

Meridian, ID

Summary

Senior Site Reliability Engineer with a heavy focus on Linux administration and engineering, as well as cloud and automation. Over two decades of Linux experience, a decade of Python experience, and a decade of cloud experience (OpenStack/AWS). Currently seeking Senior Site Reliability Engineer roles, and possibly Site Reliability Engineering Manager/Supervisor roles.

Overview

11 years of professional experience

Work History

Site Reliability Engineer Supervisor

GoDaddy, Inc
09.2022 - Current
  • Directly manage a team of 5 engineers.
  • Run daily standup, team, sprint planning, retro, and backlog meetings
  • Wrote and deployed massive monitoring uplift for GoDaddy Domains using Prometheus, Thanos, Grafana, and other tools.
  • Write CLI tools and API scripts using Python for ServiceNow, OpenStack, AWS, and other services.
  • Manage CI/CD pipelines and other Jenkins tasks
  • Support domain organization in various SRE tasks

Site Reliability Engineer II

GoDaddy, Inc
07.2020 - 08.2022
  • Handled patching tasks for various teams
  • Wrote Python tools to help automate or improve tasks
  • Wrote extensive automation to manage GoDaddy's Domains infrastructure.
  • Wrote extensive documentation
  • Mentored peers and other coworkers

Python Developer

A10 Networks Inc
01.2019 - 12.2019
  • Built and implemented a testing environment for OpenStack infrastructure to improve testing of code reviews
  • Participated in bi-weekly planning meetings, reviewed PRs daily, and handled bug reports
  • Improved server reliability by implementing automation tools
  • Reviewed, developed and improved features and scripts for various projects using Python.

Site Reliability Engineer

Kount, Inc.
04.2018 - 10.2018
  • Handled patch and life cycle management for releases on over 50 Linux Servers running multiple distributions
  • Refactored legacy Python code, improving readability and functionality while cleaning up several thousand lines of code
  • Cross-trained multiple teams on Python development and better practices
  • Provisioned and migrated new servers during major long-term-support release upgrade.

Site Reliability Engineer

MediaMath
05.2017 - 04.2018
  • Managed cloud infrastructure with hundreds of instances running both RHEL and Debian-based distributions
  • Handled configuration management using Chef and Ansible on over 100 servers
  • Stood up Prometheus and AlertManager to improve monitoring for multiple projects by 30%
  • Collaborated with teams from several offices all over the world, both in person and virtually

DevOps Engineer

20th Century Fox Film
07.2015 - 02.2017
  • Worked extensively with Amazon Web Services to maintain stable environments for all Fox Movies websites
  • Managed development environments and deployments for websites and maintained excellent communication with third-party vendors
  • Automated the process of building and configuring development environments, improving it by 95%
  • Researched, designed, and implemented new monitoring environment improving responsiveness to outages and issues by 90%.

OpenStack Engineer

Mirantis
02.2015 - 06.2015
  • Maintained and improved CI infrastructure
  • Performed reliability and performance stress testing
  • Handled DevOps tasks, bug regressions, writing bug reports
  • Wrote several Python scripts to improve OpenStack environment

Cloud Data Center NOC Engineer

HP Helion
02.2014 - 02.2015
  • Worked with support and service teams to quickly and properly resolve issues as they occurred
  • Monitored cloud infrastructure, responding to alerts quickly and providing guidance on any changes or additions of new alerts
  • Wrote portal in Python and Django to assist all engineers and developers with tasks and reports, improving incident handling by over 50%
  • Mentored colleagues on new tools and improvements to best practices

Linux Systems Administrator

HostGator
04.2013 - 01.2014
  • Installed, configured and managed thousands of dedicated and shared servers
  • Diagnosed and improved customer website configurations for security, performance and other issues
  • Diagnosed server problems, from networking issues to resource usage problems, and provided customers with root cause analysis and recommended repair steps
  • Maintained excellent communication with over 50 customers daily for various issues.

Skills

  • Cloud Environments (OpenStack/AWS)
  • Python Development/Scripting
  • Automation (Ansible, Chef)
  • Kubernetes
  • Unix Systems (RHEL, Debian, FreeBSD)
  • CI/CD Deployment and Management
  • Containerization
  • Supervisory Skills for SREs
  • Reporting and Dashboards (Grafana)
  • Linux Security and Hardening Practices

Military Experience

Special Electronics Device Repairer
United States Army Jan 2008 to Oct 2012

Other MOS positions include Combat Engineer (12B) and Wheeled Mechanic (91B). Served one tour in Iraq from 2010 to 2011 in support of Operation Iraqi Freedom and Operation New Dawn.
