Summary
Overview
Work History
Education
Skills
Timeline
Generic

Jacob Frulla

Seattle

Summary

IT Professional and Systems Engineer with a proven track record of infrastructure design, implementation, automation, and maintenance of clustered systems in academic and high-performance computing environments. Adept at diagnosis, repair, testing, and documentation of software, hardware, and networks in secure environments. Self-starter with strong work ethic and excellent communication skills. Able to adapt, utilize, and evaluate new technologies or changing environments.

Overview

7
7
years of professional experience

Work History

HPC Systems Engineer

University Of Washington
08.2024 - Current
  • Maintained key roles in the deployment, administration, and maintenance of the university's centralized research computing environment, which included 3 clusters and 3 large scale parallel filesystems.
  • Engineered and deployed clustered systems using container orchestration systems, clustered VM environments, and bare-metal provisioning systems.
  • Established the design and performed troubleshooting of complex L2 networking infrastructures in a Cumulus Linux context, including MLAG configuration, VLAN segmentation, and firewall setup.
  • Developed and deployed automation and infrastructure management scripts via Ansible.

HPC Consultant

Los Alamos National Laboratory
01.2023 - 08.2024
  • Supported high-performance computing users and workflows across 10 clustered systems, including the 19,420 node Trinity cluster, which debuted at #6 in the November 2015 Top500 list.
  • Provided user documentation, education, outreach, and training session in regards to all available research computing resources at the lab.
  • Coordinated with a variety of teams and groups throughout the HPC department to implement Open OnDemand web portal instances in multiple secure networks.
  • Led projects with the goal of implementing and releasing key support services (Gitlab and Quay) to aid both HPC users and other HPC staff.

HPC Computer Specialist

MSU High Performance Computing Collaboratory
01.2019 - 01.2023
  • Supported the hardware, software, configuration, and support infrastructure of 6 clustered systems, including the 1,800 node Orion cluster, which debuted at #62 in the June 2019 Top500 list.
  • Provided Unix support to researchers using HPC resources in a secure NIST SP 800-171 compliant environment.
  • Successfully transitioned all clustered systems from PBS/Torque to the Slurm resource manager.
  • Wrote thorough and comprehensive guides to assist new users in the research computing environment, and documented all system changes and events in detail for auditing and reporting purposes.
  • Implemented a real-time cluster monitoring database and web-based dashboard to display live data in a public area of the facility.

Education

Computer Science

Mississippi State University
Starkville, MS
05-2019

Skills

  • Infrastructure as code using Ansible
  • Scheduling and resource management using Slurm, PBS, Torque, and Maui
  • Service deployment via Kubernetes, Proxmox, and baremetal clusters
  • Service containerization using Apptainer, Singularity, Podman, and Docker
  • Prometheus, MySQL, MariaDB, and InfluxDB design, deployment, and administration
  • Parallel Filesystems, such as Lustre and GPFS
  • Version control systems, such as Github and Gitlab
  • Extensive experience with systems scripting in Bash, Perl, Python, and Golang
  • Networking in ethernet and infiniband environments

Timeline

HPC Systems Engineer

University Of Washington
08.2024 - Current

HPC Consultant

Los Alamos National Laboratory
01.2023 - 08.2024

HPC Computer Specialist

MSU High Performance Computing Collaboratory
01.2019 - 01.2023

Computer Science

Mississippi State University