Summary
Overview
Work History
Education
Skills
Accomplishments
Languages
Timeline
Generic

Jacob Bouffard

Hatboro,PA

Summary

Software developer and data engineer with over 8 years of experience working with distributed systems, developing normalization routines, and designing API's. In addition has planned, written, and maintained data pipelines and their surrounding architecture for processes that work with TB's of data; as well as developed software application and libraries used in both the public and private sector.

Overview

8
8
years of professional experience

Work History

Geospatial Engineering Consultant

Signature Commercial Solutions, LLC
04.2022 - 01.2025
  • Expanded, monitored, fixed, and evaluated over five-hundred different Bayer internal and external geospatial tables/views using AWS and open source technologies.
  • Saved Bayer approximately five-million dollars annually through consolidation and optimization of AWS EMR/EC2 instances and their ETL processes.
  • Troubleshooted and aided on over fifty internal issues regarding geospatial processing; working across multiple teams to diagnose and fix problems at Bayer.

Senior Data Engineer

Penn Interactive
01.2021 - 03.2022
  • Developed, tested, and deployed over 50 different tables/views in AWS Redshift and RDS using DBT
  • Designed, developed, deployed, and maintained alert systems via AWS SQS and Lambda that monitored activity of over 50,000 users
  • Setup and updated AWS microservices such ECS, EC2, Lambda, and SQS using Terraform

Data Engineer

HealthVerity
10.2019 - 12.2020
  • Worked with team members to write, debug, and optimize normalization routines for over 15 TB of health care data using Python and Apache Spark
  • Developed and maintained automated normalization routines using Airflow for 47 different data providers
  • Implemented logging systems which captured run details for over 50 data ingestion processes

Software Developer

Azavea
08.2016 - 08.2019
  • Collaborated with teammates to develop distributed geospatial processing library in Scala
  • Implemented features for data processing and ingesting at scale by utilizing Apache Spark, AWS S3, HDFS, and Apache Accumulo
  • Ingested and analyzed over 10 TB of geospatial data using Apache Spark deployed on AWS EMR
  • Lead development of API and deployment strategies for distributed Python library.

Education

Bachelor of Arts - Geography and Urban

Temple University
Philadelphia, PA
05.2016

Skills

  • Apache Spark
  • AWS EMR
  • AWS EC2
  • AWS S3
  • AWS Lambda
  • AWS Redshift
  • Terraform
  • Docker
  • Kubernetes

Accomplishments

    Co-presenter at the 2019 FOSS4G NA Conference

    Co-Presented a demo showing how open source technology can be used to measure the UN's Sustainable Development Goals (SDG)s.

    Speaker at the 2018 GeoPython Conference

    Showcased an open source, geospatial, distributed Python library in real time using Jupyter and AWS EMR.

Languages

Python
Scala
Java
SQL

Timeline

Geospatial Engineering Consultant

Signature Commercial Solutions, LLC
04.2022 - 01.2025

Senior Data Engineer

Penn Interactive
01.2021 - 03.2022

Data Engineer

HealthVerity
10.2019 - 12.2020

Software Developer

Azavea
08.2016 - 08.2019

Bachelor of Arts - Geography and Urban

Temple University
Jacob Bouffard