Summary
Overview
Work History
Education
Skills
Timeline
Generic

Michael Thomas

Data Engineer
New York,NY

Summary

Dynamic Engineer with 5+ years in the data ecosystem. Devoted to maintaining reliable systems for uninterrupted workflows and quality information. Delivers up-to-date methods to increase process stability and efficiencies.

Overview

6
6
years of professional experience
4
4
years of post-secondary education

Work History

Lead Software Engineer - Data

MarketOps LLC
6 2023 - Current


  • Integrated LLM functionality into analytics stack via Langchain and Llama Index, and internally designed web components.
  • As founding Data Engineer, I built and scaled our data infrastructure to store, monitor and analyze terabytes of data
  • Contributed to our web platform (Elixir)
  • Contributed and helped design internal bidder pacing algorithm.

Senior Data Engineer

LiveRamp
06.2022 - 06.2023
  • Lead migration effort from in house ETL systems to Apache Airflow Architecture
  • Developed and maintained internal APIs surfacing predictions and estimates from Data Science models to core product UI.
  • Re-factored pandas code to production grade spark
  • Developed CI systems for Data Science oriented work
  • Developed unit and integration testing frameworks for data science workflows.

Senior Data Engineer - ML Engineering

FanDuel
03.2022 - 06.2022

* Lead Fanduel's first Machine Learning Engineering team focused on building services bridging the gap between Data Science and Software Engineering.

* Architected internal feature-store (feast)

* Serve as mentor and consult for fellow engineers and management respectively.

Data Engineer

FanDuel
05.2020 - 03.2022
  • Pioneered the utilization of Spark's structured streaming (Databricks, Kinesis, Lambda, SNS,SQS).
  • Develop batch ETL pipelines pulling from third party APIs and internal operational stores.
  • Maintain Apache Airflow and Redshift AWS infrastructure.
  • Deployed Fanduel's E2 Databricks Workspaces (Terraform)
  • Deployed ML Models behind REST API (Fast API, ECS, RDS).

Data Engineer

Essence
05.2019 - 05.2020
  • Built and maintained Data Lake for global team (BigQuery/GCS).
  • Published internal python packages for data scientist to leverage for pulling data from external APIs (Pandas/Python).
  • Designed ETL pipelines for various end consumers (Google Cloud Composers/ Apache Airflow).
  • Documented best practices for running queries against serverless architecture.
  • Developed internal web application to develop personalized, re-usable, templates for panel study questionnaires (jQuerry, Flask, CloudSQL).

Campaign Architect

SundaySky Ltd.
05.2018 - 05.2019
  • Optimized/Managed media buying via open RTB.
  • Automated various reporting tasks for client facing roles (Python).
  • Developed dashboards/alarms to monitor data streams from clients' sites to internal data systems (Dynamo DB, Splunk, JavaScript).
  • Assisted data science teams in targeting algorithm optimization. (Light GBM)

Education

Bachelor of Arts - Economics

Tulane University
New Orleans, LA
09.2012 - 05.2016

Program - Applied Machine Learning

Columbia Engineering
New York, NY

Skills

Python

Apache Spark

SQL

Terraform

Java

Bash

Scala

Apache Airflow

AWS

GCP

Lambda

Kinesis

Docker

Timeline

Senior Data Engineer

LiveRamp
06.2022 - 06.2023

Senior Data Engineer - ML Engineering

FanDuel
03.2022 - 06.2022

Data Engineer

FanDuel
05.2020 - 03.2022

Data Engineer

Essence
05.2019 - 05.2020

Campaign Architect

SundaySky Ltd.
05.2018 - 05.2019

Bachelor of Arts - Economics

Tulane University
09.2012 - 05.2016

Lead Software Engineer - Data

MarketOps LLC
6 2023 - Current

Program - Applied Machine Learning

Columbia Engineering
Michael ThomasData Engineer