Vincent Marinelli

Data Engineering Technology Leader / Architect
Collegeville, PA

Summary

Experienced, energetic and passionate Data Engineering Leader with a deep understanding of the principles of science and clinical research who is focused on finding and understanding value in data.

Work History

Director of Data Engineering

Greenphire
09.2023 - Current
  • In the first 18 months, reduced issue resolution times by 40%, reduced defect rate by over 50% and reduced system downtime by > 75%. Improved on-time feature delivery rate from < 25% to > 80%.
  • Leading design and development of data-driven products based on existing Greenphire data assets as well as data from strategic partners using PySpark (Glue), TensorFlow, Scikit-Learn
  • Leading migration from legacy ETL/DW-based toolset to a modern streaming platform and data lake toolset based on Lake Formation, Glue, Step Functions, Lambda, and Athena
  • Modernized Developer Experience through improved CI/CD toolsets and processes using GitHub Actions, FluxCD, Liquibase, Terraform.
  • Established multi-account AWS deployment strategy to improve data security and isolation, while increasing team autonomy and reducing deployment overhead.
  • Driving the organization towards Data Mesh concepts of distributed ownership and stewardship of data assets
  • Manage, mentor and train data engineering staff members.

Principal Software Engineer

Workiva
07.2021 - 09.2023
  • Worked with senior leadership to set data architecture vision
  • Worked with Legal and InfoSec to design secure and privacy-preserving data infrastructure for analysis of highly sensitive financial data for AI/ML team
  • Led engineering team that built Data Lake using AWS technologies (DMS, Glue/PySpark, S3, Lambda, Kinesis, Athena, Lake Formation, Redshift, Step Functions, etc.). Data Lake supports current AI/ML team features in production
  • Worked with Platform data team to scale out and enhance data management capabilities of customer-facing Java Spring / Presto-based data analytics tool
  • Drove integration and rationalization of Data Engineering toolsets and patterns through projects delivered for AI/ML team

Senior Director - Enterprise Data Architecture

Medidata Solutions, Division Of Dassault Systèmes
01.2019 - 07.2021
  • Responsible for Data & Analytics infrastructure that supported >100,000 patient lives, >10,000 clinical trials, and >500TB of clinical data at scale
  • Supervised architecture team of six that supported a technical portfolio with an operating budget of > $15M/yr
  • Provided data-tier enterprise architectural design and technical direction to over twenty teams
  • Drove DevOps and SRE transformation within teams to improve velocity, quality and predictability
  • Reduced release cycle times for data systems from months to days
  • Drove transformation to a release decision-making process based on quality metrics and risk-reward metrics
  • Supported transformation from single-cloud to cloud-agnostic infrastructure

Enterprise Data Architect

Medidata Solutions Worldwide
01.2015 - 01.2019
  • Led design of flagship "MEDS" clinical data storage and analytics platform used by all applications in Medidata platform
  • Led design and delivery of cloud-based distributed ETL system that processes hundreds of jobs and >10M data change events per day
  • Oversaw design of reporting and analytics architectures for use with both in-house and cloud-hosted data
  • Led development of Java / Hadoop / Redshift data pipeline for processing data streams that deliver analytics data to RTSM customers
  • Introduced MongoDB to support data whose schema is variable / evolving
  • Introduced Data Vault 2.0 on Snowflake as evolution of existing data warehouse

Director of Platform Data Services

Medidata Solutions Worldwide
01.2012 - 01.2015
  • Led four engineering teams that supported company's reporting and analytics tools
  • Successfully reduced year-over-year failure rates across supported systems by up to 80%
  • Led transition of data and analytics tools from on-premises to cloud

Visualization, Analytics & Reporting Manager

Medidata Solutions Worldwide
01.2010 - 01.2012
  • Architected and built Insights Data Warehouse product
  • Awarded patents for novel analytic algorithms for computation of clinical site metrics
  • Built and led team of 6 engineers that supported visualization, reporting and analytics efforts for Medidata

Education

Master of Science - Computer Science

Kutztown University of Pennsylvania
Kutztown, PA
09.2023 - Current

Master of Science - Software Engineering

Pennsylvania State University
Malvern, PA
05.2001

Bachelor of Arts - Biochemistry

University of Delaware
Newark, DE
05.2001

Software

Languages (Python, Java, C#/.NET, C/C++, Scala, SQL)

Data Engineering Tools (Spark, Presto, Hive, Pig, Sqoop, Hadoop, Kafka, Debezium)

Data Science Tools (TensorFlow, Keras, Scikit-Learn, NumPy, Pandas, JupyterHub)

K8s Toolsets (Docker, kubectl, Lens, Argo Workflows, Prometheus, Grafana)

AWS Infrastructure (Glue Family, Lake Formation, Step Functions, DMS, Kinesis, MSK)

DevOps Toolsets (FluxCD, GitHub Actions, Terraform, Ansible, Artifactory, ECR, CodeArtifact)

Database Systems (Oracle, SQL Server, MySQL, Postgres, Redshift, DocumentDB, MongoDB, Snowflake)

Timeline

Director of Data Engineering

Greenphire
09.2023 - Current

Master of Science - Computer Science

Kutztown University of Pennsylvania
09.2023 - Current

Principal Software Engineer

Workiva
07.2021 - 09.2023

Senior Director - Enterprise Data Architecture

Medidata Solutions, Division Of Dassault Systèmes
01.2019 - 07.2021

Enterprise Data Architect

Medidata Solutions Worldwide
01.2015 - 01.2019

Director of Platform Data Services

Medidata Solutions Worldwide
01.2012 - 01.2015

Visualization, Analytics & Reporting Manager

Medidata Solutions Worldwide
01.2010 - 01.2012

Master of Science - Software Engineering

Pennsylvania State University
05.2001

Bachelor of Arts - Biochemistry

University of Delaware
05.2001