Summary
Overview
Work History
Education
Skills
Websites
Certification
CONTACT
Timeline
Generic

James Sagey

Senior Bioinformatics Data Scientist
Remote

Summary

Biotechnology professional with comprehensive experience in molecular biology, genetic engineering, and bioinformatics. Known for strong collaboration skills and consistently delivering results in dynamic environments. Skilled in laboratory techniques, data analysis, and process optimization. Reliable team player with flexible approach to evolving project needs.

Overview

12
12
years of professional experience
7
7
Certifications
4
4
Languages

Work History

Senior Bioinformatics & Machine Learning Engineer

Coin Nexus Inc
02.2020 - 05.2025
  • Key Technologies: R, Python, NGS, TensorFlow, PyTorch, Apache Spark, Kubernetes, Docker, SQL, Bioconductor, Nextflow, GATK, AWS, GCP.
  • Led development and deployment of machine learning pipelines for genomics, precision medicine, and biomarker discovery, integrating scalable infrastructure with Kubernetes and Docker.
  • Designed and optimized ETL pipelines to process terabytes of high-dimensional biological, clinical, and omics datasets using Apache Spark and cloud-native tools.
  • Established a robust MLOps framework automating CI/CD, model retraining, and monitoring of predictive models for bioinformatics applications with MLflow and Airflow.
  • Integrated AI solutions into bioinformatics workflows for anomaly detection, patient stratification, and drug response prediction, improving accuracy and translational impact.
  • Directed a cross-functional team of 20 scientists and engineers, fostering reproducibility, interpretability, and best practices in biomedical AI.
  • Collaborated with researchers and stakeholders to align computational biology initiatives with clinical and pharmaceutical goals, accelerating discovery pipelines and improving decision-making.

Senior Bioinformatics Data Scientist

NENFB LLC
01.2016 - 01.2020
  • Developed predictive models for gene expression analysis, achieving 70% improvement in accuracy over previous methods, resulting in more precise biomarker identification.
  • Designed and implemented an automated pipeline for next-generation sequencing (NGS) data, reducing data processing time by 60% and increasing throughput.
  • Led a cross-functional team to integrate machine learning models for drug target identification, resulting in identification of potential candidates in immunotherapy.
  • Conducted data engineering for large-scale genomics datasets, leveraging Spark and Hadoop to enable scalable data processing.

Software & Cybersecurity Engineer

Muso LLC
01.2013 - 01.2016
  • Developed 15+ software applications, utilizing programming languages such as Java, Python, and C++, improving system functionality and user experience.
  • Engineered a data processing module using Apache Kafka that enhanced data ingest rate by 60%.
  • Managed all phases of the software development lifecycle (SDLC) from initial concept through development, testing, and deployment.
  • Implemented Agile methodologies, resulting in a 30% improvement in project delivery timelines.

Education

Bachelor of Science - Information Technology

Purdue University

Master of Science - BioTechnology

Northeastern University
Boston, MA
04.2001 -

Skills

Microsoft Azure Solutions Architect Expert

Programming Languages: Python, R, Java, C

Machine Learning: TensorFlow, PyTorch, scikit-learn, Keras, XGBoost

Bioinformatics Tools: Bioconductor, Biopython, BLAST, GATK, SAMtools

Data Analysis: Pandas, NumPy, SciPy, Matplotlib, Seaborn

Natural language processing

Model development

Data analytics

Machine learning

Supervised learning

Machine learning integration

Feature engineering

Clustering algorithms

Gradient boosting machines

Statistical modeling

Project management

Neural networks

Algorithm development

Data mining

Dimensionality reduction

Optimization techniques

Unsupervised learning

Anomaly detection

Big data analytics

Genetic algorithms

Bayesian inference

Reinforcement learning

Data science principles implementation

Large dataset management

R programming language

Python programming

Gene expression

Statistical analysis techniques

Comparative genomics

Biostatistics

Drug discovery

Gene expression analysis

Data visualization tools

Next generation sequencing

Proteomics analysis

Phylogenetic analysis

Population genetics

Biological database management

Network biology

Systems biology

Epigenomics analysis

Cheminformatics

Structural bioinformatics

Functional genomics

Pathway analysis

Transcriptomics analysis

Genomic data analysis

Microarray data analysis

Molecular dynamics simulations

Genome assembly

Machine learning algorithms

Protein structure prediction

Variant calling

Certification

CompTIA Security+

CONTACT

  • +1 920-710-0357
  • Https://github.com/JD-101994
  • Jamessagey@pm.me
  • Remote or Re-location

Timeline

Senior Bioinformatics & Machine Learning Engineer

Coin Nexus Inc
02.2020 - 05.2025

Senior Bioinformatics Data Scientist

NENFB LLC
01.2016 - 01.2020

Software & Cybersecurity Engineer

Muso LLC
01.2013 - 01.2016

Master of Science - BioTechnology

Northeastern University
04.2001 -

Bachelor of Science - Information Technology

Purdue University
James SageySenior Bioinformatics Data Scientist