Summary

Overview

Work History

Education

Skills

Websites

Timeline

Oghenemarho Ayanruoh

Guttenberg

Summary

Senior computational data scientist with expertise in Python and SQL, specializing in the design and implementation of complex data pipelines. Demonstrated success in analyzing intricate datasets and troubleshooting workflows, contributing to impactful projects in multimodal deep learning and sentiment analysis.

Overview

year of professional experience

Work History

Multimodal Pet Photo Engagement Prediction

Pennsylvania State University

State College

01.2026 - 05.2026

Built a late-fusion multimodal deep learning pipeline in PyTorch combining a ResNet-18 image encoder (512-dim) with a metadata MLP (16-dim), achieving 20.09 RMSE on a 0–100 scale, competitive with Kaggle leaderboard submissions (17.0–20.0 RMSE)
Conducted Pearson correlation analysis across 12 metadata features, identifying near-zero individual predictive power and informing design of a fusion architecture to capture non-linear feature interactions
Analyzed 9,912 out-of-fold predictions, uncovering RMSE degradation from 11.53 (scores 21–40) to 48.57 (scores 81–100) and pinpointing spurious open-mouth correlation as a key model failure
Migrated training pipeline from CPU to T4 GPU with mixed precision (bfloat16 AMP), reducing runtime from 7 hours to 1.5 hours (4.7x speedup)
Leveraged AI tools (Claude, ChatGPT) to enhance debugging and documentation processes, maintaining complete ownership of modeling

StockTwits Sentiment-Based Stock Prediction

Pennsylvania State University

State College

01.2026 - 05.2026

Built an end-to-end pipeline scraping ticker-specific StockTwits posts and aggregating them into 5-minute rolling windows for intraday stock direction prediction
Engineered sentiment and attention features including net sentiment index, bullish share, sentiment momentum, message density, and abnormal density
Merged sentiment features with intraday price data, training a logistic regression classifier that achieved ~57% accuracy and AUC ~0.563 for predicting stock direction
Researched LLM-based sentiment classification with FinBERT, proposing it as a pipeline improvement to enhance accuracy over noisy self-tagged labels

NFL Fourth-Down Decision Analysis Project (Python, PySpark)

Pennsylvania State University

State College

08.2025 - 12.2025

Transformed 1.5M+ NFL play-by-play records using Python (PySpark) on Apache Spark in Jupyter Notebook, building data pipelines that isolated ~70–90K valid fourth-down scenarios for targeted analysis.
Built scalable ETL and feature engineering workflows, generating labeled outcomes (GO, PUNT, FG) from game data for downstream analysis and modeling
Conducted exploratory data analysis (EDA) and developed performance metrics to assess fourth-down decision effectiveness, informing strategic insights across game contexts.
Optimized performance (4 GB CSV → 1.5 GB Parquet) and ran distributed jobs on Penn State’s ICDS HPC cluster
Presented technical findings through reports and presentations, leveraging AI tools (ChatGPT) to enhance development speed and workflow optimization.

Education

Bachelor of Science - Computational Data Science

Pennsylvania State University

University Park, PA

05-2026

Skills

Data Analysis
Data exploration
Statistical modeling
Machine Learning
Natural Language Processing
Python
Pandas
NumPy
PySpark
SQL

Apache Spark
Data pipelines
Data Visualization
Tableau
Ggplot2
Excel
Relational Databases
Git
AI Tools
Machine Learning

Websites

https://www.linkedin.com/in/oghenemarho-ayanruoh-402b8a230/

Timeline

Multimodal Pet Photo Engagement Prediction

Pennsylvania State University

01.2026 - 05.2026

StockTwits Sentiment-Based Stock Prediction

Pennsylvania State University

01.2026 - 05.2026

NFL Fourth-Down Decision Analysis Project (Python, PySpark)

Pennsylvania State University

08.2025 - 12.2025

Bachelor of Science - Computational Data Science

Pennsylvania State University

Oghenemarho Ayanruoh

Summary

Overview

Work History

Multimodal Pet Photo Engagement Prediction

StockTwits Sentiment-Based Stock Prediction

NFL Fourth-Down Decision Analysis Project (Python, PySpark)

Education

Bachelor of Science - Computational Data Science

Skills

Websites

Timeline

Multimodal Pet Photo Engagement Prediction

StockTwits Sentiment-Based Stock Prediction

NFL Fourth-Down Decision Analysis Project (Python, PySpark)

Bachelor of Science - Computational Data Science

Similar Profiles

Danny HellerDanny Heller

Yussef AliYussef Ali

OMAR MUSABEHOMAR MUSABEH

Praveen IntiPraveen Inti