Summary
Overview
Work History
Education
Skills
Hobbies and Interests
Projects
References
Timeline
Generic

Olivia Jones

Fort Collins,CO

Summary

Detail-oriented with strong skills in R, Python, and Machine Learning. Experienced in ensuring data integrity and enhancing research capabilities through effective collaboration and communication.

Overview

2
2
years of professional experience

Work History

Undergraduate Research Intern

Colorado State University
Fort Collins, USA
09.2023 - 05.2025
  • Entrusted with the critical task of dataset oversight, ensuring the integrity and cleanliness of data used in research projects.
  • Develop data visualizations and figures for a Data Science book, applying accessibility standards and design principles to ensure inclusivity and usability across diverse audiences.
  • Actively collaborating with research team members, providing technical support, and sharing insights to enhance project outcomes.
  • Aided in development of comprehensive video tutorials on essential software tools such as Jupyter Notebook. These resources facilitated smoother onboarding for peers.
  • Enhancing the research team's capabilities in data analysis and computational research. Integrating new tools and methodologies to drive research innovation.
  • Committed to personal and professional growth by consistently exploring new technologies and research practices. Applying newfound knowledge to refine research strategies and contribute to the team's success.
  • Playing a role in managing and enhancing the Riveria data repository, ensuring optimal organization, accessibility, and security of critical research data.

Outreach Coordinator

4th Paradigm
08.2023 - 11.2024
  • Lead initiatives to identify and engage with heads of prominent research organizations, significantly increasing the caliber of speakers at meetings and events.
  • Oversee the organization's social media presence, creating engaging content that noticeably boosted event attendance.
  • Enhanced community involvement through proactive outreach and engaging communication, strengthening relationships, and fostering an environment of collaboration.

Education

Undergraduate Student - Data Science with concentration in Statistics

Colorado State University
Fort Collins, CO
05.2025

Skills

  • Python
  • R
  • Java
  • Machine learning
  • Data visualization
  • Data analysis
  • Statistical modeling
  • SQL
  • Git
  • VS Code
  • Jupyter Notebook
  • MATLAB

Hobbies and Interests

  • Data Visualization
  • Machine Learning
  • Data Cleaning & Wrangling
  • Statistical Analysis
  • Mathematical modeling and Optimization
  • Dimensionality reduction
  • Geometric Modeling
  • High-Dimensional Data Analysis
  • Interdisciplinary Applications of Mathematics

Projects

Erowid Psychedelic Reports Analysis - Python, NLP (Hugging Face Transformers), Selenium, BeautifulSoup, scikit-learn, NLTK, PCA, LSA, Clustering

  • Collected and processed 40,000 user-submitted trip reports from Erowid.org using web scraping (Selenium, BeautifulSoup) and stored them in a structured dataset for analysis.
  • Conducted sentiment analysis using a Longformer Transformer model to classify experiences into five categories: very negative, negative, neutral, positive, very positive.
  • Applied Latent Semantic Analysis (LSA) and K-means clustering to identify semantic themes across 48 psychedelic substances, uncovering key experience patterns such as mystical, auditory, dissociative, and empathy-related effects.
  • Performed data cleaning, lemmatization, TF-IDF transformation, PCA, and clustering to extract meaningful insights while removing confounding factors like substance names and administration methods.
  • Created semantic similarity matrices and cluster maps to visualize substance relationships

Drum Audio Classifier - Python, scikit-learn, SVM, Decision Trees, Ridge/Lasso Regression, CREPE

  • Developed a machine learning pipeline to classify drum audio samples (e.g., kicks, snares, hats, rides) using features like core frequency, amplitude, and average activation extracted from WAV files with the CREPE library
  • Engineered a balanced dataset by cleaning raw audio data, applying filtering, and handling class imbalances through under sampling techniques.
  • Implemented and compared multiple machine learning models: Decision Tree Classifier (optimized via cross-validation, achieving), Ridge and Lasso Logistic Regression (analyzed feature importance), Support Vector Machines (SVM) with various kernels (linear, RBF, poly, sigmoid), achieving up to 97% accuracy in binary classification tasks.

Machine Learning Model: Titanic – Python, R, scikit-learn, nnet, Random Forest, SVM, Logistic Regression

  • Developed and evaluated four machine learning models (Logistic Regression, SVM, Random Forest, Neural Network) to predict Titanic passenger survival, achieving accuracies up to 85%
  • Engineered features like FarePerClass and LogFare, applied one-hot encoding, and visualized relationships via heatmaps to enhance model performance and interpretability.
  • Addressed class imbalance challenges, optimized hyperparameters (e.g., tree count in Random Forest, hidden layers in Neural Network), and analyzed model performance using confusion matrices, precision-recall tradeoffs, and F1-scores

References

References available upon request.

Timeline

Undergraduate Research Intern

Colorado State University
09.2023 - 05.2025

Outreach Coordinator

4th Paradigm
08.2023 - 11.2024

Undergraduate Student - Data Science with concentration in Statistics

Colorado State University