Detail-oriented with strong skills in R, Python, and Machine Learning. Experienced in ensuring data integrity and enhancing research capabilities through effective collaboration and communication.
Overview
2
2
years of professional experience
Work History
Undergraduate Research Intern
Colorado State University
Fort Collins, USA
09.2023 - 05.2025
Entrusted with the critical task of dataset oversight, ensuring the integrity and cleanliness of data used in research projects.
Develop data visualizations and figures for a Data Science book, applying accessibility standards and design principles to ensure inclusivity and usability across diverse audiences.
Actively collaborating with research team members, providing technical support, and sharing insights to enhance project outcomes.
Aided in development of comprehensive video tutorials on essential software tools such as Jupyter Notebook. These resources facilitated smoother onboarding for peers.
Enhancing the research team's capabilities in data analysis and computational research. Integrating new tools and methodologies to drive research innovation.
Committed to personal and professional growth by consistently exploring new technologies and research practices. Applying newfound knowledge to refine research strategies and contribute to the team's success.
Playing a role in managing and enhancing the Riveria data repository, ensuring optimal organization, accessibility, and security of critical research data.
Outreach Coordinator
4th Paradigm
08.2023 - 11.2024
Lead initiatives to identify and engage with heads of prominent research organizations, significantly increasing the caliber of speakers at meetings and events.
Oversee the organization's social media presence, creating engaging content that noticeably boosted event attendance.
Enhanced community involvement through proactive outreach and engaging communication, strengthening relationships, and fostering an environment of collaboration.
Education
Undergraduate Student - Data Science with concentration in Statistics
Collected and processed 40,000 user-submitted trip reports from Erowid.org using web scraping (Selenium, BeautifulSoup) and stored them in a structured dataset for analysis.
Conductedsentiment analysisusing aLongformer Transformer model to classify experiences into five categories: very negative, negative, neutral, positive, very positive.
Applied Latent Semantic Analysis (LSA) and K-means clustering to identify semantic themes across 48 psychedelic substances, uncovering key experience patterns such as mystical, auditory, dissociative, and empathy-related effects.
Performed data cleaning, lemmatization, TF-IDF transformation, PCA, and clustering to extract meaningful insights while removing confounding factors like substance names and administration methods.
Created semantic similarity matrices and cluster maps to visualize substance relationships
Developed a machine learning pipeline to classify drum audio samples (e.g., kicks, snares, hats, rides) using features like core frequency, amplitude, and average activation extracted from WAV files with the CREPE library
Engineered a balanced dataset by cleaning raw audio data, applying filtering, and handling class imbalances through under sampling techniques.
Implemented and compared multiple machine learning models: Decision Tree Classifier (optimized via cross-validation, achieving), Ridge and Lasso Logistic Regression (analyzed feature importance), Support Vector Machines (SVM) with various kernels (linear, RBF, poly, sigmoid), achieving up to 97% accuracy in binary classification tasks.
Developed and evaluated four machine learning models (Logistic Regression, SVM, Random Forest, Neural Network) to predict Titanic passenger survival, achieving accuracies up to 85%
Engineered features like FarePerClass and LogFare, applied one-hot encoding, and visualized relationships via heatmaps to enhance model performance and interpretability.
Addressed class imbalance challenges, optimized hyperparameters (e.g., tree count in Random Forest, hidden layers in Neural Network), and analyzed model performance using confusion matrices, precision-recall tradeoffs, and F1-scores
References
References available upon request.
Timeline
Undergraduate Research Intern
Colorado State University
09.2023 - 05.2025
Outreach Coordinator
4th Paradigm
08.2023 - 11.2024
Undergraduate Student - Data Science with concentration in Statistics
Emergency and Critical Care Services Supervisor at James L. Voss Veterinary Teaching Hospital at Colorado State UniversityEmergency and Critical Care Services Supervisor at James L. Voss Veterinary Teaching Hospital at Colorado State University