Summary
Work History
Education
Skills
Timeline
AWARDS & HONORS
Hi, I’m

Kora Dey

RIVERSIDE,CA

Summary

I’m deeply passionate about using statistical modeling and machine learning to uncover meaningful insights from complex, high-dimensional data. With experience applying these methods in fields like public health, imaging, and behavioral science, I’ve developed a strong appreciation for building interpretable, data-driven models that make real-world impact. I’m excited to pursue a PHD while taking on research that pushes the boundaries of statistical methodology and contributes to solving important, data-intensive problems.

Work History

University of California Riverside, APRO

MSRIP INTERN / UC LEADS
05.2025 - Current

Job overview

(2 Year UC Leadership Excellence through Advanced Degrees Program)

  • Designed and deployed a domain-specific ChatGPT-based research assistant using OpenAI, LangChain, and FAISS, trained on academic publications, statistical texts, and university datasets to support undergraduate research.
  • Developed an intuitive natural language interface that allows users to generate code, interpret statistical models, and receive real-time guidance on reproducibility and research methodology.
  • Fine-tuned model performance using Python, Jupyter, and Hugging Face Transformers, with extensive prompt engineering to align responses with disciplinary standards.
  • Currently leading efforts to institutionalize the tool across multiple UCR departments, aiming to democratize access to advanced AI research support and enhance student-faculty collaboration.

Dr. Haofei Zhang (Chemistry Dept)

Data Analyst (Chemistry Lab)
08.2024 - Current

Job overview

  • Designed end-to-end data pipelines in Python and R to automate preprocessing, normalization, and visualization of high-throughput experimental datasets.
  • Led advanced analysis of high-dimensional reaction data using unsupervised learning techniques including PCA, hierarchical clustering, and k-means, enabling identification of hidden chemical behavior across experimental conditions.
  • Developed modular, reproducible frameworks using pandas, ggplot2, and dplyr, supporting batch experimentation and accelerating time-to-insight for new chemical workflows.
  • Took initiative in modernizing the lab’s data ecosystem by enforcing FAIR principles, coordinating scalable data practices across projects, and improving readiness for publication and cross-collaboration.

Fallbe Lab, UC Davis

Data Collecter /Data Collecter
05.2025 - 07.2025

Job overview

  • Assisted in a public health research initiative focused on understanding disparities in food marketing and access across California.
  • Collected, reviewed, and organized large datasets from retail food environments, including store audits and community-level health indicators.
  • Ensured data consistency and accuracy across multiple regions, supporting the lab’s broader statistical analysis and reporting.
  • Contributed to a research brief used by health agencies to address nutrition equity in underserved areas.

University of California Riverside,

Research in Science & Engineering Fellowship
05.2024 - 09.2024

Job overview

  • Developed high-fidelity epidemic simulations using Poisson multigraph SEIR models in R and Python to examine transmission across structured population networks.
  • Conducted network-level statistical inference to identify dominant group interactions and evaluated the impact of dynamic intervention strategies.
  • Created custom visualizations using ggplot2, igraph, and matplotlib to summarize complex transmission pathways.
  • Delivered an internal report that modeled targeted outbreak containment strategies in heterogeneous populations.

Hugh Yeh, Associate Professor

Mammogram Diagnostics- Independent Research
05.2021 - 08.2023

Job overview

  • Trained a convolutional neural network using TensorFlow and Keras to classify malignant and benign mammographic findings, with the goal of improving early breast cancer detection
  • Preprocessed high-dimensional imaging datasets by normalizing, augmenting, and resizing inputs to enhance model accuracy and reduce diagnostic bias
  • Tuned sensitivity and specificity to benchmark model performance against clinical standards, exploring its use as a decision support tool for radiologists
  • Presented results at the Synopsys Science and Technology Fair, where the project was recognized for innovation in medical AI, with mentorship from faculty across multiple institutions

Seattle University

AI4ALL Research Fellowship
05.2022 - 08.2022

Job overview

  • Designed a behavior classification model using scikit-learn and trained it on criminological datasets to identify psychological risk profiles, learning how AI can uncover hidden patterns in human decision-making.
  • Used SHAP to visualize feature importance and interpret predictions, gaining hands-on experience with cutting edge explainable AI tools and the ethical challenges they address.
  • Explored algorithmic fairness by benchmarking model outputs across sensitive demographic groups, and proposed mitigation strategies for bias and representation gaps.
  • developed understanding of applied AI in high-impact social systems, culminating in a national symposium presentation on the role of machine learning in criminal justice and ethical AI deployment.

CS Summer Immersion Programs

Software Engineering Intern
06.2020 - 08.2021

Job overview

  • Built full-stack data pipelines using Python, SQL, and Flask to process and model structured datasets, applying supervised learning techniques for real-world predictions
  • Automated data cleaning, normalization, and visualization using tools like Plotly and Seaborn to streamline analysis workflows
  • Completed accelerated training in Python, Java, JavaScript, and C++, developing cross-functional software applications in collaborative team environments
  • Led beginner coding workshops, supported peers in debugging and logic fundamentals, and cultivated a lasting commitment to accessible computing and education equity

Spotline Inc.

Machine Learning Intern
04.2021 - 07.2021

Job overview

  • Developed and tuned classification models using scikit-learn, pandas, and NumPy to address real-world business challenges including wine quality prediction and customer lifetime value ranking
  • Built end-to-end machine learning pipelines with modular preprocessing, hyperparameter tuning, cross-validation, and automated performance tracking
  • Deployed models into production by integrating outputs with internal dashboards and collaborating with engineering teams to ensure seamless, real-time insights for stakeholders
  • Strengthened practical knowledge of applied machine learning, focusing on model interpretability, business alignment, and scalable solution design

Education

University of California, Riverside
Riverside, CA

Bachelor of Science from Statistics & Computer Science

Skills

    Languages: Python, R, SQL, Java, C, Julia, C, JavaScript, Bash
    Libraries/Frameworks: Pandas, NumPy, Scikit-Learn, TensorFlow, Keras, PyTorch, Hugging Face Transformers, LangChain, OpenCV, Plotly, Seaborn, Matplotlib, ggplot2, Bokeh, Plotnine
    Tools/Platforms: Git, GitHub, Docker, Apache Spark, Airflow, SQLite, Excel, Tableau, Stata, VS Code, RStudio, Jupyter Notebook, Google Colab
    Cloud & Deployment: AWS (S3, EC2), GCP (BigQuery, Colab), Flask, Streamlit, REST APIs, CI/CD basics
    Core Competencies: Machine Learning, Deep Learning, Natural Language Processing, Computer Vision, Statistical Modeling, Predictive Analytics, Time Series Forecasting, Dimensionality Reduction (PCA, t-SNE), Model Interpretability (SHAP, LIME), Data Engineering, Advanced Data Visualization, MLOps, Research Automation, Reproducible Science, Data Ethics & Fairness

Timeline

MSRIP INTERN / UC LEADS

University of California Riverside, APRO
05.2025 - Current

Data Collecter /Data Collecter

Fallbe Lab, UC Davis
05.2025 - 07.2025

Data Analyst (Chemistry Lab)

Dr. Haofei Zhang (Chemistry Dept)
08.2024 - Current

Research in Science & Engineering Fellowship

University of California Riverside,
05.2024 - 09.2024

AI4ALL Research Fellowship

Seattle University
05.2022 - 08.2022

Mammogram Diagnostics- Independent Research

Hugh Yeh, Associate Professor
05.2021 - 08.2023

Machine Learning Intern

Spotline Inc.
04.2021 - 07.2021

Software Engineering Intern

CS Summer Immersion Programs
06.2020 - 08.2021

University of California, Riverside

Bachelor of Science from Statistics & Computer Science

AWARDS & HONORS


  • University of California Leadership Excellence through Advanced Degrees (UC LEADS) Scholar
  • University of California, Riverside Research in Science and Engineering (RISE) Scholar
  • David Nightingale First-Generation Research Symposium Presenter
  • AI4ALL National Symposium Presenter (Seattle, WA)
  • Synopsys Science and Technology Championship Awardee (2022, 2023)
  • College Board Advanced Placement (AP) Scholar
  • Johns Hopkins Center for Talented Youth (CTY) Advanced Recognition
  • National Society of High School Scholars (NSHSS) Member
  • Second Place, Science Olympiad Invitational Tournament
Kora Dey