Yufan HE

Irvine, CA

Summary

Experienced in statistical modeling, including regression analysis (linear, logistic, ridge), clustering (K-means, hierarchical), and causal inference. Proficient in Python, R, and SQL, with expertise in data preprocessing, time series analysis, and machine learning techniques. Strong background in data visualization using ggplot2, Matplotlib, and Tableau. Applied analytical skills in financial auditing, predictive modeling, and environmental data analysis. Familiar with A/B testing, hypothesis testing, and optimization methods.

Overview

1 year of professional experience

Work History

Intern

Chongqing Zhixiang Jintai Biopharmaceutical Co., Ltd.
04.2023 - 07.2023
  • Conducted data-driven analysis for production planning and quality control.
  • Developed ridge regression & autoregressive models to assess raw material variability and processing effects.
  • Identified key quality predictors and optimized manufacturing processes.
  • Delivered recommendations that improved efficiency and reduced material waste.

Audit Intern

PricewaterhouseCoopers Zhong Tian LLP
02.2023 - 03.2023
  • Ensured data accuracy & compliance with IFRS and China GAAP.
  • Conducted data cleaning & reconciliation using Excel and Python to enhance audit reliability.
  • Optimized workflows, reducing audit preparation time by 20%.
  • Analyzed revenue, cost structures, and profit margins.
  • Collaborated with clients' financial teams to resolve discrepancies.

Intern

National Laboratory of Pattern Recognition
08.2022 - 09.2022
  • Cleaned & preprocessed data in R, handling missing values.
  • Visualized response distributions using pie and bar charts.
  • Applied multiple linear regression to assess behavior-anxiety relationships.
  • Used K-means clustering to identify patterns by gender and education.

Education

Master of Science - Statistics

University of California, Irvine
Irvine, CA
06.2025

Bachelor of Science - Statistics

University of South Carolina
Columbia, SC
12.2022

Skills

  • Programming: Python, R, Java, C, SAS, SQL
  • Data Analysis & Management: Data preprocessing, cleaning, and SQL-based management
  • Statistical Modeling: Regression (linear, logistic, ridge), clustering (K-means, hierarchical), causal inference, time series analysis, and stochastic processes
  • Data Visualization: ggplot2, Matplotlib, Tableau, and presentation skills
  • Microsoft Office Suite: Excel, Word, PowerPoint

Coursework

  • Data Visualization with SAS: Processed student test data and generated tabular grade reports.
  • Reaction Time Study: Used Wilcoxon signed-rank test, confirming significant improvement post-training.
  • Leukemia GR Site Comparison: Applied Wilcoxon rank-sum test, identifying significant variability between chronic and acute leukemia patients.
  • Diabetes Risk Modeling: Built logistic regression models in R, identifying glucose, BMI, pedigree, and age as key predictors.

Languages

  • English: Professional Working
  • Chinese (Mandarin): Native or Bilingual
