Summary
Overview
Work History
Education
Skills
Projects
Timeline
Generic

Pai Yang

Pittsburg

Summary

Seasoned collaborator experienced in meeting needs, improving processes and exceeding requirements in team environments. Diligent worker with strong communication and task prioritization skills. Detail-focused Data Analyst with knowledge in data warehousing, process validation and business needs analysis. Proven to understand customer requirements and translate into actionable project plans. Dedicated and hard-working with passion for Big Data.

Overview

1
1
year of professional experience

Work History

Game Designer and ScriptWriter

CMU Game Creation Society
  • Designed the whole storyline and basic gameplay
  • Utilized Unity engine to design the environment for the game setting.

Data Analyst Intern

ManpowerGroup
05.2024 - 07.2024
  • Utilized various professional statistical techniques and maintained large databases to collect and analyze data from partners and customers.
  • Completed data cleaning and data validation of existing spreadsheets to promote robust data management platform, resulting in accurate data analysis and entry.
  • Performed ad hoc data investigations analysis as necessary.
  • Triaged and tracked internal and external data requests.

Data Analytics Intern/Data Scientist Intern

ECMAX
06.2023 - 08.2023
  • Data Collection + EDA + 发现了insights
  • Preprocess the textdata (including removal of special characters and stopwords) and prompt engineering
  • Fine tuning the Large language Models (LLM) to optimize the response accuracy of the chatbot, achieving 20x efficiency on average for critical business processes
  • Trained, Tuned, and compared LLM1, LLM2, LLM3 models to build a chatbot 给公司员工去查询品牌营业额 提高工作效率
  • Presentation xxxx.

Education

Bachelor of Science - Statistics And Machine Learning

Carnegie Mellon University
Pittsburg, PA
05.2025

Skills

  • Programming Languages: Python(NumPy, Pandas, Matplotlib), R(dplyr, tidyr, ggplot2,purrr), SQL(MySQL), Excel, C#(Unity),
  • Data Visualization: Tableau, Power BI, Looker Studio
  • Cloud Platforms: AWS, Snowflake, GCP

Projects

Python Machine Learning Model Development for heart disease diagnosis prediction, Train the Decision Tree model using Python with data points like check pain factor, blood sugar, gender, and other examination parameters. Preprocess the data to improve model prediction accuracy. Evaluate and optimize model performance with analysis on confusion matrix and heatmap visualization. Customized Model Development for Natural Language Processing, Utilize Vectorization methods to tokenize unstructured comments, train and fit data for logistics regression and Decision Trees to identify the model with highest accuracy for predicting overall sentiment polarity (positive or negative) for restaurant reviews Data Wrangling and Visualization with R and Python for EDA, Data processing using Dataframe and manipulate raw data into cleaned version for subsequent analysis. Generate scripts to process data and plot visualization graphs for exploratory data analysis CMU Hackathon 2022 (scripts development for customized game: Merge Big Watermelons), Built software program from the beginning for the game integrating various elements including geometry, physical laws, graphic design, etc. Software Development (Dungeon Crawl type game), Wrote Python code from scratch for the entire game including control panel, interactive features, etc. Designed User Interface with message prompt and notification system for the players

Timeline

Data Analyst Intern

ManpowerGroup
05.2024 - 07.2024

Data Analytics Intern/Data Scientist Intern

ECMAX
06.2023 - 08.2023

Game Designer and ScriptWriter

CMU Game Creation Society

Bachelor of Science - Statistics And Machine Learning

Carnegie Mellon University
Pai Yang