Wenhao Pei

Wellsville, USA

Summary

Enthusiastic recent graduate with a strong foundation in data science and engineering concepts. Proficient in Python, R, and basic SQL, with experience in academic projects involving data analysis and visualization. Eager to apply theoretical knowledge in a professional setting and contribute to innovative data-driven projects.

Responsive expert experienced in monitoring database performance, troubleshooting issues, and optimizing database environments. Possesses strong analytical skills, excellent problem-solving abilities, and a deep understanding of database technologies and systems. Equally confident working independently or collaboratively as needed, with excellent communication skills.

Overview

13 years of professional experience

Work History

Data Engineer Intern

URMC Wilmot Cancer Center
03.2024 - 07.2024
  • During my internship at URMC Wilmot Cancer Center, I played a pivotal role in designing and optimizing data pipelines using Apache Spark.
  • I successfully reduced data processing time by 30%, allowing our team to deliver insights more rapidly.
  • Additionally, I collaborated with cross-functional teams to ensure data accuracy and integrity, contributing to critical research projects that directly impact patient care.

Lead of Data Group

SUNY Buffalo State University
02.2024 - 05.2024
  • The project focuses on the practical exploration of web scraping and API usage using Python.
  • It provides a collaborative learning experience, allowing students to apply various skill sets to tasks related to web scraping and APIs.
  • To improve efficiency, I used Scrapy and Playwright, which significantly improved the performance and effectiveness of the scraping process.
  • This combination enables robust data extraction while seamlessly handling dynamic web content.
  • The project has successfully scraped over 10,000 data points from various websites, reducing data collection time by approximately 40% compared to traditional methods.

Automation Engineer (Remote)

May Digital Music Publisher
02.2017 - 12.2022
  • In my role as an Automation Engineer at May Digital Music Publisher, I led the development and implementation of an automated deployment pipeline that reduced deployment time by 50%.
  • I collaborated closely with cross-functional teams to identify bottlenecks and optimize workflows, which resulted in a significant increase in our team's productivity and efficiency.
  • Additionally, I proactively monitored system performance and resolved critical issues, ensuring high availability of our services.

Music Producer (Piano Teacher)

Hong Yin International Music School
03.2011 - 05.2014
  • In my role at Hong Yin International Music School, I taught advanced piano techniques to students of varying skill levels.
  • I organized bi-monthly performance assessments to track student progress and provide personalized feedback, resulting in a notable increase in student retention rates.
  • I also collaborated with other instructors to integrate cross-disciplinary learning experiences, enriching our students' musical education.
  • I collaborated with performers and producers to determine and achieve the desired sound for each production.

Education

Master - Data Science & Analytics

SUNY Buffalo State University
Buffalo, NY
05.2024

Bachelor - Music Composition

Henan Normal University
Xinxiang, Henan, China
07.2010

Skills

  • Programming Languages: Python, SQL, JavaScript
  • Machine Learning & Data Tools: TensorFlow/PyTorch, Apache Airflow, Data Build Tool (DBT), ETL Development
  • Cloud & Infrastructure: AWS Lambda, Docker/Compose, Apache Kafka, PostgreSQL/MongoDB
  • Web & Automation: HTML/CSS, API Development, Workflow Automation (Bash, C#, N8N), Prompt Engineering

Languages

Chinese (Mandarin)
Native or Bilingual
English
Full Professional

Software

Visual Studio Code

Jupyter Notebook/JupyterLab

Tableau/Power BI

Cursor/MCP

Excel/Google Sheets

Apache Spark

Linux Command Line

AWS/GCP/Azure

Databricks

Docker/Airflow

Hugging Face
