Yanli Zhang

Los Angeles, CA

Summary

Seeking a Summer 2024 data science/analytics internship. Data scientist with strong programming, statistical, and mathematical skills. Broad internship and project experience spanning machine learning, deep learning, NLP, data analysis, and data visualization with Python, R, and SQL. 3+ years of experience analyzing large data sets, building models, and implementing data-driven solutions.

Overview

3 years of professional experience

Work History

Data Scientist Intern

Thaddeus Resources Center
09.2023 - Current
  • Engineered relevant features from the application data, including creating numerical representations of skills and qualifications
  • Developed a predictive model using machine learning algorithms such as SVM and XGBoost, along with a deep neural network, to predict the likelihood of an applicant being offered an internship position based on their application details, achieving over 85% accuracy
  • Reduced the time required for initial applicant screening by 30% and improved the accuracy of intern suitability assessments by 20%

Data Analyst

IvyMax San Marino
07.2022 - 12.2022
  • Conducted rigorous correlation analysis to investigate complex interrelationships among variables, including attendance, study hours, and academic performance
  • Employed regression analysis techniques to forecast students' future academic performance, leveraging historical data and pertinent factors such as prior grades, attendance records, and study habits
  • Utilized Tableau to deliver compelling data visualizations, including bar charts and pie charts, to communicate findings
  • Produced routine reports and ad-hoc analyses to support academic advisors' decision-making
  • Outcome: 8 out of 10 seniors gained admission to their first-choice colleges

Data Scientist Intern

China Southern Grid Company
07.2021 - 08.2021
  • Conducted research into the root causes of power outages within the Huangpu district of the city
  • Collected and cleaned a dataset comprising 260,000 rows using Python and SQL, ensuring data integrity and accuracy
  • Conducted distribution analysis and built a dashboard that tracks power outages over time and compares them by category
  • Identified key contributing factors to power blackouts, such as humidity, temperature, and the education level of residents
  • Reduced the total number of blackout events by 20% within the subsequent month

Education

Master of Science - Applied Data Science

University of Southern California
Los Angeles, CA
05.2025

Bachelor of Science - Computer Information Management

University of California, Irvine
Irvine, CA
03.2022

Skills

  • SQL
  • Python
  • Java
  • R Programming
  • Spark
  • Hadoop
  • MongoDB
  • MySQL

Projects

University of Southern California - Image Classification

04/2023 - 05/2023

  • GitHub: https://github.com/Yanli-Zhang/Image-classification.git
  • Fine-tuned the pre-trained EfficientNetB0 model to classify 420 images into 10 categories and 30 landmarks
  • Conducted data preprocessing: image transformation and normalization
  • Tackled the challenge of a limited dataset by applying data augmentation strategies
  • Achieved classification accuracies of approximately 95% for category assignment and 80% for landmark recognition


University of Southern California - Emulating Firebase

04/2023 - 05/2023

  • GitHub: https://github.com/Yanli-Zhang/Emulating-Firebase.git
  • Constructed a system resembling Firebase using Flask, WebSockets, and MongoDB
  • Created a Flask RESTful API supporting PUT, GET, POST, PATCH, DELETE, and filtering functions
  • Used MongoDB to store JSON data, creating indexes to improve data retrieval performance
  • Developed a Python-based web interface that demonstrated real-time data synchronization with the server and supported CRUD (Create, Read, Update, Delete) operations
