Efficiently managed a comprehensive dataset of over 23,000 rows, applying text mining techniques to illustrate linguistic trends and thematic shifts in the series.
Developed a Shiny app for visualizing sentiment trends in Game of Thrones scripts using the Bing lexicon, enabling users to explore episode-wise positive and negative sentiments across seasons.
Implemented bigram analysis on the scripts to reveal word relationships, using tokenization and the igraph package in R for visualizing connected word pairs, highlighting narrative patterns.
Storm Trajectories Project
Upper Division Coursework
10.2023 - 10.2023
Utilized the Tidyverse package to create a dynamic visualization of North Atlantic tropical storms in R.
Developed an interactive Shiny application in R, designed to plot, filter,and analyze the trajectories of storms based on user-selected parameters such as year, month, wind speed, pressure, and storm category.
Incorporated an interactive table detailing storm names, dates, and key meteorological data, crafted using dplyr methods in R, complementing visual data with in-depth textual analysis.
The Bioinformatics Research Project of COVID-19
Beijing Genomics Institute Co Ltd
07.2020 - 09.2020
Contributed to a pivotal research project investigating the correlation between SARS-CoV-2 mutations and factors such as gender and geographic location, as acknowledged in the study.
Assisted in analyzing 93,625 genomes using Python scripts to identify mutation hotspots, contributing to the understanding of virus evolution across different regions and genders.
Supported the team in generating data visualizations and conducting statistical tests with R language, leading to the conclusion that there's no significant association between mutation frequencies and the studied factors.
Education
Bachelor of Arts - Applied Mathematics And Statistics
University of California, Berkeley
Berkeley, CA
12-2026
Bachelor of Science - Applied Mathematics and Statistics
University of California, Davis
Sacramento, CA
06-2023
Skills
Programming Languages: Python, R, MATLAB
Skills: Data Visualization, Exploratory Data Analysis, Text Analysis
Timeline
Text Analysis Project
Upper Division Coursework
11.2023 - 12.2023
Storm Trajectories Project
Upper Division Coursework
10.2023 - 10.2023
The Bioinformatics Research Project of COVID-19
Beijing Genomics Institute Co Ltd
07.2020 - 09.2020
Bachelor of Arts - Applied Mathematics And Statistics
University of California, Berkeley
Bachelor of Science - Applied Mathematics and Statistics
Production Supervisor ( Comm and Production) at ADNOC OFFSHORE – Upper Zakum Oil & GaS Field DivisionProduction Supervisor ( Comm and Production) at ADNOC OFFSHORE – Upper Zakum Oil & GaS Field Division