3+ years of ML Research Experience. I am a Data Science/ Machine Learning practitioner who tackles real world problems with the help of analytics and ML libraries. I design algorithms and frameworks to provides actionable insights from the data.
Overview
3
3
years of professional experience
1
1
Certification
Work History
Graduate Research Assistant
URI | Kingston, RI
Kingston, RI
08.2021 - Current
Conducted IBM-supported research.
Created a prevention-focused task fairness framework using principles from information theory, enabling early integration into the model development phase for bias evaluation.
Published the work at EAI-KDD22.
Enhanced task fairness framework by designing a model-agnostic fair feature selection technique.
Mitigated biases by reducing differences observed in performance and group fairness metrics by close to 20%
Published work as part of master's thesis.
Analyzed user study findings to assess framework's applicability as a real-world practice.
Results indicated that users were interested in using the framework to analyze biases and applying bias mitigation techniques prior to the model fitting phase.
Data science Researcher
Laboratory for Analytic Sciences | NC State
06.2024 - 07.2024
Collaborated with Intelligence analysts on analyzing memory retention between full texts and their extractive summaries using similarity analysis.
Conducted a human-based evaluation to identify gaps in recommender systems and gather actionable insights for tailored recommenders platform implementation for IC.
Demonstrated how integrating MIND data with SUBER, a simulated user behavior recommender system utilizing large language models through reinforcement learning, enhanced performance of news recommendation systems.
Used Optuna to optimize SUBER performance and achieved promising results in hyperparameter tuning.
Graduate Assistant
URI | Kingston, RI
09.2023 - 05.2024
Assisted in CSC310 Data Science course by facilitating lab sessions, providing office hour assistance to undergraduate students and by designing a course on fair ML.
Data science Researcher
Laboratory for Analytic Sciences | NC State
06.2023 - 07.2023
Utilized BertViz to visualize and interpret the working of news encoder of the Neural News Recommendation with Multi-Head SelfAttention model(NRMS), facilitating model explainability for Intelligence analysts.
Authored paper for SCADS 2023, highlighting contributions to NRMS model
Conducted Human-Computer Interaction feasibility study with Intelligence analysts, leveraging neurocognitive data to evaluate scientific content summarization
The findings were presented at the NARST 2024 conference.
Graduate Assistant
URI | Kingston, RI
07.2021 - 08.2021
Developed MySQL database schema for a student portal, enabling academic performance tracking
Implemented front-end components.
Graduate Assistant
URI | Kingston, RI
02.2021 - 06.2021
Utilized SAS and SQL queries to analyze healthcare data, facilitating physician monitoring of patient loads across disease categories
Generated comprehensive reports, summarizing hospitalized and non-hospitalized patients with diverse health conditions over a specified timeframe.
Education
Ph.D. - Computer and Statistical Science
University of RI
Kingston, RI
12-2025
M.S. - Computer Science
University of RI
Kingston, RI
12.2022
Skills
Python
R
SQL
C
C
Pandas
NumPy
PyTorch
Tensorflow
Keras
Predictive modeling
Statistical Techniques
Recommender System
Time Series Analysis
Natural Language Processing
Big Data pipeline
DynamoDB
Scikit-learn
Research and analysis
Qualitative Research
Matplotlib
PySpark
HTML
CSS
RStudio
Jupyter Notebook
Git
Google Analytics
Tableau
Certification
Getting Started with Deep Learning NVIDIA, March 2021
Research & Publications
Information Theoretic Framework for Evaluation of Task Level Fairness- EAI-KDD22 - https://charliezhaoyinpeng.github.io/EAI-KDD22/camera_ready/information.pdf
Use of neurocognitive data to evaluate text summarization of science content - presented at NARST 2024
Model-agnostic feature selection for fairness - 2022 - https://digitalcommons.uri.edu/cgi/viewcontent.cgi?article=3264&context=theses
Integrating Neuroscience Principles into the Development of a Tailored Daily Report: A Universal Design Approach (Article in International Journal of Psychology and Neuroscience · August 2023 - https://neuropsyjournal.wordpress.com/articles/psycho-tailored-daily-reports/)
Projects
Time Series Analysis; R, Python, statsmodels, forecast, TSA, Executed a demand forecasting project by implementing a Linear Regression model to evaluate the interdependencies among variables. Applied ARIMA and SARIMA models to perform comparative analysis, accounting for the seasonality and upward trend in product demand.
Big Data Analysis; R, Python, Matplotlib, ggplot, Developed predictive models, including the Susceptible-Infectious-Recovered (SIR) model and both simple and multiple linear regression models, to quantitatively analyze the impact of COVID-19.
Neural Networks & Deep Learning; Python, ML, NumPy, PyTorch, Keras, pandas, Matplotlib, Image classification using CNN, MLP and Multiclass Logistic Regression to compare accuracies and losses.
Big Data Algorithms; Apache Spark- PySpark, DynamoDB, Kafka, Implemented Lambda Architecture (Big Data pipeline) to process real time and batch data. Implemented customer segmentation using K means clustering.