
Yussef Ali

Denton, TX

Summary

Motivated and detail-oriented Data Scientist with a strong academic foundation in developing data-driven solutions and streamlining workflows. Experienced in constructing and automating scalable data pipelines to support effective decision-making and enhance operational efficiency. Passionate about leveraging analytical skills to tackle complex challenges, with a commitment to continuous learning and professional growth in the tech industry.

Overview

1 year of professional experience

Work History

Customer Churn Prediction

Project
09.2023 - 11.2024
  • Performed exploratory data analysis by visualizing several metrics across the churn attribute.
  • Preprocessed the data by checking for duplicate and null values and converting categorical variables to numerical features with OneHotEncoder.
  • Found the dataset heavily imbalanced and applied an oversampling method (SMOTE) to improve model accuracy and reduce bias.
  • Improved accuracy from 80% with a logistic regression baseline to 90% with a Random Forest model.
  • Developed strategies to reduce costs by negotiating with vendors, optimizing inventory levels, and streamlining processes.
  • Identified cost reduction opportunities through data analysis and comparison of historical pricing trends.

Business Analyst Intern

Capital One
McLean, VA
06.2024 - 08.2024
  • Met with leaders across the business to learn their stories and best practices, gaining insights that informed my approach.
  • Strengthened SQL skills and applied them to manipulate large datasets efficiently.
  • Honed the ability to communicate insights effectively, telling compelling stories with data to make findings more engaging and impactful.

Life Expectancy Predictor

Project Details
08.2023 - 09.2023
  • Worked with a Kaggle dataset containing over 12 attributes.
  • Performed data cleaning and exploratory analysis, creating visualizations in Python.
  • Split the data into training and testing sets (80/20) and trained a linear regression model using Pandas, NumPy, Seaborn, and Scikit-learn.
  • Evaluated model performance with the R² score and RMSE, and generated a residual plot.

Education

BS - Data Science

University of North Texas
Denton, TX
05.2025

Skills

  • Python
  • C
  • R
  • SQL
  • JavaScript
  • Snowflake
  • Tableau
  • RapidMiner
  • Microsoft Office
  • Statistical Analysis
  • Data Visualization
  • Customer Focused
  • Data Analysis
  • Problem Solving
  • Continuous Learning
  • Cost-benefit Analysis

Awards

University of North Texas, 08/01/22 - Present: merit-based Blue and Green scholarship; Dean's List and President's List.
