Summary
Work History
Education
Skills
Websites
Projects
Awards
Personal Information
Timeline
Generic

S M SHAMSIL AREFIN

Queens

Summary

Data Science student with over one year of research experience in machine learning, statistical modeling, and data visualization. Expertise in Python libraries such as Pandas, NumPy, and scikit-learn, along with SQL and Tableau. Demonstrated ability to build data pipelines, design A/B tests, and develop predictive models that drive actionable insights. Strong collaboration skills with a track record of delivering impactful results across teams.

Work History

Undergraduate Research Assistant

CUNY Brooklyn College
Brooklyn
09.2024 - 12.2024
  • Developed Differential Population Growth Rate method to estimate viral transmission fitness from genomic surveillance data.
  • Conducted data cleaning, exploratory data analysis, and regression modeling, reducing bias from under-reporting by 30%.
  • Applied HuggingFace BERT and Transformer models for genome sequence classification and masked language modeling.
  • Created statistical simulations to validate model performance in noisy data conditions.

Education

Bachelor of Science - Computer Science

CUNY Brooklyn College
Brooklyn, NY
12-2026

Associate in Applied Science - Computer Engineering Technology

Queensborough Community College
Queens, NY
05.2023

Skills

  • Python
  • SQL
  • R
  • C
  • Java
  • JavaScript
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Scikit-learn
  • PyTorch
  • TensorFlow
  • HuggingFace
  • Tableau
  • Power BI
  • Excel
  • ArcGIS
  • Git
  • Linux
  • Statistical Analysis
  • A/B Testing
  • Feature Engineering
  • Machine Learning
  • Deep Learning
  • PostgreSQL
  • MySQL
  • SQLite
  • Statistical Analysis
  • A/B Testing
  • Feature Engineering
  • Machine Learning
  • Deep Learning
  • PostgreSQL
  • MySQL
  • SQLite

Projects

Music Recommendation System using Facial Expressions, Python, MySQL, ReactJS, Firebase, Wrote optimized SQL queries improving execution time by 20%., Built a mood-detection AI model with NLP preprocessing (SpaCy), improving detection accuracy by 25%., Designed interactive UI/UX increasing user engagement by 35%. Biodiversity in National Parks, Python (Pandas, Seaborn, Matplotlib), Analyzed 5,000+ species observations to identify biodiversity and endangerment trends., Built visual dashboards to inform conservation strategy for National Park Service. Movie Recommendation System, Python, CLI, Implemented genre/director/rating-based recommendation algorithms using efficient data structures., Ensured system reliability via comprehensive testing and error handling.

Awards

  • Dean’s Merit Scholarship, CUNY Brooklyn College, 04/24
  • Dean’s Associate High GPA Merit Award, QCC, 04/23
  • Long Island Engineers Club Annual Scholarship, 05/21

Personal Information

Relocation: Open to Relocate

Timeline

Undergraduate Research Assistant

CUNY Brooklyn College
09.2024 - 12.2024

Bachelor of Science - Computer Science

CUNY Brooklyn College

Associate in Applied Science - Computer Engineering Technology

Queensborough Community College
S M SHAMSIL AREFIN