Tennis

Experienced Data Scientist passionate about healthcare and technology. I build data centric solutions by using technological tools and mathematical models to extract information from data used to improve patient care. As a data professional, I translate complex problems into frameworks which allow such problems to be solved through machine learning and artificial intelligence.
- Classifier for Breast Cancer Classification with scikit-learn. Models used Logistic Regression.
o Tools used: Pandas, Numpy, Matplotlib, and Scikit-learn.
- Did data pre-processing and built a seasonal autoregressive model to forecast the amount of SO2 in air.
o Tools used: Pandas, Numpy, Scikit-learn, Statsmodels, Pyramid Arima
- Built a Natural Language Processing application to rate hotel reviews by users in real time.
o Tools used: Python, NLTK, Pandas, Numpy, Scikit-learn, Heroku, Flask, and Matplotlib
- Built a movie recommender system by using natural language processing. Designed a project pipeline including data storage, data pre-processing, feature engineering and selection, and modeling.
o Tools used: Pandas, Numpy, SQLite, Scikit-learn, Natural Language Toolkit NLTK, Matplotlib.
- Built a tweet classifier to classify if a given tweet based on its text, location, and keywords signifies a disaster/emergency or not. Designed a project pipeline including data storage and retrieval, data pre-processing, feature engineering and selection, and modeling.
o Tools used: Pandas, Numpy, SQLite, Scikit-learn, Natural Language Toolkit NLTK, Matplotlib.
- Did data pre-processing, feature engineering, and model building and evaluation on the Lending Club loan data to classify defaulted loans. Models used include Logistic Regression, Naïve Bayes, Random Forest, and K-Nearest Neighbors
o Tools used: Pandas, Numpy, Scikit-learn, Matplotlib, and Seaborn.
Python, SQL, SAP, DB2, MySQL, MS SQL Server, Teradata, Pandas, Numpy, Scipy, Github, Git, R
Scikit learn, Statsmodels, NetworkX, Natural Language Toolkit, Pyramid Arima
HTML, CSS, Javascript, Flask, Matplotlib, Seaborn
Microsoft Office Tools - Excel, Access, Power Point, Microsoft Power BI
Cloud Computing, Terraform IaS, Google Cloud, Vertex AI, Google Cloud Storage, Artifact Registry, Big Query, HealthcareAPI
LinkedIn: https://www.linkedin.com/in/ali-murad-b90a7a112/
GitHub: https://github.com/amuraddd
Medium: https://medium.com/@alimuradd7
Data Science Project Portfolio: https://github.com/amuraddd
- Classifier for Breast Cancer Classification with scikit-learn. Models used Logistic Regression.
o Tools used: Pandas, Numpy, Matplotlib, and Scikit-learn.
- Did data pre-processing and built a seasonal autoregressive model to forecast the amount of SO2 in air.
o Tools used: Pandas, Numpy, Scikit-learn, Statsmodels, Pyramid Arima
- Built a Natural Language Processing application to rate hotel reviews by users in real time.
o Tools used: Python, NLTK, Pandas, Numpy, Scikit-learn, Heroku, Flask, and Matplotlib
- Built a movie recommender system by using natural language processing. Designed a project pipeline including data storage, data pre-processing, feature engineering and selection, and modeling.
o Tools used: Pandas, Numpy, SQLite, Scikit-learn, Natural Language Toolkit NLTK, Matplotlib.
- Built a tweet classifier to classify if a given tweet based on its text, location, and keywords signifies a disaster/emergency or not. Designed a project pipeline including data storage and retrieval, data pre-processing, feature engineering and selection, and modeling.
o Tools used: Pandas, Numpy, SQLite, Scikit-learn, Natural Language Toolkit NLTK, Matplotlib.
- Did data pre-processing, feature engineering, and model building and evaluation on the Lending Club loan data to classify defaulted loans. Models used include Logistic Regression, Naïve Bayes, Random Forest, and K-Nearest Neighbors
o Tools used: Pandas, Numpy, Scikit-learn, Matplotlib, and Seaborn.
Reinforcement Learning - University of Alberta | Coursera
Tennis
Running
Reading fiction and non-fiction books
Listening music, watching movies and TV shows, and playing guitar
Foundations of Project Management - Google | Coursera
Reinforcement Learning - University of Alberta | Coursera
Matrix Algebra for Engineers - The Hong Kong University of Science and Technology | Coursera
Sentiment Analysis in Python - DataCamp
Introduction to Natural Language Processing in Python - DataCamp
Natural Language Processing with Classification and Vector Spaces - DeepLearning.AI | Coursera
Mathematics for Machine Learning - Imperial College London | Coursera
Divide and Conquer, Sorting and Searching, and Randomized Algorithms - Stanford, Online | Coursera
Time Series with Python Track - DataCamp
Intermediate Python for Data Science - DataCamp
Machine Learning Foundations: A Case Study Approach - University of Washington | Coursera