SQL for Data Science (IBM) - Edx
Using Random Forest to Learn Imbalanced Data, Read the data, analyzed data, Sampled data, built random forest models., Constructed random forest models using different cut-off to balance the data (Oversampling, underdamping, hybrid), weighted methods and Smote sampling., Reported and presented to class. Predicting the Supercomputer Usage for NSF Awarded Projects, Extracted data, Explore to detect patterns, Transformed data and Implemented., Constructed machine learning algorithms (XGBoost, Light GBM, MLM, Random Forest and Elastic Net Regression models) to predict the usage rate of supercomputer, Tuned Parameters, Selected the best methods., Posted to Kaggle for competition between student (3rd place). Analysis of Walmart retail goods sales, Extracted, Cleaned, transformed sales data for analysis using R studio., Performed data visualization of the sales to identify patterns, trends and outliers., Built different machine learning and times series models to predict twenty-eight days forecast of item sale in various location.