Description: Performed analysis on women well being in 52 countries considering the major indicators like Education, Domestic violence, age, fertility and health.
Role: Developed and designed the database pipeline using the AWS RDS/ Postgres SQL free tier architecture and created a pipeline for the ETL and machine learning analysis for tableau stories.
Technologies Used: Python, AWS-RDS, PgAdmin, Clustering algorithms,Tableau.
GIT Repo: https://github.com/RoopaRaghav/FinalProject
Description: Supervised Machine Learning using different linear and logistic regression algorithms on unbalanced classification problem like Credit Risk analysis.
Role: Performed ETL on Credit card dataset and cleaned data for machine learning models. Created different Supervised ML models to compare and contrast based on accuracy score and summarized best one that suits credit risk analysis.
Technologies Used: Python, Imbalanced-learn,Scikit learn, Linear regression and logistic regression models, GIT
GIT Repo: https://github.com/RoopaRaghav/Credit_Risk_Analysis
Description: Unsupervised Machine Learning using clustering algorithms and developed this project.
Role: Developed Unsupervised machine learning model for vast dataset of Cryptocurrencies that places cryptocurrencies in clusters. Scaled and fit data and performed PCA analysis for feature engineering before data is fit to ML model. Created visualizations using Plotly for model outcome and summarized tradable cryptocurrencies from dataset.
Technologies Used: Python, Scikit learn ,K-Means Clustering algorithms, GIT
GIT Repo: https://github.com/RoopaRaghav/Cryptocurrencies
Credit Card Analysis
Description: Used PySpark to perform ETL process, connected to AWS RDS instance, and load transformed data into pgAdmin.
Role: Developed end-end database pipeline and performed ETL on amazon vine dataset and populated required tables in PostgresSQL.
Technologies Used: Python, AWS- RDS, PgAdmin, PostgresSQL, GIT
GIT Repo: https://github.com/RoopaRaghav/Amazon_Vine_Analysis