Passionate about data science, with a focus on applying skills in data analysis, machine learning, and visualization to address business challenges. Experienced in data preparation and statistical modeling, with a goal to contribute to meaningful projects that drive data-driven decisions and enhance customer satisfaction.
This project aims to predict the probability of a patient missing their medical appointment using a dataset of 100k appointments from the Brazilian public health system. The focus is on minimizing Type II errors, where patients are predicted to attend but do not, as this has a greater negative impact on healthcare facilities. Using techniques like logistic regression, random forests, and gradient boosting, the model will prioritize high recall to capture as many no-show cases as possible. Data preprocessing, feature engineering, and class imbalance handling will ensure optimal model performance.