TY - GEN
T1 - Development and evaluation of machine learning models for diabetes risk prediction using BRFSS 2023 data
AU - Mokhtar, Sarah
AU - Abbass, Jad
PY - 2026/5/6
Y1 - 2026/5/6
N2 - Diabetes remains a major global health burden, with its rising prevalence demanding scalable approaches to early risk detection. Traditional clinical screening is resource-intensive, whereas survey-based modelling offers a complementary tool for population-level surveillance. In this study, we develop and evaluate machine learning classifiers on the 2023 U.S. Behavioural Risk Factor Surveillance System (BRFSS) to predict self-reported diabetes and prediabetes. Following a comprehensive computational comparison, three models—elastic net logistic regression, gradient boosting, and XGBoost—were selected for hyperparameter optimisation, probability calibration, and recall-oriented threshold tuning. Given that diabetes risk prediction prioritises minimising false negatives (recall) over false positives (precision), we emphasised PR-AUC as the primary metric, with AUROC as a secondary measure. XGBoost and gradient boosting achieved the strongest overall performance (PR-AUC ≈ 0.48, AUROC ≈ 0.83), and emerged as reliable models, balancing discrimination, calibration, and sensitivity for practical screening applications while elastic net maximised sensitivity at the cost of precision.
AB - Diabetes remains a major global health burden, with its rising prevalence demanding scalable approaches to early risk detection. Traditional clinical screening is resource-intensive, whereas survey-based modelling offers a complementary tool for population-level surveillance. In this study, we develop and evaluate machine learning classifiers on the 2023 U.S. Behavioural Risk Factor Surveillance System (BRFSS) to predict self-reported diabetes and prediabetes. Following a comprehensive computational comparison, three models—elastic net logistic regression, gradient boosting, and XGBoost—were selected for hyperparameter optimisation, probability calibration, and recall-oriented threshold tuning. Given that diabetes risk prediction prioritises minimising false negatives (recall) over false positives (precision), we emphasised PR-AUC as the primary metric, with AUROC as a secondary measure. XGBoost and gradient boosting achieved the strongest overall performance (PR-AUC ≈ 0.48, AUROC ≈ 0.83), and emerged as reliable models, balancing discrimination, calibration, and sensitivity for practical screening applications while elastic net maximised sensitivity at the cost of precision.
U2 - 10.1109/DASA68193.2025.11498954
DO - 10.1109/DASA68193.2025.11498954
M3 - Conference contribution
SN - 9798331588601
T3 - Decision Aid Sciences and Application (DASA), International Conference on
SP - 891
EP - 895
BT - 2025 International Conference on Decision Aid Sciences and Applications (DASA)
PB - IEEE
CY - Piscataway, U.S
T2 - 2025 International Conference on Decision Aid Sciences and Applications (DASA)
Y2 - 1 December 2025 through 2 December 2025
ER -