Skip to main navigation Skip to search Skip to main content

Development and evaluation of machine learning models for diabetes risk prediction using BRFSS 2023 data

  • Kingston University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Diabetes remains a major global health burden, with its rising prevalence demanding scalable approaches to early risk detection. Traditional clinical screening is resource-intensive, whereas survey-based modelling offers a complementary tool for population-level surveillance. In this study, we develop and evaluate machine learning classifiers on the 2023 U.S. Behavioural Risk Factor Surveillance System (BRFSS) to predict self-reported diabetes and prediabetes. Following a comprehensive computational comparison, three models—elastic net logistic regression, gradient boosting, and XGBoost—were selected for hyperparameter optimisation, probability calibration, and recall-oriented threshold tuning. Given that diabetes risk prediction prioritises minimising false negatives (recall) over false positives (precision), we emphasised PR-AUC as the primary metric, with AUROC as a secondary measure. XGBoost and gradient boosting achieved the strongest overall performance (PR-AUC ≈ 0.48, AUROC ≈ 0.83), and emerged as reliable models, balancing discrimination, calibration, and sensitivity for practical screening applications while elastic net maximised sensitivity at the cost of precision.
Original languageEnglish
Title of host publication2025 International Conference on Decision Aid Sciences and Applications (DASA)
Place of PublicationPiscataway, U.S
PublisherIEEE
Pages891-895
Number of pages6
ISBN (Electronic)9798331588595
ISBN (Print)9798331588601
DOIs
Publication statusPublished - 6 May 2026
Event2025 International Conference on Decision Aid Sciences and Applications (DASA) - Manama, Bahrain
Duration: 1 Dec 20252 Dec 2025

Publication series

NameDecision Aid Sciences and Application (DASA), International Conference on
PublisherInstitute of Electrical and Electronics Engineers, Inc.

Conference

Conference2025 International Conference on Decision Aid Sciences and Applications (DASA)
Country/TerritoryBahrain
CityManama
Period1/12/252/12/25

Fingerprint

Dive into the research topics of 'Development and evaluation of machine learning models for diabetes risk prediction using BRFSS 2023 data'. Together they form a unique fingerprint.

Cite this