
A Machine Learning Classification Model for Detecting Prediabetes

Abstract: The incidence of prediabetes in the USA has reached a dangerous level. If this stage is ignored, the likelihood of developing chronic and complex health issues is very high. Early detection of prediabetes is therefore critical to reduce or avoid type 2 diabetes and the other health issues that result from untreated and undiagnosed prediabetes. This study aims to detect the prediabetes condition using an artificial intelligence method. The data used for this study are collected from the Centers for Disease Control and Prevention's (CDC) survey conducted by the Division of Health and Nutrition Examination Surveys (DHANES). Several machine learning algorithms are applied and compared to determine the best algorithm based on Average Squared Error (ASE), Kolmogorov-Smirnov (Youden) scores, area under the ROC curve, and some other performance measures. Based on these scores, Random Forest is selected as the champion model, with approximately 89% accuracy.
Authors: A. K. M. Raquibul Bashar; Mahdi Goudarzi; Chris P. Tsokos (Department of Mathematics & Computer Science, Augustana College, Rock Island, Illinois, USA; Independent Researcher, San Francisco, California, USA; Department of Mathematics & Statistics, University of South Florida, Tampa, Florida, USA)
Source: Journal of Data Analysis and Information Processing, 2024, Issue 3, pp. 462-478 (17 pages)
Keywords: Prediabetes; Machine Learning; SVM; Forest; Cumulative Lift
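
The abstract names three of the measures used to compare candidate models: Average Squared Error (ASE), the Kolmogorov-Smirnov (Youden) score, and the area under the ROC curve. As a rough illustration of what these quantities measure for a binary classifier, the sketch below computes them in plain Python. It is not the authors' pipeline: the function names, labels, and predicted probabilities are hypothetical and invented for the example, and the study's own software and data are not shown in this record.

```python
# Toy illustration only: the labels and scores below are made up,
# not taken from the study's CDC survey data.

def average_squared_error(y_true, p_hat):
    """Mean of (label - predicted probability)^2; lower is better."""
    return sum((y - p) ** 2 for y, p in zip(y_true, p_hat)) / len(y_true)

def roc_points(y_true, p_hat):
    """(FPR, TPR) pairs, lowering the threshold one distinct score at a time.

    Assumes y_true contains at least one 0 and at least one 1.
    """
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    ranked = sorted(zip(p_hat, y_true), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        score = ranked[i][0]
        # Consume every case tied at this score before emitting a point.
        while i < len(ranked) and ranked[i][0] == score:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points

def ks_youden(points):
    """Largest gap between TPR and FPR over all thresholds (Youden's J)."""
    return max(tpr - fpr for fpr, tpr in points)

def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area

# Hypothetical labels (1 = prediabetes) and model probabilities.
y_true = [0, 0, 1, 0, 1, 1, 0, 1]
p_hat = [0.1, 0.3, 0.35, 0.4, 0.6, 0.7, 0.2, 0.9]

points = roc_points(y_true, p_hat)
print("ASE:", average_squared_error(y_true, p_hat))
print("KS (Youden):", ks_youden(points))
print("AUC:", auc(points))
```

Under this kind of comparison, a model with lower ASE and higher KS and AUC on held-out data would be preferred; how the study weighs the measures against each other is not stated in the abstract.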