A Machine Learning Classification Model for Detecting Prediabetes

A Machine Learning Classification Model for Detecting Prediabetes

下载PDF

导出

摘要 The incidence of prediabetes is in a dangerous condition in the USA. The likelihood of increasing chronic and complex health issues is very high if this stage of prediabetes is ignored. So, early detection of prediabetes conditions is critical to decrease or avoid type 2 diabetes and other health issues that come as a result of untreated and undiagnosed prediabetes condition. This study is done in order to detect the prediabetes condition with an artificial intelligence method. Data used for this study is collected from the Centers for Disease Control and Prevention’s (CDC) survey conducted by the Division of Health and Nutrition Examination Surveys (DHANES). In this study, several machine learning algorithms are exploited and compared to determine the best algorithm based on Average Squared Error (ASE), Kolmogorov-Smirnov (Youden) scores, areas under the ROC and some other measures of the machine learning algorithm. Based on these scores, the champion model is selected, and Random Forest is the champion model with approximately 89% accuracy. The incidence of prediabetes is in a dangerous condition in the USA. The likelihood of increasing chronic and complex health issues is very high if this stage of prediabetes is ignored. So, early detection of prediabetes conditions is critical to decrease or avoid type 2 diabetes and other health issues that come as a result of untreated and undiagnosed prediabetes condition. This study is done in order to detect the prediabetes condition with an artificial intelligence method. Data used for this study is collected from the Centers for Disease Control and Prevention’s (CDC) survey conducted by the Division of Health and Nutrition Examination Surveys (DHANES). In this study, several machine learning algorithms are exploited and compared to determine the best algorithm based on Average Squared Error (ASE), Kolmogorov-Smirnov (Youden) scores, areas under the ROC and some other measures of the machine learning algorithm. Based on these scores, the champion model is selected, and Random Forest is the champion model with approximately 89% accuracy.

作者 A. K. M. Raquibul Bashar Mahdi Goudarzi Chris P. Tsokos A. K. M. Raquibul Bashar;Mahdi Goudarzi;Chris P. Tsokos(Department of Mathematics & Computer Science, Augustana College, Rock Island, Illinois, USA;Independent Researcher, San Francisco, California, USA;Department of Mathematics & Statistics, University of South Florida, Tampa, Florida, USA)

机构地区 Department of Mathematics & Computer Science Independent Researcher Department of Mathematics & Statistics

出处《Journal of Data Analysis and Information Processing》 2024年第3期462-478,共17页 数据分析和信息处理（英文）

关键词 PREDIABETES Machine Learning SVM FOREST Cumulative Lift Prediabetes Machine Learning SVM Forest Cumulative Lift

分类号 H31 [语言文字—英语]

引文网络
相关文献

1张华辉,冯林,荆沁璐.基于融合对抗网络的方面级情感分类方法[J].中文信息学报,2024,38(7):147-157.
2Tingting Yan,Hui Zheng,Mingshuang Li,Chao Ma,Xuanyi Wang,Xiaoqi Wang,Zhenjun Li,Yuansheng Chen,Wenshang Hu,Lance Rodewald,Zhijie An,Zundong Yin,Zijian Feng.The COVID-19 Vaccines Evaluation Program:Implementation,Management,and Experiences,2021-2023[J].China CDC weekly,2024,6(26):642-648.
3Reported Cases and Deaths of National Notifiable Infectious Diseases—China,June 2024[J].China CDC weekly,2024,6(34):883-884.
4Navya Nori.Machine Learning Based Virtual Screening for Biodegradable Polyesters[J].Journal of Materials Science and Chemical Engineering,2024,12(8):1-11.
5Shicheng Yu,Mengxian Zhang,Zhaofeng Ye,Yalong Wang,Xu Wang,Ye-Guang Chen.Development of a 32-gene signature using machine learning for accurate prediction of inflammatory bowel disease[J].Cell Regeneration,2023,12(1):423-435.
6李明影.受害者视角:亲密关系暴力的认知及影响[J].争议解决,2024,10(8):236-244.
7徐悦,李佳潼,郭齐韵,李慧珊,吴华.南昌市NDVI时空演化特征及其气候驱动因子分析[J].森林工程,2024,40(5):50-61.
8任文静.基于智能手机传感器数据的运动状态分类与特征预测[J].应用数学进展,2024,13(8):3976-3988.
9Armstrong Manuvakola Ezequias Ngolo,Teiji Watanabe.Integrating geographical information systems,remote sensing,and machine learning techniques to monitor urban expansion:an application to Luanda,Angola[J].Geo-Spatial Information Science,2023,26(3):446-464.

Journal of Data Analysis and Information Processing

2024年第3期

浏览历史

内容加载中请稍等...

A Machine Learning Classification Model for Detecting Prediabetes

相关作者

相关机构

相关主题

浏览历史