摘要
Diabetes is a chronic health condition that impairs the body’s ability to convert food to energy,recognized by persistently high levels of blood glucose.Undiagnosed diabetes can cause many complications,including retinopathy,nephropathy,neuropathy,and other vascular disorders.Machine learning methods can be very useful for disease identification,prediction,and treatment.This paper proposes a new ensemble learning approach for type 2 diabetes prediction based on a hybrid meta-classifier of fuzzy clustering and logistic regression.The proposed approach consists of two levels.First,a baselearner comprising six machine learning algorithms is utilized for predicting diabetes.Second,a hybrid meta-learner that combines fuzzy clustering and logistic regression is employed to appropriately integrate predictions from the base-learners and provide an accurate prediction of diabetes.The hybrid metalearner employs the Fuzzy C-means Clustering(FCM)algorithm to generate highly significant clusters of predictions from base-learners.The predictions of base-learners and their fuzzy clusters are then employed as inputs to the Logistic Regression(LR)algorithm,which generates the final diabetes prediction result.Experiments were conducted using two publicly available datasets,the Pima Indians Diabetes Database(PIDD)and the Schorling Diabetes Dataset(SDD)to demonstrate the efficacy of the proposed method for predicting diabetes.When compared with other models,the proposed approach outperformed them and obtained the highest prediction accuracies of 99.00%and 95.20%using the PIDD and SDD datasets,respectively.
基金
The authors extend their appreciation to the Deputyship for Research&Inno-vation,Ministry of Education in Saudi Arabia“for funding this research work through the project number IFPHI-193-830-2020”and King Abdulaziz University,DSR,Jeddah,Saudi Arabia.