摘要
目的探讨糖尿病早期肾病的相关危险因素,并基于BP神经网络算法构建其风险预测模型。方法回顾性分析永康某中医院2020年1月至2022年12月收治的1048例糖尿病患者,其中糖尿病肾病患者115例,占10.97%,并以此分为DKD组(糖尿病肾病组115例)和DM组(糖尿病组933例)。收集患者相关资料,采用倾向性评分匹配(PSM)排除混杂因素后按1∶1最邻近方法进行匹配。以单因素分析中具有统计学意义的指标,运用BP神经网络算法基于相关因素构建预测模型。以平均绝对值误差(MAE)进行模型效能分析,以受试者工作特征曲线(ROC)评估风险预测模型的预测价值,并进行外部验证,采用校准曲线评估模型一致性。结果混杂因素有性别、合并高血压、空腹血糖、尿酸,将建模集按1∶1比例以最邻近方法进行PSM排除混杂因素后,DKD组95例,DM组95例。单因素分析结果提示患者年龄、2型糖尿病、总胆固醇(TC)、尿蛋白排泄率、糖尿病病程、胱抑素C(Cys C)组间差异具有统计学意义(P<0.05)。预测精度从大到小依次为BP神经网络算法、决策树、支持向量机、逻辑回归。BP神经网络结果显示影响糖尿病早期肾病发生重要性的前4位因素依次为蛋白尿排泄率、年龄、糖尿病病程、Cys C。预测模型AUC为0.959(95%CI:0.917~1.000),约登指数0.867,对应的敏感度与特异性分别为0.867、1.000。外部验证AUC为0.958(95%CI:0.922~0.995),其敏感度与特异性分别为0.804、1.000,校准图中校准曲线贴近于标准曲线。结论基于机器学习法构建的以年龄、病程、尿蛋白排泄率、TC、Cys C、2型糖尿病为预测特征的BP神经网络算法模型对糖尿病早期肾病有较好的预测价值,可以把该模型临床应用于此类高风险人群的管理识别。
Objective To investigate the risk factors of early diabetic nephropathy and construct a risk prediction model based on BP neural network algorithm.Methods A total of 1048 diabetic patients admitted to Yongkang Hospital of Traditional Chinese Medicine from January 2020 to December 2022 were retrospectively analyzed,including 115 diabetic nephropathy patients(10.97%),and were divided into the DKD group(115 diabetic nephropathy patients)and the DM group(933 diabetic patients).The relevant data of patients were collected and matched according to 1∶1 nearest proximity method after the confounding factors were excluded by propensity score matching(PSM).The prediction model was built based on the correlation factors by using the statistically significant indicators in the single factor analysis and BP neural network algorithm.Mean absolute error(MAE)was used to analyze the model efficacy,the predictive value of the risk prediction model was evaluated by receiver operating characteristic curve(ROC),and external validation was performed.The model consistency was evaluated by calibration curve.Results The confounding factors were gender,hypertension,fasting blood glucose and uric acid.After the modeling set was 1∶1 and PSM was performed by the nearest method,the confounding factors were excluded:95 cases in DKD group and 95 cases in DM group.Univariate results indicated that there were significant differences in age,type 2 diabetes,total cholesterol,urinary protein excretion rate,diabetes course,and cystatin C(CysC)between groups(P<0.05).The prediction accuracy was BP neural network algorithm,decision tree,support vector machine and logistic regression in the descending order.The results of BP neural network showed that the top 4 factors affecting the occurrence of early diabetic nephropathy were proteinuria excretion rate,age,diabetes course and cystatin C(CysC)in order.The AUC of the prediction model was 0.959(95%CI:0.917-1.000),the Yoden index was 0.867,and the corresponding sensitivity and specificity were 0.867 and 1.000,respectively.The external validation AUC was 0.958(95%CI:0.922-0.995),and its sensitivity and specificity were 0.804 and 1.000,respectively.The calibration curve in the calibration diagram was close to the standard curve.Conclusion The BP neural network algorithm model based on machine learning,which takes age,disease course,urinary protein excretion rate,TC,CysC and type 2 diabetes as predictive features,has good predictive value for early diabetic nephropathy,and can be clinically applied to the management and identification of high-risk population.
作者
杜燕华
朱洪挺
Du Yanhua;Zhu Hongting(Protection Section,Yongkang Hospital of Traditional Chinese Medicine,Yongkang 321300,China;Yongkang Center for Disease Control and Prevention,Yongkang 321300,China)
出处
《中国医院统计》
2024年第2期95-101,共7页
Chinese Journal of Hospital Statistics
关键词
BP神经网络
糖尿病肾病
早期肾病
预测模型
影响因素
back propagation(BP)neural network
diabetic nephropathy
early kidney disease
predictive model
influencing factor