Constructing a prediction model for stroke patients' activities of daily living risk based on interpretable machine learning (cited by: 1)

Abstract Objective: To use machine learning algorithms to predict risk factors affecting the activities of daily living (ADL) of stroke patients, providing a reference for their ADL management decisions. Methods: A retrospective analysis was conducted on 423 stroke patients treated at the Rehabilitation Medicine Center of the First Affiliated Hospital of Nanjing Medical University from January 2015 to February 2019. Based on the Barthel Index (BI) assessment scale, patients were divided into a better ADL group (BI ≥ 60 points) and a poorer ADL group (BI < 60 points), and the data were preprocessed. Feature variables were selected using collinearity diagnostics and the least absolute shrinkage and selection operator (LASSO). Five machine learning algorithms were used for predictive modeling: logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and K-nearest neighbor (KNN). After ten-fold cross-validation, the models were evaluated using receiver operating characteristic (ROC) curves, area under the ROC curve (AUC), precision-recall (PR) curves, area under the precision-recall curve (PRAUC), accuracy, sensitivity, and specificity. Shapley additive explanations (SHAP) were then used to interpret the best-performing model. Results: After LASSO regression analysis, 16 feature variables were retained for constructing the machine learning models. The RF model had the highest AUC (0.74), PRAUC (0.64), accuracy (0.97), sensitivity (0.75), and specificity (0.97). SHAP analysis showed that, among the top five features contributing to ADL, Brunnstrom stage (lower limb) had the largest effect, followed by Brunnstrom stage (upper limb), D-dimer, serum albumin level, and age. Conclusion: The RF model was the most effective of the five models in predicting ADL in stroke patients and provides a valuable reference for ADL management decisions in this population.
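The grouping rule and three of the evaluation metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' code. The function names, the toy inputs, and the choice to treat the poorer ADL group (BI < 60) as the positive class are assumptions made for illustration only; the abstract does not state which group was coded as positive. The AUC is computed with the standard pairwise-ranking definition instead of a library call.

```python
import math
from typing import Dict, Sequence

BI_CUTOFF = 60  # cutoff stated in the abstract: BI >= 60 is the better ADL group


def adl_group(bi_score: float) -> int:
    """Return 1 for poorer ADL (BI < 60), 0 for better ADL (BI >= 60).

    Coding the poorer group as the positive class is an assumption.
    """
    return 1 if bi_score < BI_CUTOFF else 0


def confusion_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity from binary labels and predictions."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    total = tp + tn + fp + fn
    return {
        "accuracy": (tp + tn) / total if total else math.nan,
        "sensitivity": tp / (tp + fn) if (tp + fn) else math.nan,  # true positive rate
        "specificity": tn / (tn + fp) if (tn + fp) else math.nan,  # true negative rate
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        return math.nan
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example with made-up values (not data from the study).
bi_scores = [85, 40, 60, 55, 20, 95]
labels = [adl_group(b) for b in bi_scores]
risk_scores = [0.2, 0.9, 0.6, 0.4, 0.8, 0.1]      # hypothetical model outputs
predictions = [1 if s >= 0.5 else 0 for s in risk_scores]
print(confusion_metrics(labels, predictions))
print(roc_auc(labels, risk_scores))
```

Accuracy depends on the classification threshold and on class balance, while AUC summarizes ranking quality across all thresholds, which is why the two can differ substantially for the same model.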
Authors YE Qian (叶倩); YANG Yun (杨云); XU Wentao (徐文韬); LIU Lingling (刘玲玲). Affiliations: Center of Rehabilitation Medicine, the First Affiliated Hospital of Nanjing Medical University, Nanjing 210029; College of Acupuncture and Massage, Nanjing University of Chinese Medicine, Nanjing 210029; School of Psychology, Nanjing Normal University, Nanjing 210023, China
Source Journal of Nanjing Medical University (Natural Sciences) (《南京医科大学学报(自然科学版)》), indexed in CAS and the Peking University Core Journals list, 2024, Issue 5, pp. 672-680 (9 pages)
Funding National Natural Science Foundation of China (82104993).
Keywords machine learning; predictive modeling; stroke; activities of daily living