期刊文献+

基于集成学习模型预测重症患者再入重症监护病房的风险 被引量:8

Prediction of intensive care unit readmission for critically ill patients based on ensemble learning
下载PDF
导出
摘要 目的:基于集成学习算法建立患者再入重症监护病房(intensive care unit,ICU)的风险预测模型,并比较各个模型的预测性能。方法:使用美国重症医学数据库(medical information mart for intensive care,MIMIC)-Ⅲ,根据纳入、排除标准筛选患者,提取人口学特征、生命体征、实验室检查、合并症等可能对结局有预测作用的变量,基于集成学习方法随机森林、自适应提升算法(adaptive boosting,AdaBoost)和梯度提升决策树(gradient boosting decision tree,GBDT)建立再入ICU预测模型,并比较集成学习与Logistic回归的预测性能。使用五折交叉验证后的平均灵敏度、阳性预测值、阴性预测值、假阳性率、假阴性率、受试者工作特征曲线下面积(area under the receiver operating characteristic curve,AUROC)和Brier评分评价模型效果,基于最佳性能模型给出重要性排序前10位的预测变量。结果:所有模型中,GBDT(AUROC=0.858)优于随机森林(AUROC=0.827),略好于AdaBoost(AUROC=0.851)。与Logistic回归(AUROC=0.810)相比,集成学习算法在区分度上均有较大的提升。GBDT算法给出的变量重要性排序中,平均动脉压、收缩压、舒张压、心率、尿量、血肌酐等变量排序靠前,相对而言,再入ICU患者的心血管功能和肾功能更差。结论:基于集成学习算法的患者再入ICU预测模型表现出较好的性能,优于Logistic回归。使用集成学习算法建立的再入ICU风险预测模型可用于识别再入ICU风险高的患者,医务人员可针对高风险患者采取干预措施,改善患者的整体临床结局。 Objective:To develop machine learning models for predicting intensive care unit(ICU)readmission using ensemble learning algorithms.Methods:A publicly accessible American ICU database,medical information mart for intensive care(MIMIC)-Ⅲas the data source was used,and the patients were selected by the inclusion and exclusion criteria.A set of variables that had the predictive ability of outcome including demographics,vital signs,laboratory tests,and comorbidities of patients were extracted from the dataset.We built the ICU readmission prediction models based on ensemble learning methods including random forest,adaptive boosting(AdaBoost),and gradient boosting decision tree(GBDT),and compared the prediction performance of the machine learning models with a conventional Logistic regression model.Five-fold cross validation was used to train and validate the prediction models.Average sensitivity,positive prediction value,negative prediction value,false positive rate,false negative rate,area under the receiver operating characteristic curve(AUROC)and Brier score were used as performance measures.After constructing the prediction models,top 10 predictive variables based on importance ranking were identified by the model with the best discrimination.Results:Among these ICU readmission prediction models,GBDT(AUROC=0.858)had better performance than random forest(AUROC=0.827),and was slightly superior to AdaBoost(AUROC=0.851)in terms of AUROC.Compared with Logistic regression(AUROC=0.810),the discrimination of the three ensemble learning models was much better.The feature importance provided by GBDT showed that the top ranking variables included vital signs and laboratory tests.The patients with ICU readmission had higher mean arterial pressure,systolic blood pressure,diastolic blood pressure,and heart rate than the patients without ICU readmission.Meanwhile,the patients readmitted to ICU experienced lower urine output and higher serum creatinine.Overall,the patients having repeated admissions during their hospitalization showed worse heart function and renal function compared with the patients without ICU readmission.Conclusion:The ensemble learning based ICU readmission prediction models had better performance than Logistic regression model.Such ensemble learning models have the potential to aid ICU physicians in identifying those patients with high risk of ICU readmission and thus help improve overall clinical outcomes.
作者 林瑜 吴静依 蔺轲 胡永华 孔桂兰 LIN Yu;WU Jing-yi;LIN Ke;HU Yong-hua;KONG Gui-lan(National Institute of Health Data Science, Peking University, Beijing 100191, China;Department of Epidemiology and Biostatistics, Peking University School of Public Health, Beijing 100191, China;Advanced Institute of Information Technology, Peking University, Hangzhou 311215, China;Peking University Medical Informatics Center, Beijing 100191, China)
出处 《北京大学学报(医学版)》 CAS CSCD 北大核心 2021年第3期566-572,共7页 Journal of Peking University:Health Sciences
基金 国家自然科学基金(81771938、91846101) 北京市自然科学基金(7212201) 北京大学医学部-密歇根大学医学院转化医学与临床研究联合研究所项目(BMU2020JI011)。
关键词 重症监护病房 病人再入院 机器学习 试验预期值 Intensive care units Patient readmission Machine learning Predictive value of tests
  • 相关文献

同被引文献80

引证文献8

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部