期刊文献+

随机生存森林模型在肺癌患者预后分析中的应用 被引量:12

Application of Random Survival Forests Model in Prognosis Analysis of Lung Cancer Patients
下载PDF
导出
摘要 目的应用随机生存森林模型探讨肺癌患者预后影响因素的重要性并对预测结果进行评价。方法对山西省某三甲医院342例确诊的肺癌患者进行随访研究,建立随机生存森林模型,并与传统的Cox回归模型进行比较。结果342例肺癌患者中226例患者发生死亡,中位生存时间为28.23月。治疗方式、肿瘤大小、临床分期等变量是影响肺癌患者预后的重要因素,淋巴结转移、分化程度、病理分型、年龄是中度预测因素,并分析了变量之间的交互作用。二者的模型比较结果显示随机生存森林模型预测错误率以及预测误差均低于Cox回归模型。结论随机生存森林模型拟合效果好,可用于右删失生存数据的分析,不但能发现重要的影响因素,还能发现变量之间的交互作用,为肺癌患者预后状况的改善,提升生命质量提供科学依据。 Objective To explore the importance of prognostic factors of lung cancer patients using random survival forests model and predict the prognosis.Methods A follow-up study was conducted on 342 confirmed lung cancer patients in a top three hospital in Shanxi Province.A random survival forests model was established and compared with the traditional Cox model.Results Of the 342 lung cancer patients,226 died,with a median survival time of 28.23 months.Treatment methods,tumor size,clinical stage are important factors affecting the prognosis of patients with lung cancer.Lymph node metastasis,degree of differentiation,pathological classification,and age are moderate predictive factors.The interaction between variables was analyzed.The comparison results show that the consistency error rate and prediction error of the RSF model are lower than the Cox regression model.Conclusion The random survival forests model can be used for the analysis of right censored survival data.It can not only find important influencing factors,but also discover the interaction between variables,and provide a scientific basis for improving the prognosis of lung cancer patients and improving the quality of life.
作者 李淼 罗天娥 郭强 于智凯 赵晋芳 段燕 Li Miao;Luo Tiane;Guo Qiang(Department of Biostatistics,School of Public Health,Shanxi Medical University,030001,Taiyuan)
出处 《中国卫生统计》 CSCD 北大核心 2021年第3期327-331,共5页 Chinese Journal of Health Statistics
基金 山西省面上自然基金项目(201801D121210) 国家青年科学基金项目(81001294)。
关键词 随机生存森林模型 COX回归模型 肺癌 预后分析 Random survival forests model Cox regression model Lung cancer Prognosis analysis
  • 相关文献

参考文献1

二级参考文献11

  • 1Breiman L. Random forests. Machine Learning,2001,45( 1) :5-32.
  • 2Ishwaran H, Kogalur UB,Blackstone EH, et al. Random survival for-ests. The Annals of Applied Statistics, 2008,2(3) : 841-860.
  • 3Ishwaran H, Kogalur UB. Random survival forests for R. Rnews,2007,7(2) :25-31.
  • 4Evers L, Messow CM. Sparse kernel methods for high-dimensional sur-vival data. Bioinformatics,2008,24(14) :1632-1638.
  • 5Li H, Gui J. Partial Cox regression analysis for high-dimensional mi-croarray gene expression data. Bioinformatics,2004,20(4) :208-215.
  • 6Ishwaran H,Kogalur UB,Gorodeski EZ,et al. High-Dimensional Varia-ble Selection for Survival Data. Journal of the American Statistical As-sociation,2010,105(489) :205-217.
  • 7Ishwaran H, Kogalur UB, Chen X, et al. Random survival forests forhigh-dimensional data. Statistical Analysis and Data Mining,2011, 4(1):1]5-132.
  • 8Harrell FE, Lee KL, Mark DB, et al. Evaluating the yield of medicaltests. Journal of the American Statistical Association, 1982,247 (18):2543-2546.
  • 9Rosenwald A, Wright G, Wing C,et al. The use of molecular profilingto predict survival after chemotherapy for diffuse large-B-cell lympho-ma. The New England Journal of Medicine, 2002, 346 ( 25 ) : 1937-1947.
  • 10Bullinger L,Dohner K,Bair E,et al. Gene expression profiling identifiesnew subclasses and improves outcome prediction in adult myeloid leu-kemia. The New England Journal of Medicine, 2004,350 (16) : 1605-1616.

共引文献5

同被引文献73

引证文献12

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部