Stage prediction of lung adenocarcinoma based on deep learning and multi-omics data
Abstract To make cancer staging decisions more accurate, this study integrated three types of omics data from 452 lung adenocarcinoma patients: messenger ribonucleic acid (mRNA) transcript data, micro ribonucleic acid (miRNA) transcript data, and DNA methylation data, and used a random forest algorithm to predict stage. First, the three types of omics data, obtained from The Cancer Genome Atlas (TCGA) database, were preprocessed, and the mRNA transcript data were matched with the DNA methylation data at gene loci. Next, four different multi-omics integration strategies were applied to the preprocessed data. Finally, a random forest algorithm predicted stage from the integrated data, with accuracy, the Kappa coefficient, and the area under the curve (AUC) as evaluation metrics. The results show that multi-omics integration strategies achieve higher accuracy in stage prediction. The deep-learning-based integration strategy performed best, with accuracy, Kappa coefficient, and AUC values of 0.940, 0.931, and 0.986, respectively, and shows promise for future lung adenocarcinoma stage prediction.
Authors LIU Dezhen (刘德真); LI Yuanyuan (李圆媛) (School of Optical Information and Energy Engineering, School of Mathematics and Physics, Wuhan Institute of Technology, Wuhan 430205, China)
Source Journal of Wuhan Institute of Technology (《武汉工程大学学报》), CAS, 2024, No. 2, pp. 190-196 (7 pages)
Funding National Natural Science Foundation of China (12001408).
Keywords staging of lung adenocarcinoma; deep learning; integration strategy; random forest algorithm
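The abstract names accuracy, the Kappa coefficient, and AUC as the evaluation metrics. The following is a minimal sketch of how such metrics can be computed, not the authors' code. The function names and the toy stage labels and scores are invented for illustration, and the AUC function covers only the two-class case, since the record does not say how AUC was aggregated across stages.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted stage equals the true stage."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e).

    p_o is the observed agreement (the accuracy); p_e is the agreement
    expected by chance from the marginal label frequencies.
    """
    n = len(y_true)
    p_o = accuracy(y_true, y_pred)
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_e == 1.0:
        return 1.0  # degenerate case: one class in both label sets
    return (p_o - p_e) / (1 - p_e)


def binary_auc(labels, scores):
    """AUC for two classes: the probability that a positive sample is
    scored above a negative one, counting ties as one half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT taken from the study.
y_true = ["I", "I", "II", "II", "III", "IV"]
y_pred = ["I", "II", "II", "II", "III", "IV"]
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]

print("accuracy:", accuracy(y_true, y_pred))
print("kappa:", cohen_kappa(y_true, y_pred))
print("binary AUC:", binary_auc(labels, scores))
```

Kappa discounts agreement that would occur by chance, so it is a stricter figure than raw accuracy when stage classes are imbalanced.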