期刊文献+

基于高斯混合和BP神经网络的卵巢癌质谱数据三分类模型 被引量:1

A Tri-classification Model of Ovarian Cancer Mass Spectrometry based on Gaussian Mixture and BP Neural Network
原文传递
导出
摘要 癌症的早期诊断可以显著提高癌症患者的存活率,三分类问题就是将未知样本与已知样本进行匹配度检测,预测样本是健康状态,良性发展状态,还是癌症状态.针对复杂难分的卵巢癌蛋白质质谱数据,提出了一种基于高斯混合模型和BP神经网络的三分类预测模型.首先,去除原数据中的冗余,对其进行方差排序及交集筛选提取特征集合一,再利用高斯混合模型处理求得参数作为特征集合二,最后使用BP神经网络进行样本三分类,准确率达到72.9%.结果表明:模型可以作为卵巢癌质谱数据三分类的可选择工具. Early diagnosis of cancer can significantly improve the survival rate of cancer patients.Tri-classification problem is to detect the matching degree between unknown and known samples and to predict whether the samples is healthy,benign or cancer.According to the complex and difficult mass spectra of ovarian cancer proteins,this paper presented a tri-classification method based on Gaussian mixture model and BP neural network algorithm.Firstly,remove redundancy from the original data.Secondly,extract the first feature set with variance sequencing and intersection screening,and obtain the second feature set by Gaussian mixture model processing.Finally,predict sample classification by BP neural network.The accuracy of the proposed method is 72.9%,higher than the known result.
作者 马敬山 魏东 任福全 李玉双 MA Jing-shan;WEI Dong;REN Fu quan;LI Yu-shuang(School of Science,Yanshan University,Qinhuangdao 066004,China;Hebei Dataport Technology Co.,Ltd,Qinhuangdao 066004,China)
出处 《数学的实践与认识》 北大核心 2020年第7期147-153,共7页 Mathematics in Practice and Theory
基金 国家自然科学基金(61807029)。
关键词 卵巢癌质谱数据 高斯混合模型 BP神经网络 三分类 mass spectrometric data for ovarian cancer Gaussian mixture model BP neural network tri-classification
  • 相关文献

参考文献5

二级参考文献25

  • 1常青,王仰麟,吴健生,李双成.城市土地集约利用程度的人工神经网络判定——以深圳市为例[J].中国土地科学,2007,21(4):26-31. 被引量:78
  • 2Vorderwubecke S, Cleverley S, Weinberger SR, et al. Protein quantification by the SELDI-TOF-MS-based protein chip system[J]. Nat Methods, 2005, 2(5):393.
  • 3Somorjai RL, Dolenko B, Baumgartner R. Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions[ J]. Bioinforma- tics, 2003, 19 (12) : 1484-1491.
  • 4Dueck D, Frey BJ. Clustering by passing messages between data points[J]. Science ,2007,315 (5814) :972-976.
  • 5Zhu Xiaojin. Semi-supervised learning literature survey [D]. Madison : University of Wisconsin, 2007.
  • 6Zha Zhengjun, Mei Tao. Graph-based semi-supervised learning with multiple labels [ J ]. Visual hnage, 2009, 20 ( 2 ) : 97- 103.
  • 7Yasui Y, Pepe M, Thompson M, et al. A data-analytic strategy for protein biomarker discovery : profiling of high dimensional proteomic data for cancer detection [J]. Biostatistics, 2003, 4(3) : 449-463.
  • 8Pascal, Caroline T. Protein mass spectra data analysis for clinical biomarker discovery : a global review [J].Brief Bioin- form, 2011, 12(2) : 1_76-186.
  • 9Chapelle O, Scholkopf B. Semi-supervised learning [ M ]. Cambridge: MIT Press, 2006: 193-196.
  • 10Zhou D ,Bousquel O, Lal TN, et al. Learning with local and global eonsisteney[ C]. Proe of Advanees in Neural Informa- tion Proeessing Systems. Massaehusetts: MIT Press, 2003: 321-328.

共引文献43

同被引文献3

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部