期刊文献+

基于相关性分析的微阵列数据集成分类研究 被引量:5

Ensemble Classification of Microarray Data Based on Correlation Analysis
下载PDF
导出
摘要 基于微阵列数据的肿瘤诊断方法有望在不久的将来成为临床医学上一种快速且有效的分子层肿瘤诊断方法,但由于微阵列数据存在高维小样本的特点,因而对传统的分类方法提出了挑战,为此研究人员开始关注于性能更好的集成分类算法.针对现有的微阵列数据集成分类算法分类精度不高、计算量过大等问题,提出了一种基于相关性分析的微阵列数据集成分类算法.该算法可以通过计算训练子集间的相关性挑选出差异度最大的一组子集来进行训练,有效地增强了集成中的多样性.应用支持向量机作为基分类器,在急性白血病与结肠癌数据集上的实验结果表明了所提算法的有效性和可行性.同时,测试了算法在不同参数设置下的性能,测试结果为合理的参数设置提供了参考依据. The tumor diagnosis method based on microarray data will be developed into a fast and effective molecular-level diagnosis method applied in clinic in the near future. However, it is a challenging task for traditional classification approaches due to the characteristics of high dimensionality and small samples for microarray data. Therefore, ensemble classification algorithms with better performance have attracted more researchers. A novel ensemble classification algorithm for microarray data based on correlation analysis is proposed in this paper to solve the problems of low classification accuracy and excessive computation for current ensemble classification algorithms. The proposed algorithm may extract some training subsets which have the most difference between each other by computing their correlation. Therefore, the proposed algorithm could effectively improve diversity among base classifiers. Support vector machine is selected as base classifier in this paper and the experiment results on leukemia dataset and colon tumor dataset show the effectiveness and feasibility of the proposed algorithm. Meanwhile, the performances of the proposed algorithm based on different parameters are tested and the results are helpful for selecting appropriate parameters.
出处 《计算机研究与发展》 EI CSCD 北大核心 2010年第2期328-335,共8页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60873036) 中国博士后科学基金资助项目(20060400809) 黑龙江省青年科技基金项目(QC06C022)~~
关键词 集成分类 微阵列数据 相关性分析 特征选择 支持向量机 ensemble classification microarray data correlation analysis feature selection support vector machine
  • 相关文献

参考文献18

  • 1Alon U, Barkai N, Notterman D A, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide array [J]. Proc of the National Academy of Sciences, 1999, 96 (12) : 6745-6750.
  • 2Golub T R, Slonim D K, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring [J]. Science, 1999, 286 (5439) : 531-537.
  • 3Guyon I, Weston J, Barnhill S, et al. Gene selection for cancer classification using support vector machines [J]. Machine Learning, 2002, 46(1): 389-422.
  • 4Dettling M. Bagboosting for tumor classification with gene expression data [J]. Bioinformatics, 2004, 20(18): 3583- 3593.
  • 5Bertoni A, Folgieri R, Valentini G. Bio-molecular cancer prediction with random subspace ensembles of support vector machines [J]. Neurocomputing, 2005, 63(1): 535-539.
  • 6Hu H, Li J Y, Wang H, et al. A maximally diversified multiple decision tree algorithm for microarray data classification [C] //Proc of 2006 Workshop on Intelligent Systems for Bioinformatics. Hobart, Australia: CRPIT, 2007 : 35-38.
  • 7Peng Y H. A novel ensemble machine learning for robust mieroarray data classification [J]. Computers in Biology and Medicine, 2006, 36(6) : 553-573.
  • 8Kim K J, Cho S B. An evolutionary algorithm approach to optimal ensemble classifiers for DNA microarray data analysis [J]. IEEE Trans on Evolutionary Computation, 2008, 12 (3) : 377-388.
  • 9Chen Y H, Zhao Y O. A novel ensemble of classifiers for microarray data classification [J]. Applied Soft Computing, 2008, 8(4): 1664-1669.
  • 10Krogh A, Vedelsby J. Neural network ensembles, cross validation, and active learning [G] //Tesauro G, Touretzky D S, Leen T K. Advances in Neural Informatior; Processing Systems. Cambridge, MA: MIT Press, 1995:231-238.

二级参考文献53

共引文献63

同被引文献75

引证文献5

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部