
Marker Gene Selection of Gastric Cancer Subtype Based on Multi Microarray Data Sets (cited by: 1)
Abstract: Analyzing gastric cancer microarray data with machine learning to discover new genes associated with gastric cancer subtype classification can provide markers and evidence for further study of the molecular mechanisms of gastric cancer and for its gene-level diagnosis and treatment. Most existing methods extract feature genes from a single data set; the sample size is small, and the extracted genes classify other data sets of the same kind poorly. This paper proposes a feature gene extraction method that combines a genetic algorithm (GA) with a support vector machine (SVM) and analyzes three gastric cancer microarray data sets in parallel. The experiment was run 4,580 times, and genes were ranked by the number of times they appeared in the final GA populations, a count that to some extent reflects a gene's importance for classification. The 20 most frequent genes were selected as marker genes, yielding genes that may play a key role in gastric cancer subtype classification (AGT, FBLN1, and others). With these genes, classification accuracy exceeds 90% on each of the three data sets. Analysis of the biological significance of the selected genes indicates that the method identifies gastric cancer subtype classification genes well and that the selected genes are important for the diagnosis and subtyping of human gastric cancer.
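The ranking step described in the abstract (counting how often each gene appears in the final GA populations over repeated runs, then keeping the most frequent genes) might be sketched as below. This is a minimal illustration and not the authors' implementation: the function name `rank_genes_by_occurrence`, the data layout (each individual represented as a set of gene names), and the toy gene names `G1` to `G3` are all assumptions made for the example. The GA and SVM fitness evaluation are not shown.

```python
from collections import Counter
from typing import Iterable, List, Sequence, Set, Tuple


def rank_genes_by_occurrence(
    runs: Iterable[Sequence[Set[str]]], top_k: int = 20
) -> List[Tuple[str, int]]:
    """Count gene occurrences over the final populations of all runs.

    `runs` holds one final population per GA run; each population is a
    sequence of individuals, and each individual is a set of gene names.
    (This data layout is an assumption, not taken from the paper.)
    """
    counts: Counter = Counter()
    for final_population in runs:
        for individual in final_population:
            counts.update(individual)  # each gene counted once per individual
    # Sort by descending count, then by name so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]


# Toy data: two runs; gene names other than AGT and FBLN1 are placeholders.
toy_runs = [
    [{"AGT", "FBLN1", "G1"}, {"AGT", "G2"}],
    [{"AGT", "FBLN1"}, {"FBLN1", "G3"}],
]
top_genes = rank_genes_by_occurrence(toy_runs, top_k=2)
print(top_genes)
```

With the default `top_k=20`, the same call would return the 20 most frequent genes, matching the cutoff stated in the abstract; how ties at the cutoff were handled in the paper is not stated, so the alphabetical tie-break here is only a convenience.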
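Source: Journal of Beijing University of Technology (《北京工业大学学报》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2013, No. 10, pp. 1590-1595 (6 pages)
Funding: National Science and Technology Major Project (2009ZX07212-003); Science and Technology Program of the Beijing Municipal Education Commission (JC002011200903)
Keywords: marker gene; gastric cancer; genetic algorithm (GA); support vector machine (SVM)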

