Eukaryotic promoter LS-SVM with GMM kernel
Abstract: Eukaryotic promoter DNA sequences are structurally complex and the volume of data is very large, so promoter recognition has long been a difficult problem. In this paper, the positional distribution of oligonucleotides in eukaryotic promoter sequences is first modeled with a Gaussian mixture model (GMM). Because the positional density is independent of how often an oligonucleotide actually occurs, the model can pick out motifs that are infrequent but important; these motifs generally correspond to the consensus sequences of transcription factor binding sites. The GMM is then used as the kernel function of a least squares support vector machine (LS-SVM) classifier for eukaryotic promoters, which reduces the LS-SVM model to a least squares (LS) model and lowers the computational cost. The recognition results show that the algorithm is more accurate than a Bayesian recognition algorithm; compared with an LS-SVM with an RBF kernel, its accuracy is essentially the same and its model-building time is slightly shorter.
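Source: CIESC Journal (《化工学报》), 2013, No. 12, pp. 4662-4666 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61104093); Scientific Research Fund of Liaoning Province (L2012141); Teaching Research Fund of Liaoning Province (2011A017).
Keywords: Gaussian mixture model; kernel function; least squares support vector machine; deoxyribonucleic acid (DNA); model reduction; algorithm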
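
The first step described in the abstract, modeling where an oligonucleotide occurs with a Gaussian mixture, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`fit_gmm_1d`, `positional_density`), the two-component setting, the variance floor, and the synthetic positions (two clusters placed at hypothetical offsets of -30 and -80 relative to a reference site) are all assumptions made for illustration. The fit uses plain expectation-maximization on one-dimensional positions.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate Gaussian N(mean, var) at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(positions, n_components=2, n_iter=200, min_var=1.0):
    """Fit a 1-D Gaussian mixture to occurrence positions with plain EM.

    Returns (weights, means, variances) with components sorted by mean.
    """
    xs = sorted(positions)
    n = len(xs)
    # Deterministic initialisation: means at evenly spaced quantiles,
    # every component starting with the overall variance.
    means = [xs[int((k + 0.5) * n / n_components)] for k in range(n_components)]
    overall_mean = sum(xs) / n
    overall_var = max(sum((x - overall_mean) ** 2 for x in xs) / n, min_var)
    variances = [overall_var] * n_components
    weights = [1.0 / n_components] * n_components

    for _ in range(n_iter):
        # E-step: responsibility of each component for each position.
        resp = []
        for x in xs:
            joint = [w * normal_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint) or 1e-300  # guard against underflow to zero
            resp.append([j / total for j in joint])
        # M-step: re-estimate weight, mean and variance of each component.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, xs)) / nk
            variances[k] = max(var_k, min_var)  # floor avoids collapse

    order = sorted(range(n_components), key=lambda k: means[k])
    return ([weights[k] for k in order],
            [means[k] for k in order],
            [variances[k] for k in order])


def positional_density(x, weights, means, variances):
    """Mixture density at position x; it integrates to 1 over all positions."""
    return sum(w * normal_pdf(x, m, v)
               for w, m, v in zip(weights, means, variances))


# Synthetic, hypothetical occurrence positions of one oligonucleotide.
rng = random.Random(0)
positions = ([rng.gauss(-30.0, 3.0) for _ in range(60)]
             + [rng.gauss(-80.0, 6.0) for _ in range(40)])

weights, means, variances = fit_gmm_1d(positions)
print("weights:", weights)
print("means:", means)
print("variances:", variances)
```

Because the mixture weights sum to one, the fitted density describes where the oligonucleotide tends to occur rather than how many times it occurs, which is the property the abstract relies on for recovering infrequent motifs. How the fitted mixture is turned into the LS-SVM kernel is not specified in the abstract, so that step is left out of the sketch.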