A new method for splice site prediction based on the sequence patterns of splicing signals and regulatory elements (Cited by: 3)
Abstract Developing novel algorithms that combine the sequence patterns of regulatory elements, such as enhancers and silencers, with the patterns of splicing signals is important for splice site prediction. In this paper, a statistical model of splicing signals was built based on the entropy density profile (EDP) method, the weight array method (WAM), and the κ test; in addition, a model of splicing regulatory elements was developed using an unsupervised self-learning method to detect motifs associated with regulatory elements. With the two models incorporated, a multi-level support vector machine (SVM) system was devised to perform ab initio prediction of splice sites from DNA sequences in eukaryotic genomes.
Results of large-scale tests on human genomic splice sites show that the new method achieves comparatively high performance in splice site prediction. The method performs at least as well as, and usually better than, the existing SpliceScan method, which is based on modeling regulatory elements, and it achieves higher accuracy than traditional methods that model splicing signals, such as GeneSplicer. In particular, the method has a clear advantage in splice site prediction for genes with lower GC content.
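To make one ingredient of the abstract concrete, the weight array method (WAM) is commonly described as a position-specific first-order Markov model over an aligned window around a candidate site. The following is a minimal sketch under that reading, not the authors' implementation: the toy donor-like windows, the decoy windows, the pseudocount, and all function names are assumptions chosen for illustration, and the log-odds score against a decoy model is one common way to use such a model.

```python
import math

ALPHABET = "ACGT"

def train_wam(seqs, pseudocount=1.0):
    """Fit a position-specific first-order Markov model to aligned,
    equal-length sequences. Returns (first_p, trans_p)."""
    length = len(seqs[0])
    # Counts for the nucleotide at the first position.
    first = {a: pseudocount for a in ALPHABET}
    # trans[i][prev][cur]: counts for the step from position i to i + 1.
    trans = [
        {p: {c: pseudocount for c in ALPHABET} for p in ALPHABET}
        for _ in range(length - 1)
    ]
    for s in seqs:
        first[s[0]] += 1
        for i in range(1, length):
            trans[i - 1][s[i - 1]][s[i]] += 1
    total = sum(first.values())
    first_p = {a: first[a] / total for a in ALPHABET}
    trans_p = [
        {p: {c: row[c] / sum(row.values()) for c in ALPHABET}
         for p, row in table.items()}
        for table in trans
    ]
    return first_p, trans_p

def log_prob(seq, model):
    """Log-probability of seq under a model returned by train_wam."""
    first_p, trans_p = model
    lp = math.log(first_p[seq[0]])
    for i in range(1, len(seq)):
        lp += math.log(trans_p[i - 1][seq[i - 1]][seq[i]])
    return lp

def wam_score(seq, true_model, false_model):
    """Log-odds of the site model against the decoy model."""
    return log_prob(seq, true_model) - log_prob(seq, false_model)

# Toy 9-nt windows with GT at positions 3-4 (hypothetical data).
true_sites = ["CAGGTAAGT", "AAGGTGAGT", "CAGGTAAGA", "GAGGTAAGT"]
decoy_sites = ["CTTGTCCAT", "ACCGTTTGC", "TGAGTCCGA", "GCTGTATCC"]

true_model = train_wam(true_sites)
false_model = train_wam(decoy_sites)

for candidate in ("CAGGTAAGT", "ACCGTTTGC"):
    print(candidate, round(wam_score(candidate, true_model, false_model), 3))
```

In a pipeline like the one the abstract outlines, a score of this kind would be only one feature among several passed to a downstream classifier such as an SVM; how the paper actually combines WAM with EDP and the κ test is not specified in this record.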
Source Chinese Science Bulletin (SCIE, EI, CAS), 2008, Issue 21, pp. 3331-3340 (10 pages)
Funding The State Basic Research Program of China (Grant No. 2003CB715905); the National Natural Science Foundation of China (Grant Nos. 30300071, 30770499 and 10721403); the Youth Foundation of the College of Engineering of Peking University
Keywords gene prediction; splice site; splicing signal; regulatory element