
Predicting Drosophila melanogaster Promoter Using the Algorithm Increment of Diversity and Support Vector Machines

Cited by: 1
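Abstract

Objective: To improve theoretical methods for predicting eukaryotic promoters. Methods: Based on the signal features and content features of promoter sequences, six standard diversity sources were constructed, and the increment of diversity of each sequence relative to each standard source was computed. Position weight matrices were built for the promoter signal features, and the position weight scoring function was calculated at the corresponding positions. The two classes of parameters were then input into a support vector machine to predict Drosophila promoters. Results: The algorithm was tested with both self-consistency and cross-validation, and both gave high prediction success rates; the success rates for all five kinds of transcription factor binding sites exceeded 91%. Conclusion: Combining the increment of diversity algorithm with a support vector machine effectively improves the prediction success rate and is an effective method for eukaryotic promoter prediction.

English abstract

Objective: To improve the predictive capacity of the algorithm for eukaryotic promoter sequences. Method: Content vectors and signal vectors were extracted from the promoter sequences based on six least increments of diversity, three kinds of position weight matrix, and the GC percentage of the sequences. The calculated vectors were input into a support vector machine (SVM) algorithm to build a promoter classification model. Result: The human Pol II promoter sequences were predicted using the support vector machine, and 10-fold cross-validation and independent test data were used to validate the SVM model. The results show that the overall prediction accuracy (sensitivity) and specificity both exceed 91%. Conclusion: These results indicate that the increment of diversity and support vector machines algorithm is an effective method for predicting eukaryotic promoter sequences.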
Authors: 杨科利 (Yang Keli), 许强 (Xu Qiang)
Source: 《生物技术》 (Biotechnology), indexed in CAS and CSCD, 2008, No. 2, pp. 39-42 (4 pages)
Funding: Baoji University of Arts and Sciences Master's Research Start-up Project (ZK0791, ZK0792)
Keywords: promoter sequences; position weight matrix (PWM); support vector machines; signal parameters; content parameters; increment of diversity
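The abstract names two kinds of features without giving formulas. The sketch below illustrates how such features are commonly computed, under stated assumptions: the diversity measure is taken as D = N log2 N − Σ n_i log2 n_i with the increment of diversity ID(X, S) = D(X + S) − D(X) − D(S), and the PWM score is a log-odds sum against a uniform background. The k-mer size, the pseudocount, the background frequency, and all function names are hypothetical and are not taken from the paper. The resulting vector would then be passed to an SVM, which is not shown here.

```python
import math
from collections import Counter


def kmer_counts(seq, k):
    """Count overlapping k-mers in a DNA sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def diversity(counts):
    """Diversity measure D = N*log2(N) - sum(n_i*log2(n_i)) (assumed form)."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return total * math.log2(total) - sum(
        n * math.log2(n) for n in counts.values() if n > 0
    )


def increment_of_diversity(x, s):
    """ID(X, S) = D(X + S) - D(X) - D(S); smaller means more similar composition."""
    return diversity(x + s) - diversity(x) - diversity(s)


def build_pwm(sites, pseudocount=1.0):
    """Per-position base frequencies from aligned, equal-length sites."""
    total = len(sites) + 4 * pseudocount
    pwm = []
    for i in range(len(sites[0])):
        column = Counter(site[i] for site in sites)
        pwm.append({b: (column[b] + pseudocount) / total for b in "ACGT"})
    return pwm


def pwm_score(pwm, window, background=0.25):
    """Log-odds score of one window against a uniform background (assumption)."""
    return sum(math.log2(pwm[i][b] / background) for i, b in enumerate(window))


def best_pwm_score(pwm, seq):
    """Highest PWM score over all windows of the sequence."""
    width = len(pwm)
    return max(pwm_score(pwm, seq[i:i + width])
               for i in range(len(seq) - width + 1))


def feature_vector(seq, standard_sources, pwm, k=2):
    """Content features (one ID per standard source) plus one signal feature."""
    counts = kmer_counts(seq, k)
    content = [increment_of_diversity(counts, src) for src in standard_sources]
    return content + [best_pwm_score(pwm, seq)]
```

A sequence whose k-mer composition is proportional to that of a standard source has an increment of diversity of zero, so a small value indicates similarity to that source.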


