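基于碱基关联二联体位置权重矩阵预测酵母转录因子结合位点 (Predicting Yeast Transcription Factor Binding Sites Based on a Base-Correlated Dinucleotide Position Weight Matrix) Cited by: 3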

Recognition of the Transcription Factor Binding Sites in Saccharomyces cerevisiae Genome Based on Dinucleotides Position Weight Matrix
Abstract: Based on the known transcription factor binding sites (TFBS) in the Saccharomyces cerevisiae genome, a base-correlated dinucleotide position weight matrix for TFBS is constructed. By integrating this matrix with the site conservation index M2i, a new method for predicting yeast TFBS, the position weight matrix scoring algorithm (PWMSA), is proposed. Nine experimentally confirmed sets of yeast TFBS are used to train the algorithm, and its predictive capacity is tested by both self-consistency and 10-fold cross-validation; both tests give high prediction success rates. The overall prediction success rate for the nine transcription factors reaches 81.1%, clearly higher than that of the mononucleotide position weight matrix. In a comparison with ten existing TFBS prediction programs, using the new performance measures and a benchmark data set, PWMSA attains correlation coefficients of 0.42 at the nucleotide level and 0.52 at the binding-site level, outperforming the existing methods.
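Authors: YANG Keli (杨科利), XU Qiang (许强)
Source: Life Science Research (《生命科学研究》), CAS, CSCD, 2008, No. 2, pp. 115-120 (6 pages)
Funding: Baoji University of Arts and Sciences Research Start-up Project for Master's Degree Holders (ZK0791, ZK0792)
Keywords: transcription factor binding sites (TFBS); position weight matrices (PWM); site conservation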
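
The abstract names two ingredients, a dinucleotide position weight matrix and a per-position conservation index, but does not give the PWMSA formulas. The following Python sketch shows one generic way such a score could be assembled; it is not the authors' algorithm. Everything specific in it is an assumption made for illustration: the function names, the uniform dinucleotide background, the pseudocount of 1, and the entropy-based conservation weight that stands in for M2i. Input sequences are assumed to be equal-length uppercase strings over A, C, G, T.

```python
import math
from collections import Counter

BASES = "ACGT"
DINUCS = [a + b for a in BASES for b in BASES]  # the 16 adjacent-base pairs


def _dinuc_probs(sites, i, pseudocount):
    """Smoothed dinucleotide frequencies at positions (i, i+1) of the aligned sites."""
    counts = Counter(s[i:i + 2] for s in sites)
    total = len(sites) + pseudocount * len(DINUCS)
    return {d: (counts[d] + pseudocount) / total for d in DINUCS}


def build_dinucleotide_pwm(sites, background=None, pseudocount=1.0):
    """One column per adjacent-base position: dinucleotide -> log-odds weight."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    if background is None:
        # Assumption: uniform background; a genome-derived one could be passed in.
        background = {d: 1.0 / len(DINUCS) for d in DINUCS}
    pwm = []
    for i in range(length - 1):
        probs = _dinuc_probs(sites, i, pseudocount)
        pwm.append({d: math.log(probs[d] / background[d]) for d in DINUCS})
    return pwm


def conservation_weights(sites, pseudocount=1.0):
    """Hypothetical stand-in for a conservation index: 1 - H/ln(16), in [0, 1]."""
    weights = []
    for i in range(len(sites[0]) - 1):
        probs = _dinuc_probs(sites, i, pseudocount)
        entropy = -sum(p * math.log(p) for p in probs.values())
        weights.append(1.0 - entropy / math.log(len(DINUCS)))
    return weights


def score(candidate, pwm, weights):
    """Conservation-weighted sum of dinucleotide log-odds for one candidate site."""
    return sum(w * col[candidate[i:i + 2]]
               for i, (col, w) in enumerate(zip(pwm, weights)))


def scan(sequence, pwm, weights, threshold):
    """Slide a window over the sequence; return (start, score) for hits >= threshold."""
    width = len(pwm) + 1
    hits = []
    for start in range(len(sequence) - width + 1):
        s = score(sequence[start:start + width], pwm, weights)
        if s >= threshold:
            hits.append((start, s))
    return hits


if __name__ == "__main__":
    # Toy aligned sites, invented for illustration only (not data from the paper).
    sites = ["TGACTC", "TGACTA", "TGAGTC", "TGACTC"]
    pwm = build_dinucleotide_pwm(sites)
    weights = conservation_weights(sites)
    print(scan("AATGACTCAA", pwm, weights, threshold=0.0))
```

A site of length n yields n - 1 dinucleotide columns, so the matrix captures dependence between neighbouring bases that a mononucleotide matrix treats as independent. In practice the threshold would be chosen from the score distribution of the training sites, for example within a cross-validation loop.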