期刊文献+

估计一个蛋白质家族属于一个新折叠子的概率

ESTIMATE THE PROBABILITY OF WHETHER A PROTEIN FAMILY CONTRIBUTES A NEW FOLDS
下载PDF
导出
摘要 关于蛋白质家族、结构和新功能的统计推断是应用数理统计的一个前沿交叉研究方向.本文以蛋白质结构分类数据库SCOP和序列分类数据库Pfam为基础,结合SCOP数据库的动态信息,我们估计出覆盖当前Pfam数据库所需的折叠子总数;通过SCOP中新增家族在Pfam中的对应家族所属的折叠子是否已知为先验信息构建贝叶斯模型,估计了不同规模的Pfam家族贡献新折叠子的概率分布. The statistical inference for the protein families,structures and new functions is a frontier research field in the applied statistics.Based on the SCOP(Structural Classification of Proteins database)and Pfam(Sequence Classification database),and the dynamic information of the SCOP database,we first estimate the total number of the folds,which are needed to cover the current Pfam database.By constructing the Bayesian Model with the prior information of whether the folds are previously known which cover the Pfam families mapping by the newly appeared SCOP families,we then estimate the probability distributions of whether a Pfam family with given size contributes a new folds.
作者 吕波 刘心声
出处 《南京大学学报(数学半年刊)》 CAS 2011年第1期88-96,共9页 Journal of Nanjing University(Mathematical Biquarterly)
基金 国家自然科学基金(10971097 10732040) 国家973计划(2007CB936204)资助
关键词 蛋白质家族 新折叠子 概率 贝叶斯方法 protein family folds probability Bayesian method
  • 相关文献

参考文献17

  • 1Chandonia J M,Brenner S E.The Impact of Structural Genomics:Expectations and Outcomes. Science . 2006
  • 2Portugaly E,Linial M.Estimating the Probability for a Protein to have a New fold:A Statistical Computational Model. Proceedings of the National Academy of Sciences of the United States of America . 2000
  • 3Wolf Y I,Grishin N V,Koonin EV.Estimating the Number of Protein Folds and Families from Complete Genome Data. Journal of Molecular Biology . 2000
  • 4Liu X,Fan K,Wang W.The Number of Protein Folds and Their Distribution over Families in Nature. Proteins . 2004
  • 5Kunin V,Cases I,Enright A J,de Lorenzo V,Ouzounis C A.Myriads of Protein Families,and Still Counting. Genome Biology . 2003
  • 6Finn RD,Tate J,Mistry J, et al.The Pfam protein families database. Nucleic Acids Research . 2008
  • 7Bairoch A,Apweiler R,Wu CH,et al.The Universal Protein Resource (UniProt). Nucleic Acids Research . 2005
  • 8Benson DA,Karsch-Mizrachi I,Lipman DJ,et al.GenBank. Nucleic Acids Research . 2005
  • 9Andreeva A,Howorth D,Brenner SE,et al.SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Research . 2004
  • 10Murzin AG,Brenner SE,Hubbard T,et al.SCOP: a structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology . 1995

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部