期刊文献+

基于位点相关概率模型的富亮氨酸重复序列预测

Sequence prediction of leucine-rich repeat based on position-related possibility model
下载PDF
导出
摘要 富亮氨酸重复序列(leucine-rich repeat,LRR)是一种广泛存在的蛋白质结构基序,在诸多重要生命过程中起关键性作用并与诸多人类疾病紧密相关。研究LRR中各个位点之间的氨基酸分布的相关性,并基于此相关性建立概率模型,可应用于序列水平上的LRR预测,以提高LRR预测的准确度。本文从LRRML数据库中提取已知的LRR蛋白质序列作为训练集和测试集;为LRR各个位点上氨基酸的分布数据构建4种不同的概率模型,包括位点相关和位点不相关概率模型;再通过机器学习和K-折交叉验证的方法,确定可以用于LRR预测的最佳模型。结果表明,位点相关概率模型和位点不相关概率模型以不同权重相加之后的综合模型在LRR预测中显示出高的准确度。LRR中各个位点之间的氨基酸分布存在一定的相关性,此相关性可作为重要参数应用于LRR预测。 Leucine-rich repeat(LRR)is a widely distributed protein motif,which is related to a large number of important life processes and human diseases.The correlation of amino acid distributions between different positions in LRRs was investigated,and the correlation was applied to sequence-level LRR predictions to improve the accuracy of LRR predictions.Known LRR protein sequences were extracted from the LRRML database as training set and test set.Four different possibility models were built for the amino acid distribution data at every position in LRRs,including position-related and position-irrelated models.The best model for LRR prediction was selected through machine-learning experiments with k-fold validations.A weighted model integrating aposition-related possibility model and a position-irrelated possibility model exhibited the highest accuracy in LRR prediction experiments.There is a correlation of amino acid distributions between different positions in LRRs,and this is significant enough to be used as an important parameter for LRR predictions.
出处 《中国科技论文》 CAS 北大核心 2015年第6期626-628,637,共4页 China Sciencepaper
基金 高等学校博士学科点专项科研基金资助项目(20110131120024 20110131120045)
关键词 生物信息学 富亮氨酸重复序列 序列算法 位点相关概率模型 bioinformatics leucine-rich repeat motif prediction position-related possibility model
  • 相关文献

参考文献6

  • 1Wei T, Gong J, Jamitzky F, et al. LRRML: a confor- mational database and an XML description of leucine- rich repeats (LRRs) [J]. BMC Structural Biology, 2008, 8(1).
  • 2Enkhbayer P, Kamiya M, Osaki M, et al. Structural principles of leucine-rich repeat (LRR) proteins [J]. Proteins, 2004, 54(3): 394-403.
  • 3Kajava A V, Kobe B. Assessment of the ability to mod- el proteins with leucine-rich repeats in light of the latest structural information [J]. Protein Science, 2002, 11 (5) : 1082-1090.
  • 4Bella J, Hindle K L, Mcewan P A, et al. The leucine- rich repeat structure [J]. Cellular and Molecular Life Sciences, 2008, 65(15): 2307-2333.
  • 5Matsushima N, Tanaka T, Enkhbayar P, et al. Com- parative sequence analysis of leucine-rich repeats (LRRs) within vertebrate toll-like receptors [J]. BMC Genomics, 2007, 8(1): article No. 124.
  • 6Berman H M, Westbrook J, Feng Z, et al. The protein data bank [J]. Nucleic Acids Research, 2000, 28(1) : 235-242.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部