Prediction of Protein-Protein Interaction Sites Using Support Vector Machine
Cited by: 1
Abstract: Identifying protein-protein interaction sites is important for mutant design and for predicting protein-protein interaction networks. Based on support vector machine (SVM) learning, this paper proposes an effective feature extraction method for predicting protein-protein interaction sites. The method builds feature vectors from the sequence profiles of the protein, the accessible surface area (ASA) of its residues, and the evolution rate of each residue. The data are trained and tested using 10-fold cross-validation. In the reported experiments the method reaches an accuracy of 72.19% (the English abstract in the source gives 72.91%), 5.71% higher than a method that uses only the sequence profiles and the evolution rate.
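Affiliation: Department of Mathematics, Shanghai University
Source: Journal of Applied Sciences (《应用科学学报》), 2008, No. 4, pp. 403-408 (6 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (No. 30571059); National High Technology Research and Development Program of China ("863" Program, No. 2006AA02Z190)
Keywords: protein-protein interaction sites, support vector machine (SVM), sequence profiles, accessible surface area, evolution rate
Classification code: O024 [Science]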
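
The abstract describes building one feature vector per residue from sequence profiles, accessible surface area, and evolution rate, then evaluating with 10-fold cross-validation. The following is a minimal sketch of those two steps only, not the authors' implementation. The sliding window of 9 residues, the 20-column profile, the zero padding at chain ends, the random toy data, and the helper names `window_features` and `k_fold_indices` are all assumptions made for illustration; the abstract does not specify them. The SVM classifier itself is left out.

```python
import random


def window_features(profile, asa, rate, center, half_window=4):
    """Concatenate profile row, ASA and evolution rate for each residue in a
    window centered on `center`. Positions past the chain ends are zero-padded.
    The window size is a hypothetical choice, not taken from the paper."""
    n_cols = len(profile[0])
    vec = []
    for i in range(center - half_window, center + half_window + 1):
        if 0 <= i < len(profile):
            vec.extend(profile[i])
            vec.append(asa[i])
            vec.append(rate[i])
        else:
            vec.extend([0.0] * (n_cols + 2))
    return vec


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test


# Toy protein with 30 residues and random values (illustrative data only).
rng = random.Random(42)
n_res = 30
profile = [[rng.random() for _ in range(20)] for _ in range(n_res)]
asa = [rng.random() for _ in range(n_res)]
rate = [rng.random() for _ in range(n_res)]

X = [window_features(profile, asa, rate, c) for c in range(n_res)]
splits = list(k_fold_indices(len(X), k=10))
print(len(X[0]), len(splits))  # prints: 198 10
```

Each vector has (20 + 2) x 9 = 198 entries. In a full pipeline, the rows of `X` selected by each `train` list would be passed to an SVM trainer and the `test` rows used to measure accuracy, averaged over the ten folds.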

References (13)

  • 1. KINI R M, EVANS H J. Prediction of potential protein-protein interaction sites from amino acid sequence: identification of a fibrin polymerization site [J]. FEBS Letters, 1996, 385: 81-86.
  • 2. JONES S, THORNTON J M. Analysis of protein-protein interaction sites using surface patches [J]. Journal of Molecular Biology, 1997, 272: 132-143.
  • 3. EISENBERG D, SCHWARZ E, KOMAROMY M, WALL R. Analysis of membrane and surface protein sequences with the hydrophobic moment plot [J]. Journal of Molecular Biology, 1984, 179(1): 125-142.
  • 4. LI Minghui, LIN Lei, WANG Xiaolong, LIU Tao. Protein-protein interaction site prediction based on conditional random fields [J]. Bioinformatics, 2007, 23(5): 597-604.
  • 5. CHUNG J L, WANG Wei, BOURNE P E. High-throughput identification of interacting protein-protein binding sites [J]. BMC Bioinformatics, 2007, 8: 223.
  • 6. WANG Bing, CHEN Peng, HUANG Deshuang, LI Jingjing, TAT-MING Lok, LYU M R. Prediction protein-protein interaction sites from residue spatial sequence profile and evolution rate [J]. FEBS Letters, 2006, 580: 380-384.
  • 7. MURZIN A G, BRENNER S E, HUBBARD T, CHOTHIA C. A structural classification of proteins database for the investigation of sequence and structures [J]. Journal of Molecular Biology, 1995, 247: 536-540.
  • 8. NADERI-MANESH H, SADEGHI M, ARAB S. Prediction of protein surface accessibility with information theory [J]. Proteins, 2001, 42: 452-459.
  • 9. JONES S, THORNTON J M. Principles of protein-protein interactions [J]. Proceedings of the National Academy of Sciences, 1996, 93: 13-20.
  • 10. VAPNIK V N. Statistical learning theory [M]. John Wiley, 1995.

Co-citation documents (1)

Co-cited documents (10)

  • 1. CAMPBELL A M, HEYER L J. Discovering Genomics, Proteomics, and Bioinformatics [M]. Translated by SUN Zhirong. Beijing: Science Press, 2007.
  • 2. XUE Bin, ESHEL Faraggi, ZHOU Yaoqi. Predicting residue-residue contact maps by a two-layer, integrated neural network method [J]. Proteins, 2009, 76: 176-183.
  • 3. MILE Sikic, SANJA Tomic, KRISTIAN Vlahovicek. Prediction of protein-protein interaction sites in sequences and 3D structures by random forests [J]. PLoS Computational Biology, 2009, 5(1): 1-9.
  • 4. YAN Changhui, DRENA Dobbs, VASANT Honavar. A two stage classifier for identification of protein-protein interface residues [J]. Bioinformatics, 2004, 20(1): 371-378.
  • 5. IAKES Ezkurdia, LISA Bartoli, PIERO Fariselli, et al. Progress and challenges in predicting protein-protein interaction sites [J]. Briefings in Bioinformatics, 2009, 10(3): 233-246.
  • 6. LAN Man, CHEW Limtan, SU Jian. Feature generation and representations for protein-protein interaction classification [J]. Journal of Biomedical Informatics, 2009, 42(5): 866-872.
  • 7. HUANG Wenlin, CHUN Weitung, HUANG Huiling, et al. Predicting protein subnuclear localization using GO-amino acid composition features [J]. Biosystems, 2009, 98(2): 73-79.
  • 8. LIU Liang, CAI Yudong, LU Wencong, et al. Prediction of protein-protein interactions based on PseAA composition and hybrid feature selection [J]. Biochemical and Biophysical Research Communications, 2009, 380(2): 318-322.
  • 9. PETER Briggs. Comparison of SURFACE and AREAIMOL for accessible surface area calculations [EB/OL]. (2000-09-29) [2009-12-26]. http://www.ccp4.ac.uk/Newsletters/newsletter38/03_surfarea.html.
  • 10. EILEN Nordlie, MARC Oliver Gewaltig, HANS Ekkehard Plesser. Towards reproducible descriptions of neuronal network models [J]. PLoS Computational Biology, 2009, 8(5): 1-18.

Citing documents (1)

Second-level citing documents (1)
