期刊文献+

使用伪氨基酸模型和K近邻分类器预测酶的分类

Using pseudo-amino acids mode and K Nearest Neighbors'classifier to predict enzyme subfamily classes
下载PDF
导出
摘要 酶作为一种重要的生物催化剂在生物代谢过程中扮演着非常重要的角色。一种酶的功能与它所属的类或子类有着密切的关系。所以,不论是在基础研究的过程中还是药物发现的过程中,研究预测酶的分类方法都显得非常有用。通过采用一种基于伪氨基酸组成作为酶序列的特征向量,同时又加入了更多的氨基酸信息,来对酶进行分类。对于分类器,考虑到它是多分类问题,采用了最优证据理论-K近邻算法。实验结果证明这样做是有效的,达到83%的准确率。 Enzyme plays an important role in biological metabolism pathways as catalyzer. Furthermore, the function of an enzyme has close relationship with subfamily it belongs to. Thus, the enzyme class problem becomes useful in biology. When constructing its feature vector, the pseudo-amino acids mode is incited, combining the components of the amino acid pair and more useful biophysical features. At the same time, the nice multi-class classifier is chosen: Optimized Evidence Theory-K Nearest Neighbors (OET-KNN) to train these feature vectors. The classifying performance reaches as high as 83 %.
作者 孙晶京
出处 《计算机工程与应用》 CSCD 2013年第9期123-126,共4页 Computer Engineering and Applications
关键词 特征向量 伪氨基酸模型 最优证据理论-K近邻算法 feature vector pseudo-amino acids mode Optimized Evidence Theory-K Nearest Neighbors' (OET-KNN) algorithm
  • 相关文献

参考文献15

  • 1Webb E C.Enzyme nomenclature[J].The FASEB Journal, 1993,7.
  • 2Chou K C, Elrod D W.Prediction of enzyme family classes[J]. Proteome Res, 2003,2 : 183-190.
  • 3Chou K C.Prediction of protein cellular attributes using pseudo amino acid composition[J].PROTEINS: Sucture, Function,and Genetics, 2001,43 : 246-255.
  • 4Chou K C.Using amphiphilic pseudo amino acid composi- tion to predict enzyme subfamily elasses[J].Bioinformatics, 2005,21 : 10-19.
  • 5Huang W L,Chen H M,Hwang S F,et al.Accurate predic- tion of enzyme subfamily class using an adaptive fuzzy k-nearest neighbor method[J].Biosystems,2007,90.
  • 6Zhou Xi bin,Chen Chao,Li Zhan chao,et al.Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes[J]. Journal of Theoretical Biology,2007,248:546-551.
  • 7Bairoch A, Apweiler R.The SWISS-PROT protein sequence data bank and its supplement TrEMBL[J].Nucleic Acids Res, 2000,25 : 31-36.
  • 8Cover T M, Hart P E.Nearest neighbour pattem classification[J]. IEEE Trans on Informat Theory, 1967,13:21-27.
  • 9Denoeux T.A k-nearest neighbor classification rule based on Dempster-Shafer theory[J].IEEE Trans on Syst Man Cyber- net, 1995,25 : 804-813.
  • 10Sharer G.A mathematical theory of evidence[M].Princeton, NJ:Princeton University Press,1976.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部