使用伪氨基酸模型和K近邻分类器预测酶的分类

Using pseudo-amino acids mode and K Nearest Neighbors'classifier to predict enzyme subfamily classes

下载PDF

导出

摘要酶作为一种重要的生物催化剂在生物代谢过程中扮演着非常重要的角色。一种酶的功能与它所属的类或子类有着密切的关系。所以,不论是在基础研究的过程中还是药物发现的过程中,研究预测酶的分类方法都显得非常有用。通过采用一种基于伪氨基酸组成作为酶序列的特征向量,同时又加入了更多的氨基酸信息,来对酶进行分类。对于分类器,考虑到它是多分类问题,采用了最优证据理论-K近邻算法。实验结果证明这样做是有效的,达到83%的准确率。 Enzyme plays an important role in biological metabolism pathways as catalyzer. Furthermore, the function of an enzyme has close relationship with subfamily it belongs to. Thus, the enzyme class problem becomes useful in biology. When constructing its feature vector, the pseudo-amino acids mode is incited, combining the components of the amino acid pair and more useful biophysical features. At the same time, the nice multi-class classifier is chosen： Optimized Evidence Theory-K Nearest Neighbors （OET-KNN） to train these feature vectors. The classifying performance reaches as high as 83 %.

作者孙晶京

机构地区山西农业大学文理学院

出处《计算机工程与应用》 CSCD 2013年第9期123-126,共4页 Computer Engineering and Applications

关键词特征向量伪氨基酸模型最优证据理论-K近邻算法 feature vector pseudo-amino acids mode Optimized Evidence Theory-K Nearest Neighbors＇（OET-KNN） algorithm

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1Webb E C.Enzyme nomenclature[J].The FASEB Journal, 1993,7.
2Chou K C, Elrod D W.Prediction of enzyme family classes[J]. Proteome Res, 2003,2 : 183-190.
3Chou K C.Prediction of protein cellular attributes using pseudo amino acid composition[J].PROTEINS: Sucture, Function,and Genetics, 2001,43 : 246-255.
4Chou K C.Using amphiphilic pseudo amino acid composi- tion to predict enzyme subfamily elasses[J].Bioinformatics, 2005,21 : 10-19.
5Huang W L,Chen H M,Hwang S F,et al.Accurate predic- tion of enzyme subfamily class using an adaptive fuzzy k-nearest neighbor method[J].Biosystems,2007,90.
6Zhou Xi bin,Chen Chao,Li Zhan chao,et al.Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes[J]. Journal of Theoretical Biology,2007,248:546-551.
7Bairoch A, Apweiler R.The SWISS-PROT protein sequence data bank and its supplement TrEMBL[J].Nucleic Acids Res, 2000,25 : 31-36.
8Cover T M, Hart P E.Nearest neighbour pattem classification[J]. IEEE Trans on Informat Theory, 1967,13:21-27.
9Denoeux T.A k-nearest neighbor classification rule based on Dempster-Shafer theory[J].IEEE Trans on Syst Man Cyber- net, 1995,25 : 804-813.
10Sharer G.A mathematical theory of evidence[M].Princeton, NJ:Princeton University Press,1976.

1NPD In-Stat公司称，2015年视频监控半导体销售收入将接近35亿美元[J].A&S（安全&自动化）,2012(3):34-34.
2张文举,陈曙东,刘了,马范援,沈建华.药物发现网格设计与实现[J].计算机工程,2006,32(11):259-261. 被引量：2
3冬至（编译）.浏览药物发现数据[J].生物技术世界,2008(1):45-47.
4曹薇.基于粒子群的供应链认知图优化方法研究[J].微型电脑应用,2008,24(12):10-12.
5黎琳,赵英.基于内容的图像检索反馈技术概述[J].图书情报工作,2006,50(11):95-98. 被引量：3
6夏茂林.与酶有关的坐标曲线图解读[J].新高考（理化生）,2009(10):51-52.
7戴曰梅.核磁共振技术在生命科学领域的应用[J].兵工自动化,2013,32(4):84-88. 被引量：4
8曹凯.解读德国的“大数据”——智慧数据[J].计算机与网络,2014,40(21):6-7.
9聚焦RFID——无论是在优化生产还是物流供给以及质量管理等方面,RFID都为用户提供了一种可能的优化方案[J].国内外机电一体化技术,2007,10(6):56-57.
10沈国红.机器人做城市设计,还远吗?[J].交通与运输,2017,0(2):1-3.

计算机工程与应用

2013年第9期

浏览历史

内容加载中请稍等...

使用伪氨基酸模型和K近邻分类器预测酶的分类

参考文献15

相关作者

相关机构

相关主题

浏览历史