期刊文献+

支持向量机及组合预测在蛋白质四级结构分类中的应用

Classification of Protein Quaternary Structure based on Support Vector Machines and Combinatorial Forecast
下载PDF
导出
摘要 目的:基于支持向量机建立一个自动化识别新肽链四级结构的方法,提高现有方法的识别精度。方法:改进4种已有的蛋白质一级序列特征值提取方法,采用线性和非线性组合预测方法建立一个有效的组合预测模型。结果:以同源二聚体及非同源二聚体为例,对4种特征值提取方法进行改进后其分类精度均提升了2~3%;进一步实施线性与非线性组合预测后,其分类精度再次提高了2~3%,使独立测试集的分类精度达到了90%以上。结论:4种特征值提取方法均较好地反应出蛋白质一级序列包含四级结构信息,组合预测方法能有效地集多种特征值提取方法优势于一体。 Objective: To establish a method of automatically identifying protein structures based on support vector machine for improving the present classification accuracies. Methods: The former four methods of feature extraction from the amino acid sequences were improved, and then an effective combinatorial forecast model was established based on linear and non-linear method. Results: The classification precision of the four improved models has increased by 2-3 % over before.Then,combinatorial forecast was further introduced, and the classification precision has increased by 2-3 % again.Finally,the precision of independent testing set exceeded 90 %. Conclusion: The results indicate that protein primary sequence contains quaternary structure information. And the combinatorial forecast method can effectively integrate with several kinds of methods of feature value extraction in the primary sequences.
出处 《现代生物医学进展》 CAS 2008年第4期646-648,637,共4页 Progress in Modern Biomedicine
基金 国家自然科学基金(No.30570351) 教育部新世纪优秀人才支持计划(NCET-06-0710)
关键词 蛋白质四级结构 分类 支持向量机 组合预测 Protein quaternary structure Classification Support vector machines Combinatorial forecast
  • 相关文献

参考文献8

二级参考文献158

  • 1李程雄,丁月华,文贵华.SVM-KNN组合改进算法在专利文本分类中的应用[J].计算机工程与应用,2006,42(20):193-195. 被引量:22
  • 2刘红艳,覃礼堂,易忠胜,刘树深.黄酮类醛糖还原酶抑制剂的三维定量构效关系研究[J].现代生物医学进展,2006,6(12):13-16. 被引量:5
  • 3阎隆飞 孙之荣.蛋白质分子结构[M].北京:清华大学出版社,2000..
  • 4Anfinsen CB, Haber E, Sela M, et al. The kinetics of formation of native ribonuclease during oxidation of the reduced polypepfide chain[J]. Proc Nail Acad Sci USA, 1961,47: 1309-1314.
  • 5Klotz IM, Darnall DW, Langerman NR. The protein, 3rd edition[M]. New York: Academic Press, 1975,1:293--411.
  • 6Price NC. Assembly of multi-subtmit structure[M]. New York:Oxford University Press, 1994.
  • 7Robert G. Prediction of quaternary structure from primary structure[J]. Biolnformatics, 2001,17:551-556.
  • 8Vapnik V. The nature of statistical loaming theory[M]. NewYork: Springer, 1995.
  • 9Vapnik V. Statistical learning theory[M]. New York: Wiely,1998.
  • 10Brown M, Grundy W, Lin D, et al. Knowledge-based analysis of microarray gene expression data by using support vector machines[J]. Proc Nail Acad Sci USA, 2000,97:262-267.

共引文献76

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部