期刊文献+

基于GEP-BP网络集成的蛋白质二级结构预测方法研究

Study of protein secondary structure prediction methods based on GEP-BP network ensemble
下载PDF
导出
摘要 为提高蛋白质二级结构预测的精度,提出了一种基于GEP-BP网络集成的两层结构预测模型。首先利用基因表达式编程(GEP)的全局搜索能力同时进化设计BP网络的结构和连接权,并将进化最后一代的个体用BP算法进一步训练学习,然后采用组合方法将部分个体集成构成模型的第一层;根据神经网络输出之间具有相关性,用第二层网络对第一层的预测结果进行精炼。用PDBSelect25中的36条蛋白质共6 122个残基进行测试,结果表明提出的模型能有效预测蛋白质二级结构,将预测精度提高到73.02%。 In order to improve the prediction accuracy of protein secondary structure, this paper presented a new prediction model composed of two-level network based on GEP-BP network ensemble. Firstly, evolved simultaneously the structure and connection weights of BP network were by using global research ability of GEP, then trained fatherly all the individuals of last generation by BP algorithm and formed the first-level through a combination method to ensemble part of individuals. Secondly, according to the dependency of neighboring neural network output, refined the results of the first-level by the second-level net- work. Employed the model to predict 36 nonhomologous protein sequences with 6122 residues in PDBSeleet25. The results show that the proposed model can efficiently improve the prediction accuracy, increasing prediction accuracy to 73.02%.
作者 王艳春
出处 《计算机应用研究》 CSCD 北大核心 2009年第10期3687-3689,3693,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(30471138)
关键词 蛋白质 二级结构 基因表达式编程 神经网络集成 protein secondary structure prediction gene expression programming neural network ensemble
  • 相关文献

参考文献11

  • 1邹承鲁.第二遗传密码[M].长沙:湖南科学技术出版社,1997.
  • 2石峰,莫忠息,张楚瑜.隐马尔可夫模型—改进的预测蛋白质二级结构方法[J].生物数学学报,2004,19(2):233-237. 被引量:9
  • 3HUA Su-jun, SUN Zhi-rong. A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach [J].Journal of Molecular Biology, 2001,308 (2) :397-407.
  • 4ZHANG Guan-zheng,HUANG D S,ZHU Y P,et al. Improving protein secondary structure prediction by using the residue conformational classes[J]. Pattern Recognition Letters,2005,26:2346-2352.
  • 5王崇骏,于汶滌,陈兆乾,谢俊元.一种基于遗传算法的BP神经网络算法及其应用[J].南京大学学报(自然科学版),2003,39(5):459-466. 被引量:60
  • 6SOLLICH P, KROGH A. Learning with ensembles., how over-fitting can be useful [ C ]//Proc of Advanced in Neural Information Processing Systems. Cambridge, MA : MIT Press, 1996 : 190-196.
  • 7HANSEN L K, SALAMON P. Neural network ensembles [ J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1990,12 (10) :993-1001.
  • 8QIAN Ning, SEJNOWSKI T J. Predicting the secondary structure of globular proteins using neural network modals[ J]. Journal of Molecular Biology, 1988,202:865- 884.
  • 9ZHOU Z H,WU J,TANG W. Ensembling neural networks:many could be better than all [ J ]. Artificial Intelligence, 2002,137 ( 1 - 2 ) : 239- 263.
  • 10HUANG Xin, HUANG De-shuang,ZHANG Guang-zheng,et al. Prediction of protein secondary structure using improved two-level neural network architecture [ J ]. Protein & Peptide Letter, 2005,12 ( 8 ) : 805-811.

二级参考文献20

  • 1Belew R K, Booker L B. Proeeeedings of the Fourth international Conference on Genetic Algorithms. San Mateo, CA: Morgan Kaufmann Publishers, Inc, 1991.
  • 2Schaffer J D. Procceedings of the Third International Conference on Genetic Algorithms. San Mateo,CA: Morgan kaufmann Publishers, Inc, 1989.
  • 3Zhou Z H, Chen S F, Chen Z Q. A statistics based approach for extracting priority rules from trained neural networks. Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. Italy: Como, 2000, 3: 401--406.
  • 4Judd J S. Learning in networks is hard. Proceedings of the 1st IEEE International Conference on Neural Networks, 1987, 2: 685--692.
  • 5Hornik K M, Stinchcombe M, White H. Multilayer feedforward networks are universal approximators.Neural Networks, 1989, 2(2): 359--366.
  • 6Lang K J, Waibel A H, Hinton G E. A time-delay network architecture for isolated word recognition.Neural Networks, 1990, 3(1): 23--44.
  • 7Lippmann R P. Pattern classification using neural networks. IEEE Communication Magzine, 1989, 27(11): 47--64.
  • 8Huang W M, Lippmann R P. Neural net and traditional classifiers. Anderson D. Neural Information Processing Systems. New York: American Institution of Physics, 1988:387--339.
  • 9Baum E B, Haussler D. What size net gives valid generalization? Nenral Computation, 1989, 1(1): 151-160.
  • 10Nick Goldman, Salzberg Searl. Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses[J]. J Mol Biol, 1996, 263(1): 196-208.

共引文献69

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部