基于GEP-BP网络集成的蛋白质二级结构预测方法研究

Study of protein secondary structure prediction methods based on GEP-BP network ensemble

下载PDF

导出

摘要为提高蛋白质二级结构预测的精度,提出了一种基于GEP-BP网络集成的两层结构预测模型。首先利用基因表达式编程(GEP)的全局搜索能力同时进化设计BP网络的结构和连接权,并将进化最后一代的个体用BP算法进一步训练学习,然后采用组合方法将部分个体集成构成模型的第一层;根据神经网络输出之间具有相关性,用第二层网络对第一层的预测结果进行精炼。用PDBSelect25中的36条蛋白质共6 122个残基进行测试,结果表明提出的模型能有效预测蛋白质二级结构,将预测精度提高到73.02%。 In order to improve the prediction accuracy of protein secondary structure, this paper presented a new prediction model composed of two-level network based on GEP-BP network ensemble. Firstly, evolved simultaneously the structure and connection weights of BP network were by using global research ability of GEP, then trained fatherly all the individuals of last generation by BP algorithm and formed the first-level through a combination method to ensemble part of individuals. Secondly, according to the dependency of neighboring neural network output, refined the results of the first-level by the second-level net- work. Employed the model to predict 36 nonhomologous protein sequences with 6122 residues in PDBSeleet25. The results show that the proposed model can efficiently improve the prediction accuracy, increasing prediction accuracy to 73.02%.

作者王艳春

机构地区青岛农业大学理学与信息学院西北农林科技大学机电工程学院

出处《计算机应用研究》 CSCD 北大核心 2009年第10期3687-3689,3693,共4页 Application Research of Computers

基金国家自然科学基金资助项目(30471138)

关键词蛋白质二级结构基因表达式编程神经网络集成 protein secondary structure prediction gene expression programming neural network ensemble

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献11

1邹承鲁．第二遗传密码[M]．长沙：湖南科学技术出版社,1997.
2石峰,莫忠息,张楚瑜.隐马尔可夫模型—改进的预测蛋白质二级结构方法[J].生物数学学报,2004,19(2):233-237. 被引量：9
3HUA Su-jun, SUN Zhi-rong. A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach [J].Journal of Molecular Biology, 2001,308 (2) :397-407.
4ZHANG Guan-zheng,HUANG D S,ZHU Y P,et al. Improving protein secondary structure prediction by using the residue conformational classes[J]. Pattern Recognition Letters,2005,26:2346-2352.
5王崇骏,于汶滌,陈兆乾,谢俊元.一种基于遗传算法的BP神经网络算法及其应用[J].南京大学学报（自然科学版）,2003,39(5):459-466. 被引量：60
6SOLLICH P, KROGH A. Learning with ensembles., how over-fitting can be useful [ C ]//Proc of Advanced in Neural Information Processing Systems. Cambridge, MA : MIT Press, 1996 : 190-196.
7HANSEN L K, SALAMON P. Neural network ensembles [ J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1990,12 (10) :993-1001.
8QIAN Ning, SEJNOWSKI T J. Predicting the secondary structure of globular proteins using neural network modals[ J]. Journal of Molecular Biology, 1988,202:865- 884.
9ZHOU Z H,WU J,TANG W. Ensembling neural networks:many could be better than all [ J ]. Artificial Intelligence, 2002,137 ( 1 - 2 ) : 239- 263.
10HUANG Xin, HUANG De-shuang,ZHANG Guang-zheng,et al. Prediction of protein secondary structure using improved two-level neural network architecture [ J ]. Protein & Peptide Letter, 2005,12 ( 8 ) : 805-811.

二级参考文献20

1Belew R K, Booker L B. Proeeeedings of the Fourth international Conference on Genetic Algorithms. San Mateo, CA: Morgan Kaufmann Publishers, Inc, 1991.
2Schaffer J D. Procceedings of the Third International Conference on Genetic Algorithms. San Mateo,CA: Morgan kaufmann Publishers, Inc, 1989.
3Zhou Z H, Chen S F, Chen Z Q. A statistics based approach for extracting priority rules from trained neural networks. Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. Italy: Como, 2000, 3: 401--406.
4Judd J S. Learning in networks is hard. Proceedings of the 1st IEEE International Conference on Neural Networks, 1987, 2: 685--692.
5Hornik K M, Stinchcombe M, White H. Multilayer feedforward networks are universal approximators.Neural Networks, 1989, 2(2): 359--366.
6Lang K J, Waibel A H, Hinton G E. A time-delay network architecture for isolated word recognition.Neural Networks, 1990, 3(1): 23--44.
7Lippmann R P. Pattern classification using neural networks. IEEE Communication Magzine, 1989, 27(11): 47--64.
8Huang W M, Lippmann R P. Neural net and traditional classifiers. Anderson D. Neural Information Processing Systems. New York: American Institution of Physics, 1988:387--339.
9Baum E B, Haussler D. What size net gives valid generalization? Nenral Computation, 1989, 1(1): 151-160.
10Nick Goldman, Salzberg Searl. Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses[J]. J Mol Biol, 1996, 263(1): 196-208.

共引文献69

1赵鹏程,王致杰,孟江,王耀才.基于免疫神经网络模型的油气浓度预测研究[J].自动化博览,2005,22(3):69-71.
2范睿,李国斌,景韶光.基于实数编码遗传算法的混合神经网络算法[J].计算机仿真,2006,23(1):161-164. 被引量：26
3秦勇,张永国,赵鹏程.基于免疫神经网络模型的油气浓度预测研究[J].自动化与仪表,2006,21(3):53-56. 被引量：2
4史晓红,王燕,刘文斌,殷志祥.现代优化计算方法在蛋白质结构预测中的应用[J].数学的实践与认识,2006,36(10):86-92. 被引量：4
5王其军,程久龙.瓦斯传感器的故障模式与诊断方法研究[J].煤炭科学技术,2006,34(11):34-36. 被引量：12
6陈希,王瑞.基于遗传神经网络优化调节滴灌施肥液pH值[J].江苏农业学报,2006,22(4):460-464.
7商杰,朱战立.基于遗传算法的神经网络在预测油管钢腐蚀速率中的应用[J].腐蚀科学与防护技术,2007,19(3):225-228. 被引量：6
8商杰,刘明.BP算法的改进及在模式识别中的应用[J].合肥学院学报（自然科学版）,2007,17(3):15-18.
9杜世平.多观测序列HMM2的Baum-Welch算法[J].生物数学学报,2007,22(4):685-690. 被引量：8
10张福平,李堂军,杨磊.基于遗传算法优化模糊Petri网的掘进工作面瓦斯爆炸事故致因分析[J].矿业安全与环保,2008,35(2):56-58.

1孙懿娟.BGP/MPLS IP VPN的安全性分析[J].山东通信技术,2004,24(4):13-17.
2卢雷,万建成,鹿旭东,郭小涛.基于Web应用特点的界面组成及交互模型[J].计算机工程与设计,2006,27(23):4551-4555. 被引量：5
3黄弢,史铁林,何岭松.基于Web的分布式设备监测诊断系统[J].机电一体化,1999,5(6):34-37. 被引量：4
4赵晓曦,屈尔庆,周志栋.Web技术在分布式设备监测诊断系统中的应用[J].数字技术与应用,2011,29(6):184-184. 被引量：1
5庄强,张溪.分析基于神经网络知识库的多神经网络集成方法[J].考试周刊,2013(84):128-129.
6郭爱波.ARP协议与第二层网络安全[J].成铁科技,2015,0(3):46-46.
7张立新,杜刚.物联网与嵌入式技术研究[J].软件导刊,2015,14(2):16-17. 被引量：4
8朱昊,乔振民.局域网实现VLAN分析[J].石家庄职业技术学院学报,2007,19(2):48-50.
9徐延宁,孟祥旭,吕琳.基于知识的参数化设计层次模型[J].计算机辅助设计与图形学学报,2004,16(10):1430-1436. 被引量：13
10杨辉,王毅.物联网与嵌入式系统的关系研究[J].计算机与现代化,2011(8):126-129. 被引量：9

计算机应用研究

2009年第10期

浏览历史

内容加载中请稍等...

基于GEP-BP网络集成的蛋白质二级结构预测方法研究

参考文献11

二级参考文献20

共引文献69

相关作者

相关机构

相关主题

浏览历史