
Study of Protein Subcellular Localization Based on Convolutional-LSTM

Cited by: 2
Abstract Predicting the subcellular location of proteins is a key problem in proteomics and bioinformatics. A protein's subcellular localization determines its biological function, so studying localization is important for understanding protein function. Because proteins are sequential in structure, this paper applies sequence models to the task. Two models, a convolutional neural network (CNN) and a long short-term memory (LSTM) network, are first used to mine the information contained in amino acid sequences and predict subcellular location. An integrated Convolutional-LSTM model is then constructed: the CNN extracts features from the protein sequence data, the features are combined and fed into the LSTM for representation learning, and the subcellular localization result is obtained. The model achieves a classification accuracy of 0.8165, a clear improvement over traditional methods.
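The pipeline described in the abstract (convolutional feature extraction over the amino acid sequence, then an LSTM over the resulting feature sequence, then classification) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the abstract gives no architecture details, so the one-hot encoding, the filter count, kernel width, hidden size, number of classes, the omission of bias terms and of the feature-combination step, and all function names below are assumptions made for illustration. The weights are random and untrained, so the output probabilities carry no biological meaning.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

def one_hot(seq):
    """Encode each residue as a 20-dim one-hot vector (unknown residues become all zeros)."""
    return [[1.0 if aa == a else 0.0 for a in AMINO_ACIDS] for aa in seq]

def rand_matrix(rows, cols, rng):
    """Small random weights; a stand-in for trained parameters (assumption)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def conv1d_relu(x, kernels, width):
    """Valid 1D convolution along the sequence axis, followed by ReLU.
    Each kernel is a flat weight vector of length width * input_dim."""
    out = []
    for t in range(len(x) - width + 1):
        window = [v for step in x[t:t + width] for v in step]
        out.append([max(0.0, sum(w * v for w, v in zip(k, window))) for k in kernels])
    return out

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_last_hidden(xs, W, hidden):
    """Single-layer LSTM without biases; W maps [x; h] to the stacked gates (i, f, o, g).
    Returns the hidden state after the final time step."""
    h = [0.0] * hidden
    c = [0.0] * hidden
    for x in xs:
        z = matvec(W, x + h)
        i = [sigmoid(v) for v in z[:hidden]]
        f = [sigmoid(v) for v in z[hidden:2 * hidden]]
        o = [sigmoid(v) for v in z[2 * hidden:3 * hidden]]
        g = [math.tanh(v) for v in z[3 * hidden:]]
        c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
    return h

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def predict(seq, n_classes=4, n_filters=8, width=3, hidden=6, seed=0):
    """CNN features -> LSTM -> softmax over hypothetical location classes."""
    rng = random.Random(seed)
    kernels = rand_matrix(n_filters, width * len(AMINO_ACIDS), rng)
    w_lstm = rand_matrix(4 * hidden, n_filters + hidden, rng)
    w_out = rand_matrix(n_classes, hidden, rng)
    feats = conv1d_relu(one_hot(seq), kernels, width)
    h = lstm_last_hidden(feats, w_lstm, hidden)
    return softmax(matvec(w_out, h))

# Hypothetical example sequence, not taken from the paper's dataset.
probs = predict("MKTAYIAKQRQISFVKSHFSRQ")
print(probs)
```

In practice such a model would be built and trained with a deep learning framework; the plain-Python version is used here only to make the data flow between the convolutional and recurrent stages explicit.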
Authors WANG Chunyu (王春宇); XU Shanshan (徐珊珊); GUO Maozu (郭茂祖); CHE Kai (车凯); LIU Xiaoyan (刘晓燕) (School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China; School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China)
Source Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), CSCD, Peking University Core Journals, 2019, No. 6, pp. 982-989 (8 pages)
Funding National Natural Science Foundation of China, Nos. 91735306, 61671189, 61571163, 61532014; National Key Research and Development Program of China, No. 2016YFC0901902
Keywords protein subcellular location; convolutional neural network (CNN); long short-term memory (LSTM) network; classification
