Prediction of protein subcellular location using optimal cleavage site

Cited by: 2


Abstract: Knowing a protein's subcellular location helps in understanding its function and its interactions with other proteins, and can support the development of new drugs. Current prediction methods are mainly based on N-terminal sorting signals or on amino acid composition. However, studies have shown that methods based solely on either of these lose the sequence order information. To overcome this shortcoming, this paper proposes a prediction method based on the optimal cleavage site. First, each protein sequence is divided into three parts: N-terminal, middle, and C-terminal. Then amino acid composition, dipeptide composition, and physicochemical properties are extracted independently from each part and from the whole sequence. Finally, these features are combined and used as the features of the whole sequence. In a jackknife test on the NNPSL dataset, the method achieves overall accuracies of 87.8% for prokaryotic proteins and 92.1% for eukaryotic proteins.
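The segmentation and feature fusion steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The cut lengths `n_len` and `c_len` are hypothetical parameters (the abstract does not say how the optimal cleavage site is chosen), all function names are invented for illustration, and the physicochemical features are left out because the abstract does not specify them.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs


def split_sequence(seq, n_len, c_len):
    """Split seq into N-terminal, middle, and C-terminal parts.

    n_len and c_len are hypothetical cut lengths, not values from the paper.
    """
    if n_len < 0 or c_len < 0 or n_len + c_len > len(seq):
        raise ValueError("cut lengths do not fit the sequence")
    c_start = len(seq) - c_len
    return seq[:n_len], seq[n_len:c_start], seq[c_start:]


def aa_composition(seq):
    """Fraction of each standard amino acid in seq (20 values)."""
    total = len(seq)
    return [seq.count(a) / total if total else 0.0 for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Fraction of each adjacent residue pair in seq (400 values)."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    return [pairs.count(d) / total if total else 0.0 for d in DIPEPTIDES]


def fused_features(seq, n_len, c_len):
    """Concatenate features of the three parts and of the whole sequence."""
    parts = list(split_sequence(seq, n_len, c_len)) + [seq]
    features = []
    for part in parts:
        features.extend(aa_composition(part))
        features.extend(dipeptide_composition(part))
    return features


if __name__ == "__main__":
    vector = fused_features("MKTAYIAKQR", n_len=3, c_len=3)
    print(len(vector))  # 4 parts x (20 + 400) features each
```

Under these assumptions each sequence maps to a fixed-length vector regardless of its length, which is what allows the fused features to be passed to a standard classifier. Residues outside the 20 standard letters are counted in the length but not in any feature bin.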

Authors: Wang Wei (王伟), Zheng Xiaoqi (郑小琪), Dou Yongchao (窦永超), Liu Taigang (刘太岗), Zhao Juan (赵娟), Wang Jun (王军)

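Affiliations: College of Mathematics and Science, Shanghai Normal University; School of Mathematical Sciences, Dalian University of Technology; College of Information Science and Engineering, Shandong Agricultural University; Scientific Computing Key Laboratory of Shanghai Universities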

Source: Chinese Journal of Bioinformatics (《生物信息学》), 2011, Issue 2, pp. 171-175, 180 (6 pages in total)

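Funding: National Natural Science Foundation of China (No. 10731040); Shanghai Key Science Project (No. S30405); Innovation Project of the Shanghai Education Department (No. 09zz134)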

Keywords: protein sequence; subcellular location; jackknife test; overall accuracy; optimal cleavage site; combined feature (feature fusion)

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]


