一种新的蛋白质亚细胞定位预测方法被引量：1

Novel approach to prediction of protein subcellular localization

下载PDF

导出

摘要蛋白质亚细胞定位是蛋白质组学基本问题之一。某些类型蛋白质可能存在于两个或两个以上的亚细胞位置,这类蛋白质的亚细胞定位问题更为复杂。分别利用Gene Ontology和伪氨基酸成分法,将一条蛋白质表示为一实值向量;采纳多标记学习中的Ranking思想,计算出一得分向量V,该向量的每一分量的值表示被预测蛋白质属于某个亚细胞位置的概率;利用最近邻算法预测蛋白质所属亚细胞位置的个数n,得分向量V中得分最高的n个分量对应的亚细胞位置即为预测的位置。 A It is one of basic problems of proteomics to identify the subcellular locations of a protein. It makes the problem more complicated that some proteins may simultaneously exist in two or more than two subcellular locations. Gene Ontology and pseudo amino acid composition are respectively employed to represent a protein as a real values vector. The idea of Ranking initiating from multi-label learning community is adopted to compute a score vector V, each component value of which indicates the probability that a protein of the corresponding subcellular location.The nearest neighbor algorithm is then employed to predict the number n of subcellular localization of human proteins. Finally, the n subcellular locations correspondin~ to the too n scores components in Vare assign to the ouerv nrotein.

作者程昔恩吴志诚

机构地区景德镇陶瓷学院信息工程学院

出处《计算机工程与应用》 CSCD 2012年第6期126-128,共3页 Computer Engineering and Applications

基金国家自然科学基金(No.60961003) 江西省自然科学基金(No.2010GQS0127)

关键词蛋白质亚细胞定位多标记学习 GENE ONTOLOGY 最近邻算法 protein subcellular localization multi-label learning Gene Ontology k-nearest neighbors algorithm

分类号 TP392 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1Chou K C, Shen H B.Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization,Biochem[J].Biophys Res Cornmun,2006,347:150-157.
2Chou K C.Prediction of protein cellular attributes using pseudo amino acid composition[J].Proteins : Struct Funct Genet, 2001,43 : 246-255.
3Chou K C, Cai Y D.Using functional domain composition and support vector machines for prediction of protein subcellular location[J].Journal of Biol Chem,2002,277:45765-45769.
4Park K J, Kanehisa M.Prediction of protein subcellular locations by support vector machines using compositions of amino acid and amino acid pairs[J].Bioinformatics,2003,19:1656-1663.
5Zhou G P,Doctor K.Subcellular location prediction of apoptosis proteins[J].Proteins: Struct Funct Genet, 2003,50: 44-48.
6Garg A, Bhasin M, Raghava G P.Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order, and similarity search[J].Joumal Biol Chem, 2005,280 : 14427-14432.
7Shen H B,Chou K C.Hum-mPLoc:an ensemble classifier for large- scale human protein subcellular location prediction by incorporating samples with multiple sites[J].Biochem Biophys Res Commun,2007,355 : 1006-1011.
8Ashburner M.Gene ontology: tool for the unification of biology[J]. Nat Genet,2000,25:25-29.
9Shen H B, Chou K C.A top-down approach to enhance the power of predicting human protein subcellular loealization:Hum-mPLoc 2.0[J].Analytical Biochemistry,2009,394(2) :269-274.
10Schaffer A A.Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements[J].Nucleic Acids Res,2001,29:2994-3005.

同被引文献12

1孙豫峰.基于概率神经网络的蛋白质亚细胞定位[J].太原师范学院学报（自然科学版）,2005,4(2):23-25. 被引量：2
2马翔,王明会,李骜,谢丹,冯焕清.基于加权模糊k近邻方法的蛋白质亚细胞位点预测[J].中国生物医学工程学报,2006,25(1):106-109. 被引量：5
3张振慧,王正华,王勇献.利用分组重量编码预测细胞凋亡蛋白的亚细胞定位[J].生物物理学报,2006,22(4):275-282. 被引量：5
4杨会芳,程咏梅,张绍武,潘泉.基于分段伪氨基酸组成成分特征提取方法预测蛋白质亚细胞定位[J].生物物理学报,2008,24(3):232-238. 被引量：5
5张树波,赖剑煌,何建国.一种基于最优局部信息融合的蛋白质亚细胞定位预测方法[J].中山大学学报（自然科学版）,2008,47(6):16-21. 被引量：3
6张树波,赖剑煌.蛋白质亚细胞定位预测的机器学习方法[J].计算机科学,2009,36(4):29-33. 被引量：7
7魏蓉,赵艳君,张同亮,顾全.使用伪氨基酸和集成分类器预测凋谢蛋白亚细胞定位[J].计算机与应用化学,2009,26(7):921-924. 被引量：2
8赵禹,赵巨东,姚龙.用离散增量结合支持向量机方法预测蛋白质亚细胞定位[J].生物信息学,2010,8(3):237-239. 被引量：4
9李立奇,张瑗,周跃,王开发.KNN法在含纤连蛋白域蛋白质亚细胞定位中的应用[J].山东医药,2011,51(2):20-21. 被引量：2
10SONG Chaohong,SHI Feng.Prediction of Protein Subcellular Localization Based on Hilbert-Huang Transform[J].Wuhan University Journal of Natural Sciences,2012,17(1):48-54. 被引量：1

引证文献1

1吴泽月,陈月辉.蛋白质亚细胞定位预测研究进展[J].山东师范大学学报（自然科学版）,2012,27(4):33-37. 被引量：6

二级引证文献6

1郑珊珊,石卓兴,代琦,姚玉华.蛋白质亚细胞定位预测研究进展[J].科技视界,2014(12):12-12.
2岳英伟,王鑫,杜淼,马文芝,郭宏.牛MARK2、CREB5基因的克隆和生物信息学分析[J].中国畜牧兽医,2016,43(2):311-318. 被引量：1
3靳聪飞,刘新峰,王婷,杨淑萍,郭宏.牛ARRDC3和ARRDC4基因的克隆和生物信息学分析[J].畜牧与兽医,2016,48(4):39-45.
4叶静,陈伟,金殿川.基于不同物种的热休克蛋白90的生物信息学分析[J].生物信息学,2016,14(3):134-138. 被引量：2
5靳聪飞,梁婷玉,刘新峰,郭宏.牛GUCY1A3和SFXN1基因的克隆及生物信息学分析[J].中国畜牧兽医,2017,44(2):357-364. 被引量：5
6靳聪飞,张瑞,刘新峰,郭宏.牛TNS1基因的克隆和生物信息学分析[J].黑龙江畜牧兽医,2017(8):98-102. 被引量：2

1马军伟,史舵,顾宏,张杰.PCA方法在蛋白质亚细胞定位中应用[J].大连理工大学学报,2012,52(3):426-430. 被引量：1
2孙晶京.基于GO的蛋白质亚细胞定位方法研究[J].农业网络信息,2012(11):21-23. 被引量：1
3冯馨.一种基于改进型伪氨基酸的蛋白质亚细胞定位算法[J].信息与电脑（理论版）,2014,0(11):94-95.
4宋杰.蛋白质亚细胞定位预测的最近邻算法[J].计算机应用研究,2007,24(11):30-31. 被引量：1
5王彤,薛建新,谭文安.利用半监督降维算法预测蛋白质亚细胞位置[J].上海第二工业大学学报,2015,32(3):260-265. 被引量：2
6陈胜荣,董守斌.基于优选链接的中文网页分类方法研究[J].郑州大学学报（理学版）,2007,39(2):78-82. 被引量：3
7张树波,赖剑煌,何建国.一种基于最优局部信息融合的蛋白质亚细胞定位预测方法[J].中山大学学报（自然科学版）,2008,47(6):16-21. 被引量：3
8马军伟,高新中,张杰.蛋白质亚细胞定位预测中的序列编码技术研究[J].计算机科学,2012,39(S3):283-287. 被引量：1
9曹隽喆,顾宏,贺建军.一种新的蛋白质亚细胞定位预测训练集构造方法[J].大连理工大学学报,2012,52(6):884-889. 被引量：2
10孙豫峰.基于概率神经网络的蛋白质亚细胞定位[J].太原师范学院学报（自然科学版）,2005,4(2):23-25. 被引量：2

计算机工程与应用

2012年第6期

浏览历史

内容加载中请稍等...

一种新的蛋白质亚细胞定位预测方法被引量：1

参考文献11

同被引文献12

引证文献1

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种新的蛋白质亚细胞定位预测方法 被引量：1

参考文献11

同被引文献12

引证文献1

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种新的蛋白质亚细胞定位预测方法被引量：1