期刊文献+

基于PCA和ICA的糖基化位点的预测和分析 被引量:1

Pattern analysis and prediction of O-linked glycosylation sites based on PCA and ICA
原文传递
导出
摘要 为了提高糖基化位点的识别率,提出主成分分析(PCA)和独立成分分析(ICA)相结合的新方法对O-糖基化位点进行预测和分析。以窗口长度为51的蛋白质序列为研究对象,采用稀疏编码方案,首先利用PCA算法对蛋白质序列进行去相关预处理,以降低原始蛋白质序列的维数。然后利用ICA算法进行训练,提取特征向量构建子空间。测试序列投影到每一类子空间,计算测试序列和每类子空间重构序列的距离,根据距离大小确定所属的类。实验表明,提出的新方法有较高的预测性能。 To improve prediction accuracy of glycosylation site.A new method is proposed based on principal component analysis(PCA) and independent component analysis(ICA) for prediction O-linked glycosylation site and pattern analysis.Sparse coding scheme of protein sequence is applied when the window size is 51 in this research.PCA is firstly used to reduce dimension and second order correlation.Then ICA is used to extract independent components to construct a subspace(main basis) of protein sequence by training.The test protein sequence is projected on every subspace.By calculating the distance between the test protein vector and the reconstruction vector of every subspace,the test protein sequence is classified into the nearest class.The experimental results show that the proposed new approach is superior to PCA subspace method.
出处 《计算机与应用化学》 CAS CSCD 北大核心 2011年第5期565-568,共4页 Computers and Applied Chemistry
基金 中南林业科技大学青年基金项目(101-0041) 湖南省教育厅青年基金项目(06C902)
关键词 糖基化位点 主成分分析 独立成分分析 位置概率函数 glycosylation site principal component analysis independent component analysis positional probability functions
  • 相关文献

参考文献13

  • 1Seta D G. N protein glyosylation and diseases:blood and urinary oligosaccharides as markers for diagnosis and therapeutic monitoring. Clin Chem, 2000, 46:795-805.
  • 2Hart G W. Current opinion in cell biology. Glycosylation, 1992, 4:1017-1023.
  • 3Julentus K, Molgaard A and Gupta R. Prediction, conservation analysis and atructural characterization of mammalian mucin-type O-glycosylation sites. Glycobiology, 2004, 15:153 - 164.
  • 4Witlson I B H, Gavel Y and Heijne G. Amino acid distributions around O-linked glycosylation sites. Biochem, 1991, 275: 529-534.
  • 5Elhammer A P, Poorman R A and Brown E. The specificity of UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase as inferred from a database of in vivo substrates and from the in Vitro glycosylation of proteins and peptides. Biochem, 1993, 268:10029-10038.
  • 6Li S, Liu B and Zeng R. Predicting O-glycosylation sites in mammalian proteins by using SVMs. Computational Biology and Chemistry, 2006, 30:203-208.
  • 7Nishikawa I, Sakamoto H and Nouno I. Prediction of the O-glycosylation sites in protein by layered neural networks and support vector machines. Lecture Notes in Artificial Intelligence, 2006, 4252:953-960.
  • 8Chen Y W, Yang X and Ito M. Pattern analysis and prediction of O-linked glycolsylated sites in protein by principal component subspace analysis, Lecture Notes in Artificial Intelligence, 2007, 4693:326-334.
  • 9Protein Datebase [EB/OL]. http://www.uniprot, org.
  • 10Jolliffe I T. Principal Component Analysis. New York: Springer- Verlag, 1996.

二级参考文献10

  • 1HART G W. Glycosylation[ J]. Current Opinion in Cell Biology, 1992, 4:1017 - 1023.
  • 2WILSON I B H, GAVEL Y, HEUNE G. Amino Acid Distributions around O-linked Glycosylation Sites[J]. Biochem. , 1991, 275 : 529 - 534.
  • 3ELHAMMER A P, POORMAN R A, BROWN E, et al. The Specificity of UDP-Gal NAc : Polypeptide N-Acetylgalactosaminyltransferase as Inferred from a Database of in Vivo Substrates and from the in Vitro Glycosylation of Proteins and Peptides [J ]. J. Biol. Chem. ,1993, 268 : 10029 - 10038.
  • 4JULENIUS K, MOLGAARD A, GUPTA R, et al. Prediction, Conservation Analysis and Structural Characterization of Mammalian Mucin-type O-glycosylation Sites[ J]. Glycobiology, 2004, 15 : 153 - 164.
  • 5LI S, LIU B, ZENG R. , et al. Predicting O-glycosylation Sites in Mammalian Proteins by Using SVMs [ J]. Computational Biology and Chemistry, 2006, 30:203 -208.
  • 6NISHIKAWA I, SAKAMOTO H, NOUNO I, et al. Prediction of the O-glycosylation Sites in Protein by Layered Neural Networks and Support Vector Machines [ J ]. Lecture Notes in Artificial Intelligence ( Springer), 2006, LNAI 4252 : 953 - 960.
  • 7CHEN Y W, YANG X, ITO M, et al. Panern Analysis and Prediction of O-linked Glycolsylated Sites in Protein by Principal Component Subspace Analysis[ J]. Lecture Notes in Artificial Intelligence ( Springer), 2007, LNAI 4693 : 326 -334.
  • 8YANG X, CHEN Y W, ITO M, et al. Principal Component Analysis of O-linked Glycosylation Sites in Protein Sequence[ J]. Lecture Notes In Artificial Intelligence, 2007:121 -126.
  • 9Protein UniProt [ EB/OL]. http://www, uniprot, org/, 2010.
  • 10BISHOP C M. Neural Network for Pattern Recognition[ M]. Oxford : Oxford University Press, 2000.

共引文献2

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部