
Classification of Phage Virion Proteins Based on Principal Component Analysis Naive Bayes
Abstract: Classifying phage virion proteins is an active problem in bioinformatics. To address the feature-independence assumption of naive Bayes classification and the problem of extracting features from viral proteins, this paper proposes a hybrid feature extraction method that combines pseudo amino acid composition (PAAC) with the composition of k-spaced amino acid pairs (CKSAAP), and applies a principal component analysis naive Bayes classification model (PNBC) to phage virion protein classification. Empirical analysis shows that the PNBC model reaches a classification accuracy of 80%, outperforming both the naive Bayes and support vector machine models.
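The CKSAAP half of the hybrid feature set can be illustrated with a minimal sketch. It follows the usual definition of CKSAAP (frequencies of the 400 ordered residue pairs separated by k positions) and is not the authors' code. The function name `cksaap`, the parameter `k_max`, and the choice of gaps are assumptions for illustration; the abstract does not state which gap values were used.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 ordered pairs


def cksaap(sequence, k_max=1):
    """Return CKSAAP features for gaps k = 0..k_max (hypothetical helper).

    For each gap k, count residue pairs (sequence[i], sequence[i + k + 1])
    and divide by the number of counted pairs, giving 400 frequencies per gap.
    """
    seq = sequence.upper()
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        total = 0
        for i in range(len(seq) - k - 1):
            pair = seq[i] + seq[i + k + 1]
            if pair in counts:  # skip pairs containing non-standard residues
                counts[pair] += 1
                total += 1
        # Assumption: normalise by the number of valid pairs, which equals
        # N - k - 1 when the sequence contains only standard residues.
        features.extend(counts[p] / total if total else 0.0 for p in PAIRS)
    return features


if __name__ == "__main__":
    vec = cksaap("ACAC", k_max=1)
    print(len(vec))  # 400 features per gap, two gaps
```

In a pipeline like the one the abstract describes, a vector of this kind would be concatenated with PAAC features, reduced with principal component analysis, and then passed to a naive Bayes classifier. Because principal components are uncorrelated, the reduction step makes the classifier's independence assumption less of a stretch than it is for the raw features.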
Authors: XU Sirong; YE Renyu; LENG Ting (School of Mathematics and Science, Anqing Normal University, Anqing 246133, China)
Source: Journal of West Anhui University, 2024, No. 2, pp. 44-48 (5 pages)
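Funding: Key Natural Science Research Project of Anhui Higher Education Institutions (KJ2019A0557); Anhui Provincial Graduate Innovation and Entrepreneurship Practice Project (2022cxcysj166); Anqing Normal University School-Level Graduate Academic Innovation Project (2021yjsXSCX041).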
Keywords: principal component analysis; naive Bayes; phage; protein classification
