
Classification Method for Human Protein Images Based on CSPPNet and Ensemble Learning

Cited by: 3
Abstract: Human protein image classification aims to identify subcellular localization labels, such as the nucleoplasm and the nuclear membrane, for proteins in cell organelles. To address the large size of protein classification data sets, the imbalance among multi-label categories, and the small differences between classes, this paper combines CSPPNet with ensemble learning and proposes a classification method for human protein images. The method builds a CSPPNet model that combines coarse-grained and fine-grained identification. The feature maps generated by the first few convolutional layers of the model are passed through a spatial pyramid pooling layer and combined with the feature maps generated by the later convolutional layers. Global and local features of the image are thus used together to detect differences between images automatically, which improves the precision of fine-grained image classification. Ensemble learning is then applied to further improve accuracy. Experimental results show that, compared with the classic convolutional neural network (CNN), the model improves both accuracy and F1 score.
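The feature fusion and ensemble steps described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names, the pyramid levels, the use of max pooling, and the choice of probability averaging with a 0.5 threshold as the ensemble rule are all assumptions made for illustration, since the abstract does not specify them.

```python
def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool one 2-D feature map into a fixed-length vector.

    For each pyramid level n the map is split into an n x n grid and the
    maximum of every cell is kept, so the output length is sum(n * n)
    regardless of the input height and width.
    (Pyramid levels and max pooling are assumptions, not from the paper.)
    """
    height, width = len(feature_map), len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            # Floor for the start and ceiling for the end keep every cell non-empty.
            r0 = (i * height) // n
            r1 = -((-(i + 1) * height) // n)
            for j in range(n):
                c0 = (j * width) // n
                c1 = -((-(j + 1) * width) // n)
                pooled.append(max(feature_map[r][c]
                                  for r in range(r0, r1)
                                  for c in range(c0, c1)))
    return pooled


def fuse_features(early_maps, late_features, levels=(1, 2, 4)):
    """Concatenate SPP-pooled early-layer maps with late-layer features."""
    fused = []
    for fmap in early_maps:
        fused.extend(spatial_pyramid_pool(fmap, levels))
    fused.extend(late_features)
    return fused


def ensemble_predict(prob_lists, threshold=0.5):
    """Average per-label probabilities over models, then threshold.

    Averaging is a hypothetical ensemble rule chosen for this sketch.
    """
    n_models = len(prob_lists)
    averaged = [sum(probs) / n_models for probs in zip(*prob_lists)]
    return [1 if p >= threshold else 0 for p in averaged]


# Toy 4 x 4 feature map standing in for an early convolutional output.
early = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
fused = fuse_features([early], late_features=[0.5, 0.25], levels=(1, 2))
labels = ensemble_predict([[0.9, 0.2, 0.4], [0.7, 0.4, 0.8]])
```

Because the pooled vector has a fixed length for any input size, the early-layer (local) features can be concatenated directly with the late-layer (global) features before the multi-label classifier.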
Authors: LI Peiyuan (李培媛); HUANG Chi (黄迟) (College of Mathematics, Taiyuan University of Technology, Taiyuan 030024, China; School of Information and Engineering, Southwestern University of Finance and Economics, Chengdu 611130, China)
Source: Computer Engineering (《计算机工程》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2020, Issue 8, pp. 235-242 (8 pages)
Funding: National Natural Science Foundation of China (61603268).
Keywords: protein; subcellular localization; image classification; Spatial Pyramid Pooling (SPP); fine-grained identification; ensemble learning
