面向多标签小样本学习的双流重构网络

Dual-stream Reconstruction Network for Multi-label and Few-shot Learning

下载PDF

导出

摘要多标签图像分类问题是计算机视觉领域的重要问题之一,它需要对图像中的所有标签进行预测。而一幅图像中待分类的标签个数往往不止一个,同时图像中对象的大小、位置和姿态的变化都会对模型的分类性能产生影响。因此,如何有效地提高图像特征的准确表达能力是一个亟需解决的难题。针对上述难题,文中提出了一个新颖的双流重构网络来对图像进行特征抽取。具体而言,该模型首先应用一个双流注意力网络来对图像进行基于通道信息和空间信息的特征提取,并经过特征拼接使得图像特征同时兼顾通道特征细节信息和空间特征细节信息。其次,该模型引入了重构损失函数,对双流网络进行特征约束,迫使上述两种分歧特征具有相同的特征表达能力,以此促使提取的双流特征共同向真值特征迫近。在基于VOC 2007和MS COCO多标签图像数据集上的实验结果表明,所提出的双流重构网络能够准确有效地提取出显著特征,并产生更好的分类精度。同时,鉴于重建损失对模型的解拟合作用,将该方法应用在小样本场景上,实验结果显示,所提模型对小样本数据同样具有较好的分类精度。 The multi-label image classification problem is one of the most important problems in the field of computer vision,which needs to predict and output all the labels in an image.However,the number of labels to be classified in an image is often more than one,and the changeable size,posture,and position of objects in the image will increase the difficulty of classification.Therefore,how to effectively improve the accurate expression ability of image features is an urgent problem to be solved.In response to the above-mentioned problem,a novel dual-stream reconstruction network is proposed to extract features from images.Specifically,the model first proposes a dual-stream attention network to extract features based on channel information and spatial information,and uses feature stitching to make image features have both channel detail information and spatial detail information.Secondly,a reconstruction loss function is introduced to constrain the features of the dual-stream network,forcing the above two divergent features to have the same feature expression ability,thereby promoting the extracted dual-stream features to approach the ground-truth features.Experimental results on multi-label image datasets based on VOC 2007and MS COCO show that the proposed dual-stream reconstruction network can accurately and effectively extract salient features and produce better classification accuracy.At the same time,in view of the sparse effect of reconstruction loss on model features,the proposed method is also applied to few-shot learning.The experimental results show that the proposed model also has good classification accuracy for fewshot learning.

作者方仲礼王喆迟子秋 FANG Zhong-li;WANG Zhe;CHI Zi-qiu(School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China)

机构地区华东理工大学信息科学与工程学院

出处《计算机科学》 CSCD 北大核心 2022年第1期212-218,共7页 Computer Science

基金上海市科技计划项目(20511100600) 国家自然科学基金(62076094)。

关键词多标签图像识别特征重构深度学习小样本学习图像注意力机制 Multi-label image recognition Feature reconstruction Deep learning Few-shot learning Image attention mechanism

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献1

1陈天荣,凌捷.基于特征映射的差分隐私保护机器学习方法[J].计算机科学,2021,48(7):33-39. 被引量：2

共引文献1

1周思明,李丹.基于机器学习在公共云中的攻击和防御[J].网络安全技术与应用,2022(1):70-72.

1李波,饶浩波.复杂场景下特征增强的显著性目标检测方法[J].华南理工大学学报（自然科学版）,2021,49(11):135-144. 被引量：1
2王元东,杜宇人.基于改进F3Net网络的显著性目标检测[J].扬州大学学报（自然科学版）,2021,24(5):65-70.
3梁华刚,雷毅雄.增强可分离卷积通道特征的表情识别研究[J].计算机工程与应用,2022,58(2):184-192. 被引量：10
4孙琦,曹蔚,罗建红.含GluN3亚基的N-甲基-D-天冬氨酸受体及其在中枢神经系统的功能[J].浙江大学学报（医学版）,2021,50(5):651-658. 被引量：1
5王光宇,赵曙光,张笑青,郭力争.基于改进DeepLabV3+的COVID-19肺部CT图像语义分割方法[J].计算机科学与应用,2021,11(12):3156-3162.
6陈巧红,孙佳锦,孙麒,贾宇波.基于多层跨模态注意力融合的图文情感分析[J].浙江理工大学学报（自然科学版）,2022,47(1):85-94. 被引量：3
7李余康,翟长远,王秀,袁洪波,张玮,赵春江.基于DeepLab v3+的葡萄叶片分割算法[J].农机化研究,2022,44(2):149-155. 被引量：10
8郭卫涛,帕孜来·马合木提.一种融合上下文多光谱空间通道特征的左心室分割算法研究[J].光电子．激光,2021,32(11):1155-1163. 被引量：1

计算机科学

2022年第1期

浏览历史

内容加载中请稍等...

面向多标签小样本学习的双流重构网络

参考文献1

共引文献1

相关作者

相关机构

相关主题

浏览历史