采用多部件学习的细粒度图像识别

Fine-Grained Image Recognition via Multi-Part Learning

下载PDF

导出

摘要现有基于交叉熵损失函数的细粒度识别方法倾向于发现对象最具有判别性的部件,忽略其他同样关键的具有判别性的次要部件.为了发现尽可能多的、具有辨别性的局部部件,提出采用多部件学习的细粒度图像识别方法.首先提出一个无参数的基于语义块混合的图像数据增强模块,通过交换图像对中最具有判别性的部件,在增广训练数据的同时避免引入无关背景噪声,提高网络对输入扰动灵敏度的鲁棒性和泛化能力;然后提出多部件对抗擦除模块,在注意力和伯努利分布引导下擦除特征图上最具判别性区域,迫使网络学习发现特征图上其他辨别性区域,注意力引导保证擦除区域具有足够的判别性,伯努利分布引导保证擦除区域的多样性;最后通过融合中层特征,进一步提升网络性能.所提方法具有模型无关特性,可以作为一种即插即用模块,与现有多种主干网络相结合.以ResNet-50为主干网络,在3个公开数据集CUB-200-2011,FGVC-Aircraft和Stanford Cars上的实验结果表明,所提方法的分类精度分别达到89.2%,95.5%和94.0%;该方法能够发现更多辨别性部件,且准确率优于相同主干网络下的对比方法. The existing methods mainly use attention to locate the subtle parts.However,convolutional neu-ral networks(CNNs),which employ the cross-entropy loss as the loss function,can only learn the most dis-criminative part and ignore other meaningful regions.In this paper,a novel fine-grained image recognition method via multipart learning(MPL)is presented.Firstly,a parameter-free data augmentation method named Semantic Patch Mix is proposed,which improves the networks’generalization performance on the test dis-tribution and robustness to the sensitivity to input perturbations by exchanging the most discriminative part of the image.Secondly,a parameter-free multi-part adversarial erasing module is proposed,which erases the most discriminative region under the guidance of attention and Bernoulli distribution to force the network to discover other discriminative regions of the object.The attention guidance ensures that the erased regions are sufficiently discriminative,and the Bernoulli distribution guidance ensures that the erased regions are diverse.Finally,mid-level features are incorporated to further improve performance.The proposed method is model-agnostic and thus can serve as a plug-and-play module to be applied to various backbone networks.Taking ResNet-50 as the backbone network,the classification accuracy of the proposed method on three public data sets CUB-200-2011,FGVC-Aircraft and Stanford Cars reached 89.2%,95.5%and 94.0%re-spectively.Experimental results show that the proposed method,which can discover more discriminative parts, outperforms state-of-the-art approaches.

作者蒋海浪刘建明 Jiang Hailang;Liu Jianming(College of Computer Information Engineering,Jiangxi Normal University,Nanchang 330022)

机构地区江西师范大学计算机信息工程学院

出处《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2023年第7期1032-1039,共8页 Journal of Computer-Aided Design & Computer Graphics

基金国家自然科学基金(61662034,62266022) 江西省自然科学基金(20202BAB202020)。

关键词细粒度图像识别数据增强语义块混合多部件对抗擦除 fine-grained image recognition data augmentation semantic patch mix multi-part adversarial erasing

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1郭文明,王腾亿.类激活映射指导数据增强的细粒度图像分类[J].计算机辅助设计与图形学学报,2021,33(11):1698-1704. 被引量：2

二级参考文献1

1赵毅力,徐丹.联合语义部件的鸟类图像细粒度识别[J].计算机辅助设计与图形学学报,2018,30(8):1522-1529. 被引量：7

共引文献1

1袁姮,胡月,张晟翀.反向目标干扰的图像数据增强[J].计算机系统应用,2024,33(6):48-57.

1邹绵璐,秦芮.基于特征融合的内河船舶尺度自适应跟踪[J].计算机应用文摘,2023,39(16):23-25.
2吕佳,梁浩城,王泽宇.基于图卷积的视网膜血管轮廓及高不确定度区域细化框架[J].光电子．激光,2023,34(6):654-662. 被引量：2
3黄港,郑元林,廖开阳,蔺广逢,曹从军,宋雪芳.互补注意多样性特征融合网络的细粒度分类[J].中国图象图形学报,2023,28(8):2420-2431. 被引量：2
4王长亮,黄翠萍,林红,刘宁.碳粉类墨迹中打印篡改特征的稳定性研究[J].广东公安科技,2023,31(1):12-14.
5国强,吴天昊,徐伟,CHORNOGOR Leonid.基于显著感知与一致性约束的目标跟踪算法[J].北京航空航天大学学报,2023,49(9):2244-2257. 被引量：1
6杨云凯,李炳楠,朱峰,孙立明,吴振龙,党涛涛.基于数据驱动的循环流化床机组PI控制[J].控制工程,2023,30(6):1137-1145.

计算机辅助设计与图形学学报

2023年第7期

浏览历史

内容加载中请稍等...

采用多部件学习的细粒度图像识别

参考文献1

二级参考文献1

共引文献1

相关作者

相关机构

相关主题

浏览历史