Weakly supervised fine-grained image classification algorithm based on attention-attention bilinear pooling (cited by: 5)
Abstract With the rapid development of artificial intelligence, image classification in computer vision is no longer limited to identifying the broad category of an object; images within the same category must also be divided into finer subcategories. To distinguish the subtle differences between categories and to reduce interference from the background, a fine-grained classification algorithm based on Attention-Attention Bilinear Pooling (AABP) is proposed. First, a pretrained Inception V3 model extracts global image features, and depthwise separable convolution predicts local attention regions on the feature maps. Then, the Weakly Supervised Data Augmentation Network (WS-DAN) algorithm feeds the augmented images back into the network, strengthening its generalization ability to prevent overfitting. Finally, the further extracted attention feature regions are linearly fused in the AABP network to improve classification accuracy. Experimental results show that the algorithm achieves 88.51% accuracy and 97.65% top-5 accuracy on the CUB-200-2011 dataset, 89.77% accuracy and 99.27% top-5 accuracy on the Stanford Cars dataset, and 93.5% accuracy and 97.96% top-5 accuracy on the FGVC-Aircraft dataset.
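The abstract does not spell out how attention maps and feature maps are combined, so the following is only a minimal sketch of generic bilinear attention pooling as it is commonly described for WS-DAN-style networks, not the authors' AABP network. The function name `bilinear_attention_pooling`, the flattened spatial layout, and the signed-square-root plus L2 normalisation are all assumptions made for illustration.

```python
import math


def bilinear_attention_pooling(features, attentions):
    """Pool each feature channel under each attention map (illustrative only).

    features:   C channels, each a flat list of N spatial activations.
    attentions: M attention maps, each a flat list of N spatial weights.
    Returns a flat list of M*C values, L2-normalised when the norm is non-zero.
    """
    n = len(features[0])
    if any(len(ch) != n for ch in features) or any(len(a) != n for a in attentions):
        raise ValueError("all channels and attention maps must share one spatial size")

    pooled = []
    for att in attentions:
        for ch in features:
            # Attention-weighted spatial average of this channel.
            mean = sum(a * f for a, f in zip(att, ch)) / n
            # Signed square root (an assumed normalisation choice).
            pooled.append(math.copysign(math.sqrt(abs(mean)), mean))

    norm = math.sqrt(sum(v * v for v in pooled))
    return [v / norm for v in pooled] if norm > 0 else pooled


# Toy input: 2 feature channels and 2 attention maps over 4 spatial positions.
features = [[1, 2, 3, 4], [0, 1, 0, 1]]
attentions = [[1, 0, 0, 0], [0, 0, 1, 1]]
pooled = bilinear_attention_pooling(features, attentions)
print(len(pooled))
```

Each attention map yields one pooled vector with one entry per feature channel, so the output length is the number of attention maps times the number of channels. In a real network the inputs would be tensors from the backbone and the attention head rather than Python lists.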
Authors LU Xinwei; YU Pengfei; LI Haiyan; LI Hongsong; DING Wenqian (School of Information Science and Engineering, Yunnan University, Kunming, Yunnan 650500, China)
Source Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core Journals (北大核心), 2021, No. 5, pp. 1319-1325 (7 pages)
Funding National Natural Science Foundation of China (62066046).
Keywords fine-grained classification; linear fusion; weakly supervised; data augmentation; depthwise separable convolution