期刊文献+

基于多层聚焦Inception-V3卷积网络的细粒度图像分类 被引量:4

Multi-Layer Focused Inception-V3 Models for Fine-Grained Visual Recognition
下载PDF
导出
摘要 细粒度图片具有结构多变、背景干扰大、类间差异小、类内差异大等特点,准确地定位与提取判别性局部特征至关重要.本文提出一种多层聚焦卷积网络,通过首层聚焦网络能够准确、有效地聚焦于识别局域并生成定位区域,根据定位区域对原图像分别进行裁剪和遮挡后输入下一层的聚焦网络进行训练分类.其中单层聚焦网络以In⁃ception-V3网络为基础,通过卷积块特征注意力模块和定位区域选择机制来聚焦有效的定位区域;使用双线性注意力最大池化提取各个局部的特征;最后进行分类预测.本文在3个常用的细粒度数据集CUB-2011、FGVC-Aircraft以及Stanford Cars上进行了实验验证,分别获得了89.7%、93.6%和95.1%的Top-1准确率.实验结果表明,本模型的分类准确率高于目前主流方法. Fine-grained pictures are characterized by variable structure,large background interference,small interclass difference and large intra-class difference,so accurate positioning and extraction of discriminant local features are cru⁃cial.In this paper,a multi-layer focused convolution network is proposed,which can accurately and effectively focus on identifying local areas and generating locating regions through the first-layer focused network.According to the positioning area,the image is cropped and dropped,and then the focus network of the next layer is input for training and classification.The single-layer focused network is based on the Inception-V3 network and focuses the effective location area through the convolutional block feature attention module,and location area selection mechanism.Bilinear attention maximum pooling was used to extract the features of each part.Classification prediction is made.Experimental verification was carried out on three commonly used fine-grained data sets CUB-2011,Fgvc-Aircraft and Stanford Cars the accuracy of top-1 was obtained at 89.7%,93.6%and 95.1%,respectively.Experimental results show that the classification accuracy of this model is higher than that of the current mainstream methods.
作者 王波 黄冕 刘利军 黄青松 单文琦 WANG Bo;HUANG Mian;LIU Li-jun;HUANG Qing-song;SHAN Wen-qi(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China;Information Center,Yunnan Land and Resources Vocational College,Kunming,Yunnan 652501,China;School of Information,Yunnan University,Kunming,Yunnan 650091,China;Yunnan Key Laboratory of Computer Technology Applications,Kunming,Yunnan 650500,China)
出处 《电子学报》 EI CAS CSCD 北大核心 2022年第1期72-78,共7页 Acta Electronica Sinica
基金 国家自然科学基金(No.81860318,No.81560296) 云南省计算机技术应用重点实验室开放基金(No.2020106)。
关键词 多层聚焦卷积网络 Inception-V3网络 注意力机制 双线性注意力最大池化 multilayer focused convolution network inception-V3 attention mechanism bilinear attention maxi⁃mum pooling
  • 相关文献

参考文献4

二级参考文献10

共引文献72

同被引文献39

引证文献4

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部