期刊文献+

基于加强图像块相关性的细粒度图像分类方法

Fine Grained Image Classification Method Based on Enhanced Patch Correlation
下载PDF
导出
摘要 在细粒度图像分类任务中,提取出具有区分性的局部特征对识别图像之间的微小差异非常重要。基于ViT(vision transformer)框架的算法模型在计算机视觉各个研究领域取得了优异的表现。针对基于ViT框架的细粒度图像分类模型对图片局部区域关注度低的问题且为进一步加强图像块特征的上下文联系,提出了一种基于加强图像块相关性的细粒度图像分类方法。首先,提出了赋予图像块相关性权重的方法,并嵌套应用于不同层编码器中丰富不同层次特征信息,解决了ViT对图像局部特征关注不够的问题;其次,结合图像块的位置信息加强了局部特征上下文的联系,同时减少了噪声信息带来的干扰;最后,提出相似损失函数来学习细粒度图像中微小特征的差异性,优化模型的分类效果。在两个公开数据集CUB-200-2011和Standford Dogs上进行实验分别取得了91.33%、92.15%的准确率,提出的方法分别比基准模型ViT网络提升了0.63、0.45百分点,有效提升了细粒度图像分类效果,验证了方法的有效性。 In the fine-grained image classification task,it is crucial to extract distinctive local features to identify small differences between images.The algorithm model based on ViT(vision transformer)framework has achieved excellent performance in various research fields of computer vision.Aiming at the problem that the fine-grained image classification model based on ViT framework pays little attention to the local area of the picture and to further strengthen the context connection of patch features,a fine-grained image classification method based on enhancing the correlation of patch is proposed.Firstly,a method of assigning correlation weights to patches is proposed,and nested application is used in different layer encoders to enrich different layer feature information,which solves the problem that ViT does not pay enough attention to local features of images.Secondly,combining the position information of the patch,the local feature context is strengthened,and the interference caused by the noise information is reduced.Finally,the similarity loss function is proposed to learn the difference of minute features in fine-grained images and optimize the classification effect of the model.Experiments on two public data sets,CUB-200-2011 and Standford Dogs,have achieved an accuracy of 91.33%and 92.15%,respectively.The proposed method improves the benchmark model ViT network by 0.63 and 0.45 percentage points respectively,effectively improving the fine-grained image classification effect,and verifying the effectiveness of the method.
作者 王坤 朱子奇 WANG Kun;ZHU Zi-qi(School of Computer Science&Technology,Wuhan University of Science and Technology,Wuhan 430065,China)
出处 《计算机技术与发展》 2023年第5期56-61,共6页 Computer Technology and Development
基金 国家自然科学基金资助项目(61702382)。
关键词 VIT 细粒度图像分类 局部特征 相关性 图像块特征 编码器 vision transformer fine grained image classification local features correlation patch features encoder
  • 相关文献

参考文献3

二级参考文献7

共引文献179

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部