基于多尺度注意力的鸟类图像识别

Bird Image Recognition Based on Multiscale Attention

下载PDF

导出

摘要鸟类图像不同子类别外观相似,而同类别目标因复杂的背景、姿态等呈现较大的类内差异。针对这个问题,提出了基于多尺度注意力的卷积神经网络模型。模型通过无参数学习的目标模块和部件模块使注意力由全局图像逐渐聚焦到目标和部件图像,形成了能输入多尺度图像的三分支网络模型。此外,引入排序损失以减少背景的干扰。在CUB-200-2011和NABirds数据集上,模型的识别精度分别为87.21%和85.96%,与基线模型相比,识别精度得到有效提高,验证了模型的有效性。 Different sub-categories of bird images have similar appearances,while objects of the same category show large in-tra-class variances due to complex backgrounds and pose.To solve this problem,a convolutional neural network model based on multi-scale attention is proposed.The model gradually focuses on the attention from the global image to the target and component im-ages through the target module and component module of parameter-free learning and forms a three-branch network model that can input multi-scale images.Furthermore,an ordering loss is introduced to reduce background interference.On the CUB-200-2011 and NABirds datasets,the recognition accuracy of the model is 87.21%and 85.96%,respectively.Compared with the baseline mod-el,the recognition accuracy is effectively improved,which verifies the effectiveness of the model.

作者阮涛郝智程 RUAN Tao;HAO Zhicheng(Institute of Applied Mathematics,Beijing Information Science&Technology University,Beijing 100010)

机构地区北京信息科技大学应用数学研究所

出处《计算机与数字工程》 2024年第10期3148-3152,3171,共6页 Computer & Digital Engineering

关键词鸟类图像识别多尺度注意力排序损失卷积神经网络 bird image recognition multiscale attention rank loss convolutional neural networks

分类号 TP751 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

1祁伟,朱丽媛,黄鑫,辛文豪.血清SCUBE1及Sestrin2与急性ST段抬高型心肌梗死患者PCI术后微血管阻塞的关系[J].国际医药卫生导报,2024,30(24):4160-4165.
2姜苏城,王红林.结合门控机制与多尺度ViT的细粒度图像分类[J].计算机仿真,2024,41(9):139-145.
3高欣宇,杜方,宋丽娟.基于扩散模型的文本图像生成对比研究综述[J].计算机工程与应用,2024,60(24):44-64.
4支凯茹,张凯,门昌骞,王文剑.融合随机傅里叶特征的混合神经网络模型[J].小型微型计算机系统,2024,45(12):2875-2881.
5王雪松,吕理想,程玉虎,王浩宇.注意力集合表示的多尺度度量小样本图像分类[J].中国图象图形学报,2024,29(11):3371-3382.
6庹满先,杨江河,张月莲,汪胜辉,聂建军,樊军辉.耀变体的逆康普顿峰值频率与伽马光子谱指数的关系[J].天文学报,2024,65(6):32-39.
7张伯韬,仉率杰,孙爽爽,袁莹,胡锡峰,贾晓峰,于媛媛,薛付忠.基于贝叶斯网络的缺血性脑卒中筛查模型构建[J].山东大学学报（医学版）,2024,62(11):73-84.

计算机与数字工程

2024年第10期

浏览历史

内容加载中请稍等...

基于多尺度注意力的鸟类图像识别

相关作者

相关机构

相关主题

浏览历史