Attention Guided Food Recognition via Multi-Stage Local Feature Fusion

下载PDF

导出

摘要 The task of food image recognition,a nuanced subset of fine-grained image recognition,grapples with substantial intra-class variation and minimal inter-class differences.These challenges are compounded by the irregular and multi-scale nature of food images.Addressing these complexities,our study introduces an advanced model that leverages multiple attention mechanisms and multi-stage local fusion,grounded in the ConvNeXt architecture.Our model employs hybrid attention(HA)mechanisms to pinpoint critical discriminative regions within images,substantially mitigating the influence of background noise.Furthermore,it introduces a multi-stage local fusion(MSLF)module,fostering long-distance dependencies between feature maps at varying stages.This approach facilitates the assimilation of complementary features across scales,significantly bolstering the model’s capacity for feature extraction.Furthermore,we constructed a dataset named Roushi60,which consists of 60 different categories of common meat dishes.Empirical evaluation of the ETH Food-101,ChineseFoodNet,and Roushi60 datasets reveals that our model achieves recognition accuracies of 91.12%,82.86%,and 92.50%,respectively.These figures not only mark an improvement of 1.04%,3.42%,and 1.36%over the foundational ConvNeXt network but also surpass the performance of most contemporary food image recognition methods.Such advancements underscore the efficacy of our proposed model in navigating the intricate landscape of food image recognition,setting a new benchmark for the field.

作者 Gonghui Deng Dunzhi Wu Weizhen Chen

机构地区 School of Electrical and Electronic Engineering

出处《Computers, Materials & Continua》 SCIE EI 2024年第8期1985-2003,共19页 计算机、材料和连续体（英文）

基金 The support of this research was by Hubei Provincial Natural Science Foundation(2022CFB449) Science Research Foundation of Education Department of Hubei Province(B2020061),are gratefully acknowledged.

关键词 Fine-grained image recognition food image recognition attention mechanism local feature fusion

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1刘宇昕,闵巍庆,蒋树强,芮勇.多尺度拼图重构网络的食品图像识别[J].软件学报,2022,33(11):4379-4395. 被引量：3

二级参考文献1

1梁华刚,温晓倩,梁丹丹,李怀德,茹锋.多级卷积特征金字塔的细粒度食物图片识别[J].中国图象图形学报,2019,24(6):870-881. 被引量：7

共引文献2

1张志凯,韩红章,赵雪芊,李忠.基于改进YOLOv3模型的软包装食品自动识别方法[J].食品与机械,2023,39(5):95-100. 被引量：4
2朱建学.基于图像识别的输煤皮带纠偏方法[J].信息与电脑,2023,35(13):182-184.

1Yaming Kang,PeishunYe,Yuxiu Bai,Shi Qiu.Hyperspectral Image Based Interpretable Feature Clustering Algorithm[J].Computers, Materials & Continua,2024,79(5):2151-2168.
2Lifu Zhang,Liaoran Gao,Changping Huang,Nan Wang,Sa Wang,Mingyuan Peng,Xia Zhang,Qingxi Tong.Crop classification based on the spectrotemporal signature derived from vegetation indices and accumulated temperature[J].International Journal of Digital Earth,2022,15(1):626-652. 被引量：1
3Peishu Wu,Han Li,Liwei Hu,Jirong Ge,Nianyin Zeng.A Local-Global Attention Fusion Framework With Tensor Decomposition for Medical Diagnosis[J].IEEE/CAA Journal of Automatica Sinica,2024,11(6):1536-1538.
4Xiang LUO,Chang LIU,Gaopeng GOU,Gang XIONG,Zhen LI,Binxing FANG.Identifying malicious traffic under concept drift based on intraclass consistency enhanced variational autoencoder[J].Science China(Information Sciences),2024,67(8):234-248.
5Zewen Zhang,Sheng Zhou,Chunzheng Cao.Curve Classification Based onMean-Variance Feature Weighting and Its Application[J].Computers, Materials & Continua,2024,79(5):2465-2480.

Computers, Materials & Continua

2024年第8期

浏览历史

内容加载中请稍等...

Attention Guided Food Recognition via Multi-Stage Local Feature Fusion

参考文献1

二级参考文献1

共引文献2

相关作者

相关机构

相关主题

浏览历史