非局部注意力双分支网络的跨模态赤足足迹检索被引量：1

Non-local attention dual-branch network based cross-modal barefoot footprint retrieval

导出

摘要目的针对目前足迹检索中存在的采集设备种类多样化、有效的足迹特征难以提取等问题,本文以赤足足迹图像为研究对象,提出一种基于非局部(non-local)注意力双分支网络的跨模态赤足足迹检索算法。方法该网络由特征提取、特征嵌入以及双约束损失模块构成,其中特征提取模块采用双分支结构,各分支均以Res Net50作为基础网络分别提取光学和压力赤足图像的有效特征;同时在特征嵌入模块中通过参数共享学习一个多模态的共享空间,并引入非局部注意力机制快速捕获长范围依赖,获得更大感受野,专注足迹图像整体压力分布,在增强每个模态有用特征的同时突出了跨模态之间的共性特征;为了增大赤足足迹图像类间特征差异和减小类内特征差异,利用交叉熵损失LCE(cross-entropy loss)和三元组损失LTRI(triplet loss)对整个网络进行约束,以更好地学习跨模态共享特征,减小模态间的差异。结果本文将采集的138人的光学赤足图像和压力赤足图像作为实验数据集,并将本文算法与细粒度跨模态检索方法FGC(fine-grained cross-model)和跨模态行人重识别方法HC(hetero-center)进行了对比实验,本文算法在光学到压力检索模式下的m AP(mean average precision)值和rank1值分别为83.63%和98.29%,在压力到光学检索模式下的m AP值和rank1值分别为84.27%和94.71%,两种检索模式下的m AP均值和rank1均值分别为83.95%和96.5%,相较于FGC分别提高了40.01%和36.50%,相较于HC分别提高了26.07%和19.32%。同时本文算法在non-local注意力机制、损失函数、特征嵌入模块后采用的池化方式等方面进行了对比分析,其结果证实了本文算法的有效性。结论本文提出的跨模态赤足足迹检索算法取得了较高的精度,为现场足迹比对、鉴定等应用提供了研究基础。 ObjectiveFootprints are the highest rate of material evidence left and extracted from crime scene in general.Footprint retrieval and comparison plays an important role in criminal investigation.Footprint features are identified via the foot shape and bone structure of the person involved and have its features of specificity and stability.Meanwhile,footprints can reveal their essential behavior in the context of the physiological and behavioral characteristics.It is related to the biological features like height,body shape,gender,age and walking habits.Medical research results illustrates that footprint pressure information of each person is unique.It is challenged to improve the rate of discovery,extraction and utilization of footprints in criminal investigation.The retrieval of footprint image is of great significance,which will provide theoretical basis and technical support for footprint comparison and identification.Footprint images have different modes due to the diverse scenarios and tools of extraction.The global information of cross-modal barefoot images is unique,which can realize retrieval-oriented.The retrieval orientation retrieves the corresponding image of cross-modes.The traditional cross-modal retrieval methods are mainly in the context of subspace method and objective model method.These retrieval methods are difficult to obtain distinguishable features.The deep learning based retrieval methods construct multi-modal public space via convolutional neural network(CNN).The high-level semantic features of image can be captured in terms of iterative optimization of network parameters,to lower the multi-modal heterogeneity.MethodA cross-modal barefoot footprint retrieval algorithm based on non-local attention two-branch network is demonstrated to resolve the issue of intra-class wide distance and inter-class narrow distance in fine-grained images.The collected barefoot footprint images involve optical mode and pressure mode.The median filter is applied to remove noises for all images,and the data augmentation method is used to expand the footprint images of each mode.In the feature extraction module,the pre-trained Res Net50 is used as basic network to extract the inherent features of each mode.In the feature embedding module,parameter sharing is realized by splicing feature vectors,and a multi-modal sharing space is constructed.All the residual blocks in the Layer2 and Layer3 of the Res Net50 use a non-local attention mechanism to capture long-range dependence,obtain a large receptive field,and highlight common features quickly.Simultaneously,cross-entropy loss and triplet loss are used to better learn multi-modal sharing space in order to reduce intra-class differences and increase inter-class differences of features.Our research tool is equipped with two NVIDIA 2070TI graphics CARDS,and the network is built in Py Torch.The size of the barefoot footprint images is 224×224 pixels.The stochastic gradient descent(SGD)optimizer is used for training.The number of iterations is 81,and the initial learning rate is 0.01.The trained network is validated by using the validation set,and the mean average precision(mAP)and rank values are obtained.In addition,the optimal model is saved in accordance with the highest rank1 value.The backup model is based on the test set,and the data of the final experimental results are recorded and saved.ResultA cross-modal retrieval dataset is collected and constructed through a 138 person sample.Our comparative experiments are carried out to verify the effect of non-local attention mechanism in related to the retrieval efficiency,multiple loss functions and different pooling methods based on feature embedding modules.Our illustrated algorithm is compared to fine-grained cross-modal retrieval derived fine-grained cross-model(FGC)method and the RGB-infrared crossmodal person re-identification based hetero-center(HC)method.The number of people in the training set,verification set and test set is 82,28 and 28,respectively,including 16400 images,5600 images and 5600 images each.The ratio of query images and retrieval images in the verification set and test set is 1∶2.The evaluation indexes of the experiment are m AP mean(mAP_Avg)and rank1 mean(rank1_Avg)of two retrieval modes.Our analysis demonstrates that the algorithm illustrated has a higher precision,and the m AP_Avg and rank1_Avg are 83.95%and 96.5%,respectively.Compared with FGC and HC,the evaluation indexes of the proposed algorithm is 40.01%and 36.50%(higher than FGC),and 26.07%and 19.32%(higher than HC).ConclusionA cross-modal barefoot footprint retrieval algorithm is facilitated based on a non-local attention dual-branch network through the integration of non-local attention mechanism and double constraint loss.Our algorithm considers the uniqueness and correlation of in-modal and inter-modal features,and improves the performance of cross-modal barefoot footprint retrieval further,which can provide theoretical basis and technical support for footprint comparison and identification.

作者鲍文霞茅丽丽王年唐俊杨先军张艳 Bao Wenxia;Mao Lili;Wang Nian;Tang Jun;Yang Xianjun;Zhang Yan(College of Electronic Information Engineering,Anhui University,Hefei 230601,China;Hefei Institutes of Physical Science,Chinese Academy of Sciences,Hefei 230031,China)

机构地区安徽大学电子信息工程学院中国科学院合肥物质科学研究院

出处《中国图象图形学报》 CSCD 北大核心 2022年第7期2199-2213,共15页 Journal of Image and Graphics

基金国家重点研发计划资助(2020YFF0303803) 国家自然科学基金项目(61772032) 安徽高校自然科学研究重点项目(KJ2021ZD0004,KJ2019A0027)。

关键词图像检索跨模态足迹检索非局部注意力机制双分支网络赤足足迹图像 image retrieval cross-modal footprint retrieval non-local attention mechanism two-branch network barefoot footprint image

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献5

1鲍文霞,王云飞,王年,唐俊.基于度量学习核函数的光学足迹图像识别算法[J].华中科技大学学报（自然科学版）,2020,48(11):11-16. 被引量：8
2鲍文霞,瞿金杰,王年,唐俊,鲁玺龙.基于空间聚合加权卷积神经网络的力触觉足迹识别[J].东南大学学报（自然科学版）,2020,50(5):959-964. 被引量：6
3薛亚龙,岳佳.论案犯心态对足迹反映的影响[J].福建警察学院学报,2012,26(4):55-60. 被引量：3
4赵博文,张力夫,潘在峰,王蓉,郭雅馨.基于OpenCV的图像滤波方法比较[J].信息与电脑,2020,32(15):78-80. 被引量：20
5朱明,汪桐生,王年,唐俊,鲁玺龙.基于多尺度自注意卷积的足迹压力图像检索算法[J].模式识别与人工智能,2020,33(12):1097-1103. 被引量：6

二级参考文献15

1高浩军,杜宇人.中值滤波在图像处理中的应用[J].电子工程师,2004,30(8):35-36. 被引量：67
2张旭明,徐滨士,董世运.用于图像处理的自适应中值滤波[J].计算机辅助设计与图形学学报,2005,17(2):295-299. 被引量：159
3史力民.足迹学[M].北京:中国人民公安大学出版社,2007:7.
4詹姆斯·马吉尔.解读心理学与犯罪[M].张广宇,译.北京:中国人民公安大学出版社,2009.
5公安部政治部.足迹学[M].北京:中国人民公安大学出版社,2007.
6丁生荣,马苗.基于直方图信息灰色关联的图像噪声类型识别方法[J].陕西师范大学学报（自然科学版）,2011,39(1):18-22. 被引量：6
7史力民,李硕,赵悦岑.基于深度学习的赤足迹性别自动分析研究[J].中国刑警学院学报,2018(3):97-99. 被引量：7
8陈扬,曾诚,程成,邹恩岑,顾建伟,陆悠,奚雪峰.一种基于CNN的足迹图像检索与匹配方法[J].南京师范大学学报（工程技术版）,2018,18(3):39-45. 被引量：7
9邹永宁,姚功杰.自适应窗口形状的中值滤波[J].光学精密工程,2018,26(12):3028-3039. 被引量：26
10李健,丁小奇,陈光,孙旸,姜楠.基于改进高斯滤波算法的叶片图像去噪方法[J].南方农业学报,2019,50(6):1385-1391. 被引量：44

共引文献35

1郑雅琳,张奕玮,肖艳珍,宦智杰,马玮城.基于OpenCV的线束序列检测系统研究[J].产业科技创新,2020(21):30-31.
2薛亚龙,陆希娟.侦查错误中致错要素的结构化研究[J].福建警察学院学报,2014,28(2):1-8.
3刘以强,曾实现,刘树涛,徐宇.基于深度学习的水果识别称重贴签封口一体机[J].科学大众（科技创新）,2021(2):52-53.
4王超,肖拾花,满月娥,刘乐祥,陈勇,万福玺.机器视觉在航空发动机外观缺陷检测中的应用[J].航空计算技术,2021,51(3):82-85. 被引量：3
5吴皓,王钰淏,田国会,路飞.考虑难例挖掘和整体特征分布的损失函数设计[J].华中科技大学学报（自然科学版）,2021,49(6):37-42. 被引量：2
6鲍文霞,茅丽丽,王年,杨先军,刘晋,瞿金杰.基于注意力双分支网络的跨模态足迹检索[J].东南大学学报（自然科学版）,2021,51(5):914-922. 被引量：5
7刘燕,张国平,董谱,杨晓霞.基于OpenCV和Dlib的交互式信息展示相册系统[J].信息技术,2021,45(10):31-37. 被引量：3
8王新年,于丹,张涛.穿鞋足迹序列的足迹能量图组表达与识别[J].中国图象图形学报,2021,26(10):2357-2375. 被引量：1
9宦娟,李明宝,徐宪根,曾一鸣,史兵,张勤兰.基于无人机图像的中华绒螯蟹质量估算研究[J].海洋渔业,2021,43(6):740-750. 被引量：1
10李浩,庞爱民,黄攀,陈家浩,张熙.基于机器视觉的动铁装配检测研究[J].中州大学学报,2021,38(6):122-128.

同被引文献15

1雷航,童莉,平西建.平面赤足迹特征分析与身份识别方法[J].计算机辅助设计与图形学学报,2008,20(5):659-664. 被引量：11
2梁栋,高玮玮,张艳,鲍文霞.基于足底压力图像的静态触觉步态识别[J].华中科技大学学报（自然科学版）,2013,41(10):25-29. 被引量：17
3汪飞跃,姚志明,许胜强,魏凯,杨先军.基于柔性力敏传感器的左右脚动态识别方法[J].传感技术学报,2015,28(7):964-971. 被引量：7
4丁汉,唐云祁,郭威.自然行走状态下的足底压力稳定性研究[J].计算机技术与发展,2017,27(4):153-156. 被引量：9
5郑远攀,李广阳,李晔.深度学习在图像识别中的应用研究综述[J].计算机工程与应用,2019,55(12):20-36. 被引量：386
6王颢.深度学习在图像识别中的研究与应用[J].科技视界,2020(24):37-38. 被引量：14
7鲍文霞,瞿金杰,王年,唐俊,鲁玺龙.基于空间聚合加权卷积神经网络的力触觉足迹识别[J].东南大学学报（自然科学版）,2020,50(5):959-964. 被引量：6
8鲍文霞,王云飞,王年,唐俊.基于度量学习核函数的光学足迹图像识别算法[J].华中科技大学学报（自然科学版）,2020,48(11):11-16. 被引量：8
9朱明,汪桐生,王年,唐俊,鲁玺龙.基于多尺度自注意卷积的足迹压力图像检索算法[J].模式识别与人工智能,2020,33(12):1097-1103. 被引量：6
10王鹏鹏,吴洛天,汪曙光,张艳,鲁玺龙.基于关系网络的赤足足迹识别[J].传感器与微系统,2021,40(4):126-130. 被引量：1

引证文献1

1王昆,郭威,王尊严,韩文强.赤足足迹识别研究综述[J].计算机科学与探索,2024,18(1):44-57.

1陈扬,曾诚,程成,邹恩岑,顾建伟,陆悠,奚雪峰.一种基于CNN的足迹图像检索与匹配方法[J].南京师范大学学报（工程技术版）,2018,18(3):39-45. 被引量：7
2鲍文霞,茅丽丽,王年,杨先军,刘晋,瞿金杰.基于注意力双分支网络的跨模态足迹检索[J].东南大学学报（自然科学版）,2021,51(5):914-922. 被引量：5
3李宏伟.π宇宙[J].小说界,2022(4):10-41.
4朱明,汪桐生,王年,唐俊,鲁玺龙.基于多尺度自注意卷积的足迹压力图像检索算法[J].模式识别与人工智能,2020,33(12):1097-1103. 被引量：6
5金益锋,孙晰锐,吴文达,李岱熹,蒋雪梅,耿小鹏.基于深度学习跨清晰度的鞋面检索——从足迹图像到视频中锁定犯罪嫌疑人的应用[J].科学技术与工程,2022,22(19):8406-8413. 被引量：2
6林建吾,张欣,陈孝玉龙,陈洋,曹藤宝,喻殿智.基于轻量化卷积神经网络的番茄病害图像识别[J].无线电工程,2022,52(8):1347-1353. 被引量：13
7张天飞,龙海燕,丁娇,周荣强.基于独立区域3D注意力机制的人群位置计数方法[J].平顶山学院学报,2022,37(2):44-49.
8朱德涛(文/图).从两幅佛足迹图像看明清时期汉藏佛教的交往交流交融[J].中国西藏,2022(4):72-73.
9周万良,邓欢.基于碱激发矿渣和硅酸盐水泥的功能梯度混凝土的耐久性[J].工业建筑,2022,52(6):162-166. 被引量：1
10孔令亮.校园生物资源在初中生物课堂中的应用策略[J].进展,2022,17(14):83-85.

中国图象图形学报

2022年第7期

浏览历史

内容加载中请稍等...

非局部注意力双分支网络的跨模态赤足足迹检索被引量：1

参考文献5

二级参考文献15

共引文献35

同被引文献15

引证文献1

相关作者

相关机构

相关主题

浏览历史

非局部注意力双分支网络的跨模态赤足足迹检索 被引量：1

参考文献5

二级参考文献15

共引文献35

同被引文献15

引证文献1

相关作者

相关机构

相关主题

浏览历史

非局部注意力双分支网络的跨模态赤足足迹检索被引量：1