融合姿态引导和多尺度特征的遮挡行人重识别

Pose guidance and multi-scale feature fusion for occluded person re-identification

导出

摘要目的在行人重识别任务中,行人外观特征会因为遮挡发生变化,从而降低行人特征的辨别性,仅基于可视部分的传统方法仍会识别错误。针对此问题,提出了一种融合姿态引导和多尺度特征的遮挡行人重识别方法。方法首先,构建了一种特征修复模块,根据遮挡部位邻近信息恢复特征空间中被遮挡区域的语义信息,实现缺失部位特征的修补。然后,为了从修复的图像中提取有效的姿态信息,设计了一种姿态引导模块,通过姿态估计引导特征提取,实现更加精准的行人匹配。最后,搭建了特征增强模块,并融合显著性区域检测方法增强有效的身体部位特征,同时消除背景信息造成的干扰。结果在3个公开的数据集上进行了对比实验和消融实验,在Market1501、DukeMTMC-reID(Duke multi-tracking multi-camera re-identification)和Occluded-DukeMTMC(occluded Duke multi-tracking multi-camera re-identification)数据集上的平均精度均值(mean average precision,mAP)和首次命中率(rank-1 accuracy,Rank-1)分别为88.8%和95.5%、79.2%和89.3%、51.7%和60.3%。对比实验结果表明提出的融合算法提高了行人匹配的准确率,具有较好的竞争优势。结论本文所提的姿态引导和多尺度融合方法,修复了因遮挡而缺失的部位特征,结合姿态信息融合了不同粒度的图像特征,提高了模型的识别准确率,能有效缓解遮挡导致的误识别现象,验证了方法的有效性。 Objective Person re-identification(ReID)is an important task in computer vision,and it aims to accurately identify and associate the same person between multiple visual surveillance cameras by extracting and matching features of pedestrian under different scenarios.Occluded person ReID is a challenging and specialized task in the existing person ReID problems.In real-world settings,occlusion is a common issue,and it impacts the practical application of person ReID technique to a certain extent.Recently,occluded person ReID has gradually attracted the attention of many research⁃ers,and several methods have been proposed to address the issue of occlusion,which achieve impressive results.Cur⁃rently,these methods primarily focus on the visible regions in images.Concretely,it first locates the visible regions in the image and then specially designs a model to extract discerning feature information from these regions,which achieves accu⁃rate person matching.These methods typically remove features coming from the occluded areas and then exploit discrimina⁃tive features from the non-occluded regions for matching.Although these methods achieve impressive results,the influence of occluded regions and background interference in images are ignored,which results in the aforementioned solutions fail⁃ing to effectively address the misclassification issue resulting from similar appearances in non-occluded regions.Conse⁃quently,merely relying on visible regions for subsequent recognition task leads to a sharp performance drop of the model,and the interference coming from image backgrounds also affects the further improvement in recognition accuracy.Some methods have been proposed to recover the occluded regions in images for overcoming the abovementioned issues.Specifi⁃cally,these methods restore the occluded parts by utilizing the unobstructed image information at the image level.How⁃ever,the restoration approaches may cause image distortion and introduce an excessive number of parameters.Method We propose a person ReID method based on pose guidance and multi-scale feature fusion to alleviate the aforementioned issues.This method can enhance the feature representation capability of the model and obtain more discriminative fea⁃tures.First,a feature restoration module is constructed to restore the occluded image features at the feature level while effectively reducing the parameters of the model.The module uses spatial contextual information from the non-occluded regions to predict the features of adjacent occluded regions,which restores the semantic information of the occluded regions in the feature space.The feature restoration module mainly consists of two subparts:the adaptive region division unit and the feature restoration one.The adaptive region division unit divides the image into six regions adaptively according to the predicted localization points to facilitate the clustering of similar feature information in different regions.The adaptive divi⁃sion in the module could effectively alleviate the misalignment caused by fixed division methods,and it could achieve more accurate position alignment.The feature restoration unit comprises of an encoder and a decoder.The encoder encodes the feature information coming from the divided regions of the image with similar appearances or close positions into a cluster.Meanwhile,the decoder assigns the cluster information to the occluded body parts in the image,which completes the fea⁃ture restoration of missing body parts.Second,a pose estimation network is employed to extract pedestrian pose informa⁃tion.The pose estimation network is responsible for guiding the generation of keypoint heatmaps for the restored complete image features.Then,it implements the prediction of body keypoints with the heatmaps to obtain pose information.The pretrained pose estimation guidance model performs fusion learning on the global non-occluded regions and the restored regions to obtain more distinctive pedestrian feature information for more accurate pedestrian matching.Finally,a feature enhancement module is proposed to extract salient features from the image for eliminating the interference coming from background information while enhancing the learning capability for effective information.This module not only makes the network pay close attention to the valid semantic information in the feature maps but also reduces the interference coming from background noises,which could effectively alleviate the failure of feature learning caused by occlusion.Result We conducted several comparative experiments and ablation experiments on three publicly available datasets to validate the effectiveness of our method.We employed mean average precision(mAP)and Rank-1 accuracy as our evaluation metrics.Experiment results demonstrate that our method achieves mAP and Rank-1 of 88.8%and 95.5%on the Market1501 data⁃set,respectively.The mAP and Rank-1 are 79.2%and 89.3%,respectively,on the Duke multi-tracking multi-camera ReID(DukeMTMC-reID)dataset.On the occluded Duke multi-tracking multi-camera re-recognition(OccludedDukeMTMC)dataset,the mAP and Rank-1 can reach 51.7%and 60.3%,respectively.Moreover,our method outper⁃forms the PGMA-Net by 0.4%in mAP on the Market1501 dataset,by 0.8%in mAP and 0.7%in Rank-1 on the DukeMTMC-reID dataset,and by 1.2%in mAP on the Occluded-DukeMTMC dataset.At the same time,the ablation experiments confirm the effectiveness of the three proposed modules.Conclusion Our proposed method,pose-guided and multi-scale feature fusion(PGMF),could effectively recover the features of missing body parts,alleviate the issue of back⁃ground interference,and achieve accurate pedestrian matching.Therefore,the proposed model effectively alleviates the misidentification caused by occlusion,improves the accuracy of person ReID,and exhibits robustness.

作者张红颖刘腾飞罗谦张涛 Zhang Hongying;Liu Tengfei;Luo Qian;Zhang Tao(College of Electronic Information and Automation,Civil Aviation University of China,Tianjin 300300,China;Civil Aviation Electronic Technology Co.,Ltd.,Chengdu 610041,China)

机构地区中国民航大学电子信息与自动化学院民航成都电子技术有限责任公司

出处《中国图象图形学报》 CSCD 北大核心 2024年第8期2364-2376,共13页 Journal of Image and Graphics

基金国家自然科学基金民航联合研究基金重点支持项目(U2133211)。

关键词行人重识别(ReID) 遮挡姿态引导特征融合特征修补 person re-identification(ReID) occlusion pose guidance feature fusion feature restoration

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1张永飞,杨航远,张雨佳,豆朝鹏,廖胜才,郑伟诗,张史梁,叶茫,晏轶超,李俊杰,王生进.行人再识别技术研究进展[J].中国图象图形学报,2023,28(6):1829-1862. 被引量：3
2李擎,胡伟阳,李江昀,刘艳,李梦璇.基于深度学习的行人重识别方法综述[J].工程科学学报,2022,44(5):920-932. 被引量：7

二级参考文献19

1熊炜,熊子婕,杨荻椿,童磊,刘敏,曾春艳.基于深层特征融合的行人重识别方法[J].计算机工程与科学,2020,42(2):358-364. 被引量：5
2杨婉香,严严,陈思,张小康,王菡子.基于多尺度生成对抗网络的遮挡行人重识别方法[J].软件学报,2020,31(7):1943-1958. 被引量：18
3张晓伟,吕明强,李慧.基于局部语义特征不变性的跨域行人重识别[J].北京航空航天大学学报,2020,46(9):1682-1690. 被引量：6
4熊炜,杨荻椿,熊子婕,童磊,李利荣,王娟.基于全局特征拼接的行人重识别算法研究[J].计算机应用研究,2021,38(1):316-320. 被引量：8
5史维东,张云洲,刘双伟,朱尚栋,暴吉宁.针对形变与遮挡问题的行人再识别[J].中国图象图形学报,2020,25(12):2530-2540. 被引量：7
6徐龙壮,彭力,朱凤增.多任务金字塔重叠匹配的行人重识别方法[J].计算机工程,2021,47(1):239-245. 被引量：6
7董亚超,刘宏哲,徐成.基于显著性多尺度特征协作融合的行人重识别方法[J].计算机工程,2021,47(6):234-244. 被引量：9
8杨晓峰,张来福,王志鹏,萨旦姆,邓红霞,李海芳.基于胶囊网络的跨域行人再识别[J].计算机工程与科学,2021,43(9):1591-1599. 被引量：1
9任雪娜,张冬明,包秀国,李冰.语义引导的遮挡行人再识别注意力网络[J].通信学报,2021,42(10):106-116. 被引量：6
10刘乾,王洪元,曹亮,孙博言,肖宇,张继.基于联合损失胶囊网络的换衣行人重识别[J].计算机应用,2021,41(12):3596-3601. 被引量：3

共引文献8

1蒋原,李擎,苗磊,吕萌,武建文,陈明轩.多电飞机断路器电弧机理及灭弧技术研究综述[J].工程科学学报,2023,45(4):611-620. 被引量：2
2朱利,林欣,徐亦飞,刘真,马英.基于城市信息单元和差异注意力的多层行人重识别技术[J].集成技术,2023,12(1):91-104. 被引量：1
3石昌森.视频监控中的行人重识别方法评述[J].科技创新导报,2022,19(29):106-110.
4张红颖,王徐泳,彭晓雯.结合前景分割的多特征融合行人重识别[J].中国图象图形学报,2023,28(5):1360-1371.
5余文涛,赵倩,季堂煜.基于颜色随机化和全相关注意力的跨模态行人重识别[J].国外电子测量技术,2023,42(6):10-16.
6杨盼盼,马凌飞,平阳,索雅丽.移动AR+VR支持下跨媒体视频关键帧还原仿真[J].微型电脑应用,2024,40(3):32-36.
7张甲鹏,李佳欣,王清瑜,武慧真,窦育民,赵利敏.基于视频监控的考研教室动态播报系统[J].无线互联科技,2024,21(4):94-98.
8孙弋洋.基于三维空间的多行人重识别方法[J].数字通信世界,2024(7):61-63.

1张嘉辉,赵威,王子琛,蒙志君.基于检测和重识别的无人机行人跟踪算法[J].北京航空航天大学学报,2024,50(8):2538-2546.
2刘志刚,王淼,刘苗苗.基于姿态引导特征增强的遮挡行人重识别[J].计算机技术与发展,2024,34(4):89-94.
3丁梦磊.光学遥感图像中舰船识别方法研究[J].舰船科学技术,2024,46(16):143-147.

中国图象图形学报

2024年第8期

浏览历史

内容加载中请稍等...

融合姿态引导和多尺度特征的遮挡行人重识别

参考文献2

二级参考文献19

共引文献8

相关作者

相关机构

相关主题

浏览历史