基于跨模态近邻损失的可视-红外行人重识别

Cross-modality nearest neighbor loss for visible-infrared person re-identification

下载PDF

导出

摘要可视-红外跨模态行人重识别任务的目标是给定一个模态的特定人员图像,在其他不同模态摄像机所拍摄的图像集中进行检索,找出相同人员对应的图像。由于成像方式不同,不同模态的图像之间存在明显的模态差异。为此,从度量学习的角度出发,对损失函数进行改进以获取具有更加辨别性的信息。对图像特征内聚性进行理论分析,并在此基础上提出一种基于内聚性分析和跨模态近邻损失函数的重识别方法,以加强不同模态样本的内聚性。将跨模态困难样本的相似性度量问题转化为跨模态最近邻样本对和同模态样本对的相似性度量,使得网络对模态内聚性的优化更加高效和稳定。对所提方法在全局特征表示的基线网络和部分特征表示的基线网络上进行实验验证结果表明:所提方法对可视-红外行人重识别的预测结果相较于基线方法,平均准确度最高可提升8.44%,证明了方法在不同网络架构中的通用性;同时,以较小的模型复杂度和较低的计算量为代价,实现了可靠的跨模态行人重识别结果。 The goal of the visual-infrared person re-identification task is to search the image of a specific person in a given modality in the image set taken by other cameras in different modality to find out the corresponding image of the same person.Due to the different imaging methods,there are obvious modal differences between images of different modalities.Therefore,from the perspective of metric learning,the loss function is improved to obtain more discriminative information.The cohesiveness of image features is analyzed theoretically,and a re-recognition method based on cohesiveness analysis and cross-modal nearest neighbor loss function is proposed to strengthen the cohesiveness of different modal samples.The similarity measurement problem of cross-modal hard samples is transformed into the similarity measurement of cross-modal nearest neighbor sample pairs and the same modality sample pairs,which makes the optimization of modal cohesion of the network more efficient and stable.The proposed method is experimentally verified on the baseline networks of global feature representation and partial feature representation.Compared with the baseline method,the proposed method can improve the average accuracy of the visual and infrared person re-identification by up to 8.44%.The universality of the proposed method in different network architectures is proved.Moreover,at the cost of less model complexity and less computation,the reliable visual-infrared person re-identification results are achieved.

作者赵三元阿琪高宇 ZHAO Sanyuan;A Qi;GAO Yu(School of Computer Science&Technology,Beijing Institute of Technology,Beijing 100081,China;Yangtze Delta Region Academy,Beijing Institute of Technology,Jiaxing 314019,China)

机构地区北京理工大学计算机学院北京理工大学长三角研究院

出处《北京航空航天大学学报》 EI CAS CSCD 北大核心 2024年第2期433-441,共9页 Journal of Beijing University of Aeronautics and Astronautics

基金国家自然科学基金(61902027)。

关键词可视-红外行人重识别度量学习深度学习跨模态学习计算机视觉 visible-infrared person re-identification metric learning deep learning cross-modality learning computer vision

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1俞昆,程玉虎,邢镔,王雪松.基于双级对齐部分迁移网络的旋转设备故障诊断[J].电子学报,2023,51(12):3529-3539.
2申小虎,朱翔宇,史洪飞,王传之.基于机器学习鸟声识别算法研究进展[J].生物多样性,2023,31(11):164-189.

北京航空航天大学学报

2024年第2期

浏览历史

内容加载中请稍等...

基于跨模态近邻损失的可视-红外行人重识别

相关作者

相关机构

相关主题

浏览历史