Cross-modality nearest neighbor loss for visible-infrared person re-identification
Abstract: Visible-infrared cross-modality person re-identification takes an image of a specific person captured in one modality and retrieves images of the same person from an image set captured by cameras of the other modality. Because the imaging mechanisms differ, images from the two modalities exhibit a pronounced modality gap. To address this, the loss function is improved from a metric-learning perspective so that the learned features are more discriminative. The cohesiveness of image features is analyzed theoretically, and on that basis a re-identification method built on cohesiveness analysis and a cross-modality nearest neighbor loss function is proposed to strengthen the cohesiveness of samples from different modalities. The similarity measurement of cross-modality hard samples is transformed into the similarity measurement of cross-modality nearest-neighbor sample pairs and same-modality sample pairs, which makes the network's optimization of modality cohesiveness more efficient and stable. The method is validated experimentally on a baseline network with global feature representation and on a baseline network with partial feature representation. Compared with the baseline methods, it improves the average accuracy of visible-infrared person re-identification by up to 8.44%, which demonstrates that the method generalizes across network architectures. It also achieves reliable cross-modality person re-identification at only a small cost in model complexity and computation.
Authors: ZHAO Sanyuan (赵三元); A Qi (阿琪); GAO Yu (高宇) (School of Computer Science & Technology, Beijing Institute of Technology, Beijing 100081, China; Yangtze Delta Region Academy, Beijing Institute of Technology, Jiaxing 314019, China)
Source: Journal of Beijing University of Aeronautics and Astronautics (《北京航空航天大学学报》), 2024, No. 2, pp. 433-441 (9 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61902027).
Keywords: visible-infrared person re-identification; metric learning; deep learning; cross-modality learning; computer vision
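
The abstract states that the similarity measurement of cross-modality hard samples is replaced by that of cross-modality nearest-neighbor pairs and same-modality pairs, but it does not give the loss formula. The sketch below is therefore only one possible reading, not the authors' implementation: it assumes Euclidean distance, toy 2-D features, and a hypothetical helper `cross_modality_nn_terms`. It shows that, for one anchor, the distance to the hardest cross-modality positive is bounded by the distance to the nearest cross-modality positive plus a same-modality distance (the triangle inequality).

```python
import math

def dist(a, b):
    """Euclidean distance between two feature vectors (an assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cross_modality_nn_terms(anchor, same_id_other_modality):
    """Hypothetical helper: return (hard, nearest, intra) for one anchor.

    hard    : distance to the farthest same-identity sample in the other modality
    nearest : distance to the nearest same-identity sample in the other modality
    intra   : distance, within the other modality, between those two samples
    """
    dists = [dist(anchor, f) for f in same_id_other_modality]
    i_near = min(range(len(dists)), key=dists.__getitem__)
    i_hard = max(range(len(dists)), key=dists.__getitem__)
    intra = dist(same_id_other_modality[i_near], same_id_other_modality[i_hard])
    return dists[i_hard], dists[i_near], intra

# Made-up toy features: one visible anchor and three infrared samples
# that are assumed to show the same person.
visible_anchor = [0.0, 0.0]
infrared_same_id = [[1.0, 0.0], [4.0, 3.0], [2.0, 2.0]]

hard, nearest, intra = cross_modality_nn_terms(visible_anchor, infrared_same_id)

# Surrogate objective: nearest cross-modality pair plus same-modality pair.
# By the triangle inequality it upper-bounds the hard cross-modality distance.
surrogate = nearest + intra
print(f"hard={hard:.3f} nearest={nearest:.3f} intra={intra:.3f} surrogate={surrogate:.3f}")
```

Under this reading, shrinking the two surrogate terms also shrinks the hard cross-modality distance, while each term is individually smaller or confined to a single modality. Whether the paper's loss takes this exact form cannot be determined from the abstract.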