期刊文献+

基于多层注意力机制的RGBT目标跟踪

RGBT Tracking via a Multi-Layer Attention Mechanism
下载PDF
导出
摘要 挖掘热红外和可见光数据的互补信息,可以有效提升复杂环境下视觉跟踪的鲁棒性。然而大多数方法在特征提取过程中只是独立提取单模态特征,忽略了多层特征建模对准确定位目标位置的重要作用。针对上述问题,本文提出了基于多层注意力机制的RGBT目标跟踪方法。首先,将多模态图片对输入骨干网络以提取两种模态的深度特征,同时在各层特征提取中引入模态注意力模块,用于过滤不准确的多模态信息,有效实现多层次多模态特征建模。此外,为了抑制多模态融合特征中的噪声和冗余信息,本文提出了模态融合模块,并利用该模块进一步实现多模态特征的自适应融合,从而获得更具有判别性的多模态特征。在两个公开数据集上的实验表明,本文方法在RGBT目标跟踪任务上实现了高精度和快速跟踪。 Extracting the complementary information of infrared and visible light data can effectively improve the robustness of visual tracking in complex environments.However,in the process of feature extraction,most of these methods only extract single-modal features independently,ignoring the important role of multi-layer feature modeling in accurately locating the target position.Aiming at the above problems,this paper proposes an RGBT tracking based on a multi-layer attention mechanism.Firstly,the depth features of two modalities are extracted from the input backbone network of multi-modal images,and at the same time,the modality attention module is introduced into each layer of feature extraction to filter inaccurate multi-modal information,thus realizing effective multi-level and multi-modal feature modeling.In addition,to suppress the noise and redundant information in multi-modal fusion features,a modality fusion module is proposed to further realize the adaptive fusion of multi-modal features and obtain more discriminating multi-modal features.Experiments on two public datasets show that the proposed method generates higher tracking accuracy and speed.
作者 吴毅 翟素兰 刘磊 WU Yi;ZHAI Sulan;LIU Lei(School of Mathematical Sciences,Anhui University,Hefei 230601,China;Anhui Provincial Key Laboratory of Multi Modal Cognitive Computation Anhui University,Hefei 230601,China)
出处 《安庆师范大学学报(自然科学版)》 2024年第2期77-83,共7页 Journal of Anqing Normal University(Natural Science Edition)
基金 国家自然科学基金(62076003) 安徽大学数学学院开放课题(KF2019A03)。
关键词 RGBT目标跟踪 特征建模 注意力机制 多模态融合特征 自适应融合 RGBT tracking feature model attention mechanism multi-modal fusion features adaptive fusion
  • 相关文献

参考文献1

二级参考文献211

  • 1Ta D N, Chen W C, Gelfand N, Pulli K. Surftrac: efficient tracking and continuous object recognition using local feature descriptors. In: Pro?ceedings of IEEE Conference on Computer Vision and Pattern Recog?nition.2009,2937-2944.
  • 2Skrypnyk I, Lowe D G. Scene modelling, recognition and tracking with invariant image features. In: Proceedings of IEEE and ACM In?ternational Symposium on Mixed and Augmented Reality. 2004, 110- 119.
  • 3Chau D P, Bremond F, Thonnat M. Object tracking in videos: ap?proaches and issues. 2013, arXiv preprint arXiv: 1304.5212.
  • 4Ko T. A survey on behavior analysis in video surveillance for home?land security applications. In: Proceedings of the 37th IEEE Applied Imagery Pattern Recognition Workshop. 2008,1-8.
  • 5Ess A, Schindler K, Leibe B, Van Gool L. Object detection and track?ing for autonomous navigation in dynamic environments. The Interna?tional Journal of Robotics Research, 2010, 29: 1707-1725.
  • 6Mistry P, Maes P. SixthSense: a wearable gestural interface. In: Pro?ceedings of ACM SIGGRAPH ASIA 2009 Sketches. 2009, II.
  • 7Bradski G R. Real time face and object tracking as a component of a perceptual user interface. In: Proceedings of the 4th IEEE Workshop on Applications of Computer Vision. 1998, 214-219.
  • 8Zhu Z, Ji Q. Eye gaze tracking under natural head movements. In: Pro?ceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2005, 918-923.
  • 9Kim I, Choi H S, Yi K M, Choi J Y, Kong S G. Intelligent visual surveillance - a survey. International Journal of Control, Automation and Systems, 2010, 8(5): 926-939.
  • 10Siemens S. Sistore CX EDS-intelligent video detection system. Tech?nical Report. 2008.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部