RGBT Tracking via a Multi-Layer Attention Mechanism

Abstract: Exploiting the complementary information in thermal infrared and visible-light data can effectively improve the robustness of visual tracking in complex environments. However, most existing methods extract single-modal features independently during feature extraction, overlooking the important role that multi-layer feature modeling plays in accurately locating the target. To address this problem, this paper proposes an RGBT tracking method based on a multi-layer attention mechanism. First, multi-modal image pairs are fed into a backbone network to extract deep features of the two modalities, and a modality attention module is introduced at each layer of feature extraction to filter out inaccurate multi-modal information, enabling effective multi-level, multi-modal feature modeling. In addition, to suppress noise and redundant information in the fused multi-modal features, a modality fusion module is proposed, which further performs adaptive fusion of the multi-modal features and yields more discriminative multi-modal features. Experiments on two public datasets show that the proposed method achieves high accuracy and fast tracking on the RGBT tracking task.
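The two ideas named in the abstract, attention applied to each modality's features and adaptive fusion of the two modalities, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the internals of the modality attention or modality fusion modules, so the sigmoid channel gate, the energy-based softmax weights, and the function names `channel_attention` and `adaptive_fusion` are all assumptions chosen only for illustration.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(features):
    """Hypothetical attention step: gate each channel by its mean activation.

    `features` is a list of channels; each channel is a flat list of floats.
    """
    gated = []
    for channel in features:
        # Global average pooling over the channel, squashed into a (0, 1) gate.
        gate = sigmoid(sum(channel) / len(channel))
        gated.append([gate * v for v in channel])
    return gated


def adaptive_fusion(rgb, thermal):
    """Hypothetical fusion step: softmax weights from each modality's mean energy."""

    def energy(features):
        values = [v for channel in features for v in channel]
        return sum(v * v for v in values) / len(values)

    e_rgb, e_t = energy(rgb), energy(thermal)
    m = max(e_rgb, e_t)  # subtract the max for numerical stability
    w_rgb, w_t = math.exp(e_rgb - m), math.exp(e_t - m)
    total = w_rgb + w_t
    w_rgb, w_t = w_rgb / total, w_t / total
    # Weighted element-wise sum of the two modalities' feature maps.
    fused = [
        [w_rgb * a + w_t * b for a, b in zip(ch_rgb, ch_t)]
        for ch_rgb, ch_t in zip(rgb, thermal)
    ]
    return fused, (w_rgb, w_t)


# Toy inputs: two channels of two values each per modality (made-up numbers).
rgb = [[1.0, 1.0], [0.0, 0.0]]
thermal = [[0.0, 0.0], [0.0, 0.0]]
fused, weights = adaptive_fusion(channel_attention(rgb), channel_attention(thermal))
```

In this toy case the thermal input carries no signal, so the fusion weight for the RGB branch comes out larger. In a real tracker the gates and weights would presumably be learned rather than computed from fixed statistics.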

Authors: WU Yi; ZHAI Sulan; LIU Lei (School of Mathematical Sciences, Anhui University, Hefei 230601, China; Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, Hefei 230601, China)

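Affiliations: School of Mathematical Sciences, Anhui University; Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University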

Source: Journal of Anqing Normal University (Natural Science Edition), 2024, No. 2, pp. 77-83 (7 pages)

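Funding: National Natural Science Foundation of China (62076003); Open Project of the School of Mathematical Sciences, Anhui University (KF2019A03).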

Keywords: RGBT tracking; feature modeling; attention mechanism; multi-modal fusion features; adaptive fusion

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
