Abstract
To address the tracking failures of the fully-convolutional Siamese network tracker (SiamFC) in complex scenes involving target deformation, occlusion, and fast motion, this paper proposes SiamMA, an algorithm that uses a mixed attention mechanism to strengthen the network's discriminative ability. First, a stacking-and-cropping method is introduced in the training stage to build self-adversarial training sample pairs that simulate the complex scenes encountered during actual tracking, so that the trained model generalizes better. Second, a mixed attention mechanism is proposed that combines spatial attention and channel attention modules in different branches of the network, which effectively suppresses background interference in the feature maps and improves the robustness of the algorithm. Performance is evaluated on four datasets, including GOT-10k and UAV123. The results show that the proposed algorithm improves on six classic algorithms, including SiamFC and KCF, in the main performance metrics such as tracking success rate and precision, and runs at an average speed of 60 fps.
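As a rough illustration of the idea of applying different attention modules to different branches of a Siamese tracker, the NumPy sketch below gates a template feature map with a channel attention module and a search feature map with a spatial attention module, then cross-correlates them as SiamFC does. Everything here is an assumption for illustration: the abstract does not say which module goes on which branch or how the modules are built, so the squeeze-and-excitation-style channel gate, the parameter-free spatial gate, the random weights, and all function names are hypothetical and are not the authors' SiamMA design.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation-style channel gate (assumed design). feat: (C, H, W)."""
    squeezed = feat.mean(axis=(1, 2))          # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)    # bottleneck + ReLU -> (C // r,)
    gate = sigmoid(w2 @ hidden)                # per-channel weight in (0, 1)
    return feat * gate[:, None, None]

def spatial_attention(feat):
    """Parameter-free spatial gate (assumed design): sigmoid of the
    standardized channel-wise mean map. feat: (C, H, W)."""
    m = feat.mean(axis=0)                      # (H, W)
    m = (m - m.mean()) / (m.std() + 1e-6)
    return feat * sigmoid(m)[None, :, :]

def cross_correlate(z, x):
    """SiamFC-style response map: slide template features z over search features x."""
    _, hz, wz = z.shape
    _, hx, wx = x.shape
    out = np.empty((hx - hz + 1, wx - wz + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(z * x[:, i:i + hz, j:j + wz])
    return out

# Random stand-ins for backbone features and attention weights (hypothetical sizes).
rng = np.random.default_rng(0)
C, r = 8, 2
z = rng.standard_normal((C, 6, 6))      # template-branch features
x = rng.standard_normal((C, 22, 22))    # search-branch features
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1

z_att = channel_attention(z, w1, w2)    # channel attention on the template branch
x_att = spatial_attention(x)            # spatial attention on the search branch
response = cross_correlate(z_att, x_att)
peak = np.unravel_index(np.argmax(response), response.shape)
```

The location of `peak` in the response map is what a SiamFC-style tracker would read as the estimated target position. In a trained network the attention weights would be learned jointly with the backbone, not drawn at random as in this sketch.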
Authors
冯琪堯
张惊雷
FENG Qi-yao; ZHANG Jing-lei (School of Electrical Engineering and Automation, Tianjin University of Technology, Tianjin 300384; Tianjin Key Laboratory for Control Theory and Applications in Complicated Systems, Tianjin 300384, China)
Source
《计算机工程与科学》
CSCD
Peking University Core Journal (北大核心)
2022, No. 2, pp. 276-282 (7 pages)
Computer Engineering & Science
Keywords
object tracking
siamese network
mixed attention mechanism
self-adversarial training sample pairs