期刊文献+

基于Transformer的行人重识别网络

A person re-identification network based on transformer
下载PDF
导出
摘要 针对行人重识别中水平切片方法由于分块特征感受野之间存在交叉重叠带来的分块数量限制问题,提出一种基于Transformer的行人重识别网络结构。首先,输入图像经过CNN网络提取中间特征图,并将特征图进行分块,对每块特征进一步切分成像素级token向量;然后,对各像素级token向量展平并加入位置编码和全局token向量,输入Transformer IN编码器中;接着,对得到的全局token向量进一步加入分类token向量和位置编码后,输入Transformer OUT编码器,得到最终的编码器输出;最后,取分类token向量并加上全连接后,利用softmax和交叉熵损失对行人进行分类。在Market-1501、Duke MTMC-re ID数据集上的实验结果表明,本方法能够更细粒度地提取特征,并利用Transformer的全局把控能力,进一步提高了切片的数量和分类的精度。 Aiming at the limitation of the number of blocks caused by overlapping and overlapping of block feature sensing fields in horizontal slicing-based person re-identification method,a person re-identification network structure CNN with INOUT_Transformer(CIT)based on Transformer was proposed.First of all,the input image was extracted from the middle feature image through CNN network,and the feature image was divided into blocks,and each piece of feature was further cut into pixel-level token vector.Then,each pixel level token vector was flattened and the position encoding and global token vector were added,which were input into the TransformerI N encoder.Then,the global token vector was further added into the classified token vector and position encoding,and then input into the Transformer OUT to obtain the final encoder output.Finally,after taking the classification token vector and adding the fully connected layer,the pedestrian was classified by Softmax and cross entropy loss.Experimental results on Market-1501 and Duke MTMC-reI D datasets show that the proposed method can extract features more fine-grained,and further improve the number of slices and classification accuracy by utilizing Transformer's global control ability.
作者 莫建文 莫伦麟 MO Jianwen;MO Lunlin(School of Information and Communication,Guilin University of Electronic Technology,Guilin 541004,China)
出处 《桂林电子科技大学学报》 2023年第3期195-201,共7页 Journal of Guilin University of Electronic Technology
基金 国家自然科学基金(62001133,62177012,61967005) 桂林电子科技大学研究生教育创新计划(2021YCXS026)。
关键词 深度学习 行人重识别 TRANSFORMER 自注意力 deep learning person re-identificaiton transformer self-attention
  • 相关文献

参考文献2

二级参考文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部