
Dual branch feature fusion network based gait recognition algorithm (cited by: 7)
Abstract: Objective: Among gait recognition algorithms, appearance-based methods achieve high accuracy and are easy to implement but are sensitive to appearance changes, while model-based methods are more robust to appearance changes but are difficult to build and less accurate. To obtain high accuracy while remaining robust to appearance changes, a dual-branch network that fuses appearance features and pose features is proposed, combining the advantages of both approaches. Method: The dual-branch model contains an appearance branch and a pose branch. The appearance branch uses the GaitSet network to extract appearance features from silhouette images; the pose branch uses a five-layer convolutional network to extract pose features from pose skeletons. On this basis, a feature fusion module is built to merge the two kinds of features; a channel attention mechanism allows features of arbitrary size to be fused, and the module's structure suppresses noise in the features during fusion. The fused gait features are finally used to identify pedestrians. Result: Experiments on the CASIA-B (Institute of Automation, Chinese Academy of Sciences, Gait Dataset B) dataset compare the algorithm with current mainstream gait recognition algorithms under cross-view and cross-walking-condition settings, with Rank-1 accuracy as the evaluation metric. Under the MT (medium-sample training) partition of the cross-view setting, the algorithm reaches accuracies of 93.4%, 84.8%, and 70.9% on the three walking conditions, improvements of 1.4%, 0.5%, and 8.4% over the second-best algorithm; under the cross-walking-condition setting it reaches 94.9% and 90.0% on the two walking conditions, the best performance. Conclusion: In scenarios where both appearance data and pose data can be acquired, the algorithm effectively fuses appearance and pose information, obtaining richer gait features while reducing the influence of appearance changes, and improves gait recognition performance.

Objective: Gait, a person's walking pattern, is one of the key biometric features for person identification. As a non-contact, long-distance way to capture identity information, gait recognition has been applied in video surveillance and public security. Gait recognition algorithms fall into two mainstreams: appearance-based methods and model-based methods. Appearance-based methods commonly extract gait from a sequence of silhouette images, but they are affected by appearance changes such as non-rigid clothing deformation and background clutter. In contrast, model-based methods commonly leverage body structure or motion priors to model the gait pattern and are more robust to appearance variations. In practice, however, it is challenging to identify a universal model for gait description, and previous pre-defined models are constrained to certain scenarios. Recent model-based methods use deep-learning-based pose estimation to model the key points of the human body, but the estimated poses contain redundant noise introduced by the pose estimators and by occlusion. In summary, appearance-based methods are based on visual feature description, while model-based methods describe motion and structure at a semantic level. We aim to design a novel gait recognition approach that goes beyond the two existing families of methods and improves recognition ability by combining appearance features and pose features.

Method: We design a dual-branch network for gait recognition. The input data are fed into the two branches to extract appearance features and pose features respectively, and the two kinds of features are then merged into the final gait feature by a feature fusion module. In detail, we adopt the GaitSet network as the appearance branch to extract appearance features from silhouette images, and design a two-stream convolutional neural network (CNN) to extract pose features from pose key points based on position information and motion information. Meanwhile, a squeeze-and-excitation feature fusion module (SEFM) is designed to merge the two kinds of features by learning their weights. In the squeeze step, appearance feature maps and pose feature maps are integrated via pooling, concatenation, and projection. In the excitation step, weighted feature maps of appearance and pose are obtained via projection and the Hadamard product. The two kinds of feature maps are then down-sampled and concatenated into the final gait feature with adaptive weighting. To further study the roles of the appearance features and pose features, we design two variants of SEFM: SEFM-A and SEFM-P. The SEFM module merges appearance features and pose features mutually; the SEFM-A module merges pose features into appearance features while the pose features pass through unchanged; the SEFM-P module merges appearance features into pose features while the appearance features pass through unchanged. Our algorithm is implemented in PyTorch and evaluated on the CASIA (Institute of Automation, Chinese Academy of Sciences) Gait Dataset B (CASIA-B). We adopt the AlphaPose algorithm to extract pose key points from the original RGB videos and use the provided silhouette images. In each iteration of the training process, we randomly select 16 subjects and then 8 random samples of each subject, and every sample contains a sub-sequence of 30 frames, so each batch has 3840 image-skeleton pairs. We adopt the Adam optimizer and train for 60000 iterations. The initial learning rate is set to 0.0002 for the pose branch and 0.0001 for the appearance branch and the SEFM, and the learning rate is divided by 10 at the 45000th iteration.

Result: We first verify the effectiveness of the dual-branch network and the feature fusion modules. The results show that the dual-branch network improves performance and that there is a clear complementary effect between appearance features and pose features. The Rank-1 accuracies of the five feature fusion modules SEFM, SEFM-A, SEFM-P, concatenation, and the multi-modal transfer module (MMTM) are 83.5%, 81.9%, 93.4%, 92.6%, and 79.5%, respectively. These results indicate that appearance features are more discriminative because pose features contain noise, and that SEFM-P is able to suppress this noise while merging the two kinds of features. We then compare our method with advanced gait recognition methods including CNNs, event-based gait recognition (EV-Gait), GaitSet, and PoseGait. We conduct experiments under two protocols and evaluate the Rank-1 accuracy in three walking scenarios: normal walking, bag-carrying, and coat-wearing. Our method achieves the best performance under all experimental protocols. Under protocol 1, the Rank-1 accuracies in the three scenarios reach 93.4%, 84.8%, and 70.9%; under protocol 2 they reach 95.7%, 87.8%, and 77.0%, respectively. Compared with the second-best method, GaitSet, the Rank-1 accuracies in the coat-wearing scenario are improved by 8.4% and 6.6%.

Conclusion: We propose a novel gait recognition network based on the fusion of appearance features and pose features. The results demonstrate that our method exploits both kinds of features and is more robust to appearance variations, especially in the clothing-change scenario.
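The squeeze-and-excitation fusion described in the Method section can be sketched roughly as follows. This is a minimal NumPy illustration, not the paper's implementation: random matrices stand in for the learned projections, and the reduction ratio, ReLU bottleneck, and layer shapes are assumptions for illustration only. It shows the two steps named in the abstract: squeeze (global pooling, concatenation, projection) and excitation (per-branch projection, sigmoid gating, Hadamard product), which lets feature maps of arbitrary spatial size be fused.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sefm(appearance, pose, reduction=4, seed=0):
    """Sketch of a squeeze-and-excitation feature fusion module (SEFM).

    appearance: (C_a, H_a, W_a) appearance feature map
    pose:       (C_p, H_p, W_p) pose feature map
    Random matrices stand in for the learned projection weights.
    """
    rng = np.random.default_rng(seed)
    c_a, c_p = appearance.shape[0], pose.shape[0]
    # Squeeze: global average pooling collapses each map to a channel
    # descriptor, so maps of different spatial sizes can be concatenated.
    z = np.concatenate([appearance.mean(axis=(1, 2)), pose.mean(axis=(1, 2))])
    # Joint projection to a bottleneck (stand-in for a learned FC layer).
    d = (c_a + c_p) // reduction
    w1 = rng.standard_normal((d, c_a + c_p)) * 0.1
    h = np.maximum(w1 @ z, 0.0)  # ReLU
    # Excitation: per-branch projections produce channel-wise gates in (0, 1).
    w_a = rng.standard_normal((c_a, d)) * 0.1
    w_p = rng.standard_normal((c_p, d)) * 0.1
    g_a = sigmoid(w_a @ h)[:, None, None]
    g_p = sigmoid(w_p @ h)[:, None, None]
    # Hadamard product re-weights each branch, down-weighting noisy channels.
    return appearance * g_a, pose * g_p

# Hypothetical feature-map sizes, chosen only to demonstrate the shapes.
fused_a, fused_p = sefm(np.ones((64, 16, 11)), np.ones((32, 8, 8)))
```

In the paper, the two gated maps are then down-sampled and concatenated into the final gait feature; the SEFM-A and SEFM-P variants correspond to gating only one branch while the other passes through unchanged.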
Authors: Xu Shuo; Zheng Feng; Tang Jun; Bao Wenxia (School of Electronics and Information Engineering, Anhui University, Hefei 230601, China; College of Engineering, Southern University of Science and Technology, Shenzhen 518055, China)
Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2022, No. 7, pp. 2263-2273 (11 pages)
Funding: National Natural Science Foundation of China (61772032); National Key R&D Program of China (SQ2018YFC080102); Anhui Provincial Key R&D Program (202004a7020050).
Keywords: biometric recognition; gait recognition; feature fusion; two-branch network; squeeze-and-excitation module; human body pose estimation; gait silhouette images
