Journal article

Two-Stream Adaptive Attention Graph Convolutional Networks for Action Recognition

Cited by: 3
Abstract: Human action recognition has received much attention in computer vision because of its important role in public safety. However, when fusing the neighborhood features of multi-scale nodes, existing graph convolutional networks usually sum the adjacency matrices of each order directly, attaching the same importance to every term; this makes it difficult to focus on important features and hinders the establishment of optimal node relationships. In addition, the common two-stream fusion method, which simply averages the predictions of different models, ignores differences in the underlying data distributions, so the fusion effect is poor. To this end, this paper proposes a two-stream adaptive attention graph convolutional network for human action recognition. First, a multi-order adjacency matrix with adaptively balanced weights is designed so that the model focuses on the more important neighborhoods. Second, a multi-scale spatio-temporal self-attention module and a channel attention module are designed to enhance the feature-extraction capability of the model. Finally, a two-stream fusion network is proposed that uses the data distribution of the two streams' predictions to determine the fusion coefficients, improving the fusion effect. The recognition accuracy of the algorithm reaches 92.3% and 97.5% on the cross-subject and cross-view subsets of NTU RGB+D, respectively, and 39.8% on the Kinetics-Skeleton dataset, all higher than existing algorithms, demonstrating the superiority of the proposed algorithm for human action recognition.
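The abstract names two ideas without giving formulas: replacing the direct sum of multi-order adjacency matrices with an adaptively weighted sum, and replacing the plain average of two-stream predictions with coefficients derived from the prediction distributions. The NumPy sketch below is a hypothetical illustration of both under stated assumptions; the function names, the softmax-normalized weights, and the negative-entropy confidence measure are this sketch's inventions, not the paper's actual method.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def fuse_multi_order_adjacency(A, max_order=3, alpha_logits=None):
    """Weighted fusion of 1st..max_order-hop adjacency, instead of a plain sum.

    alpha_logits stands in for learnable parameters; uniform before training.
    """
    n = A.shape[0]
    if alpha_logits is None:
        alpha_logits = np.zeros(max_order)
    alpha = softmax(alpha_logits)
    powers, Ak = [], np.eye(n)
    for _ in range(max_order):
        Ak = Ak @ A
        powers.append((Ak > 0).astype(float))  # binarized k-hop reachability
    return sum(a * P for a, P in zip(alpha, powers))

def fuse_two_streams(p_joint, p_bone):
    """Distribution-aware fusion: weight each stream by the confidence
    (negative entropy) of its softmax prediction, not a plain average."""
    def neg_entropy(p):
        return np.sum(p * np.log(p + 1e-12))
    w = softmax(np.array([neg_entropy(p_joint), neg_entropy(p_bone)]))
    return w[0] * p_joint + w[1] * p_bone
```

On a 3-joint chain graph, the fused matrix connects joints 0 and 2 (a second-order neighborhood) even though the raw adjacency does not, while the softmax weights keep each order's contribution distinct rather than uniformly summed.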
Authors: DU Qiliang; XIANG Zhaoyi; TIAN Lianfang; YU Lubin (School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, Guangdong, China; China-Singapore International Joint Research Institute, South China University of Technology, Guangzhou 510555, Guangdong, China; Key Laboratory of Autonomous Systems and Network Control of the Ministry of Education, South China University of Technology, Guangzhou 510640, Guangdong, China; Research Institute of Modern Industrial Innovation, South China University of Technology, Zhuhai 519170, Guangdong, China)
Source: Journal of South China University of Technology (Natural Science Edition) (indexed in EI, CAS, CSCD, Peking University Core), 2022, Issue 12, pp. 20-29 (10 pages)
Funding: Special Fund for Marine Economic Development of Guangdong Province (GDNRC[2020]018); Key-Area Research and Development Program of Guangdong Province (2019B020214001, 2018B010109001); Guangzhou Major Program of Industrial Technology (2019-01-01-12-1006-0001); Fundamental Research Funds for the Central Universities, South China University of Technology (2018KZ05); Graduate Education Reform Project of South China University of Technology (zysk2018005)
Keywords: action recognition; graph convolutional network; adjacency matrix; attention; two-stream fusion
