期刊文献+

交互关系超图卷积模型的双人交互行为识别

Two-person interaction recognition based on the interactive relationship hypergraph convolution network model
下载PDF
导出
摘要 为提高学校、商场等公共场所的安全性,实现对监控视频中的偷窃、抢劫和打架斗殴等异常双人交互行为的自动识别,针对现有基于关节点数据的行为识别方法在图的创建中忽略了2个人之间的交互信息,且忽略了单人非自然连接关节点间的交互关系的问题,提出一种基于交互关系超图卷积模型用于双人交互行为的建模与识别。首先针对每一帧的关节点数据构建对应的单人超图以及双人交互关系图,其中超图同时使多个非自然连接节点信息互通,交互关系图强调节点间交互强度。将以上构建的图模型送入时空图卷积对空间和时间信息分别建模,最后通过SoftMax分类器得到识别结果。该算法框架的优势是在图的构建过程中加强考虑双人的交互关系、非自然连接点间结构关系以及四肢灵活的运动特征。在NTU数据集上的测试表明,该算法得到了97.36%的正确识别率,该网络模型提高了双人交互行为特征的表征能力,取得了比现有模型更好的识别效果。 To enhance the security of schools,shopping malls,and other public places,it is important to achieve automatic identification of abnormal two-person interaction behaviors,such as stealing,robbing,fighting,and assaulting,in surveillance videos.However,the current behavior recognition method based on joint data in graph creation neglects the two-person interaction information as well as the interaction relationship between the single unnatural connection joints.To address this issue,a two-person interaction behavior recognition model based on the interactive relationship hypergraph convolution network is proposed to model and identify human interactions.First,the corresponding single hypergraph and two-person interaction graph are created for the joint-point data of each frame,where the hypergraph makes the information of multiple unnaturally connected nodes interchangeable at the same time,and the interaction graph emphasizes the interaction strength between nodes.The above-constructed graph models are fed into the spatiotemporal graph convolution to model the spatial and temporal information separately,and lastly,the recognition results are acquired by the SoftMax classifier.The benefits of the proposed algorithm framework are that the interactive relationship between two persons,the structural relationship between unnatural connections,and the flexible motion characteristics of limbs are regarded in the graph construction process.Tests on the NTU data set demonstrate that the algorithm attains a correct recognition rate of 97.36%.The findings indicate that the network model enhances the ability to represent the characteristics of two-person interaction and has better recognition performance than the current models.
作者 代金利 曹江涛 姬晓飞 DAI Jinli;CAO Jiangtao;JI Xiaofei(School of Information and Control Engineering,Liaoning Petrochemical University,Fushun 113001,China;School of Automa-tion,Shenyang Aerospace University,Shenyang 110136,China)
出处 《智能系统学报》 CSCD 北大核心 2024年第2期316-324,共9页 CAAI Transactions on Intelligent Systems
基金 国家自然科学基金项目(61673199) 辽宁省科技公益研究基金项目(2016002006)。
关键词 双人交互 行为识别 关节点数据 深度学习 时空图卷积网络 超图 图结构 神经网络 two-person interaction behavior recognition skeleton node data deep learning ST-GCN hypergraph graph structure neural networks
  • 相关文献

参考文献4

二级参考文献33

  • 1杜友田,陈峰,徐文立,李永彬.基于视觉的人的运动识别综述[J].电子学报,2007,35(1):84-90. 被引量:79
  • 2Aggarwal J K, Cai Q. Human motion analysis: A review [J]. Computer Vision and Image Understanding, 1999,73 (3) :428 - 440.
  • 3Moeslund Thomas B, Hilton Adrian, Krllger Volker. A survey of advances in vision-based human motion capture and analysis [ J ]. Computer Vision and Image Understanding, 2006, 104(3) :90 -126.
  • 4Turaga P, Chellappa R, Subrahmanian V S,et al. Machine recognition of human activities:A survey [J]. IEEE Transactions on Circuits and Systems for Video Technology ,2008,18 (11) :1473 -1488.
  • 5Moore D J, Essa I A, Hayes M H. Exploiting human actions and object context for recognition tasks[ A ]. In IEEE International Conference on Computer Vision ( ICCV ) , 1999:80 - 86.
  • 6Peursum P, West G, Venkatesh S. Combining image regions and human activity for indirect object recognition in indoor wide-angle views [C]// IEEE International Conference on Computer Vision (ICCV) ,2005.
  • 7Gupta A, Davis L S. Objects in action : An approach for combining action understanding and object perception [ C ]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ,2007.
  • 8Ryoo M S, Aggarwal J K. Hierarchical recognition of human activities interacting with objects[ C]//2nd International Workshop on Semantic Learning Applications in Multimedia (SLAM), Proceedings of CVPR, 2007.
  • 9Oliver N M, Rosario B, Pentland A P. A Bayesian computer vision system for modeling human interactions[ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22 (8) : 831 - 843.
  • 10Ivanov Y A, Bobick A F. Recognition of visual activities and interactions by stochastic parsing[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,22 ( 8 ) : 852 - 872.

共引文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部