期刊文献+

基于时空语义信息的视频运动目标交互行为识别方法 被引量:6

Moving-Objects Interaction Recognition Based on the Spatial-Temporal Semantic Information
原文传递
导出
摘要 提出一种融合时间及目标之间空间语义信息的视频运动目标交互行为识别方法,即基于目标之间空间语义的变化规律识别其交互行为类别。不同于传统的语义事件建模方法,首先根据运动目标跟踪结果,基于其运动方向以及建立目标之间的空间关系(拓扑关系和方向关系)模型,提出一种提取人目标之间空间语义(前面、后面、背对、面对以及左右)的方法;然后基于空间语义的变化规律建立随机文法规则;最后采用随机文法器识别九种常见的两人交互行为。该方法无需训练样本,实验结果验证了方法的有效性及优越性。 A method for recognizing human-human interaction is proposed based on the spatial-temporal semantic information.Different from traditional methods to model the interactions,this framework achieves recognizing activities based on the transformation of the spatial semantic meaning.First,with detection and tracking results,the spatial semantic meaning(front,back,face to face,back to back,and left or right) between the persons are extracted based on motion directions and spatial relationships(including topological and directional relations).Then,stochastic context-free grammar is used to recognize interactions that the rules are learned based on the transformation of spatial semantics.Extensive experiments have been executed to validate the effectiveness of the proposed approach,and the method can recognize the interactions without additional training.
出处 《光学学报》 EI CAS CSCD 北大核心 2012年第5期145-151,共7页 Acta Optica Sinica
基金 国家973计划(2010CB327900) 国家自然科学基金(61001176)资助课题
关键词 机器视觉 交互行为识别 空间关系 空间语义 随机文法 machine vision interaction recognition spatial relationship spatial semantic meaning stochastic context-free grammar
  • 相关文献

参考文献24

  • 1G. Lavee, E. Rivlin, M. Rudzsky. Understanding video events:a survey of methods for automatic interpretation of semantic occurrences in video [J]. IEEE Trans. Systems, Man, and Cybernetics, Part C : Applications and Reviews, 2009, 39 (5) : 4894504.
  • 2M. S. Ryoo, J. K. Aggarwal. Human activity analysis: a review [J]. ACM Comhuter Surveys, 2011, 43(3): 16.
  • 3黎洪松,李达.人体运动分析研究的若干新进展[J].模式识别与人工智能,2009,22(1):70-78. 被引量:38
  • 4J. W. Davis, A. F. Bobick. The representation and recognition of human movement using temporal templates [ C]. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'97), Washington, DC, USA, 1997: 928.
  • 5M. S. Ryoo, J. K. Aggarwal. Spatio-temporal relationship Match: video structure comparison for recognition of complex human activities[C]. In Proceedings of the IEEE International Conference on Computer Vision (ICCV'09), Kyoto, Japan, Oct 2009.
  • 6S. Hongeng, R. Nevatia, F. Bremond. Video-based event recognition: activity representation and probabilistic recognition methods [ J]. Computer Vision and Image Understanding (CVIU), 2004, 96(2) : 129-162.
  • 7N. M. Oliver, B. Rosario, A. P. Pentland. A Bayesian computer vision system for modeling human interactions [J]. IEEE Trans. Pattern Analysis and Machine Intelligence, 2000,22(8): 831-843.
  • 8S. Park, J. K. Aggarwal. A hierarchical Bayesian network for event recognition of human actions and interactions [ J ]. Multimedia System, 2004, 10(2) : 164-179.
  • 9P. Natarajan, R. Nevatia. Coupled hidden semi Markov models for activity recognition [C]. In IEEE Workshop on Motion and Video Computing (WMVC'07), Austin, TX, USA, 2007.
  • 10A. Galata, A. G. Cohn, D. Mageeet a/. Modeling interaction using learned qualitative spatio-temporal relations and variable length Markov models [C]. Proceeding of the 15th European Conference on Artificial Intelligence (ECAI'02), 2002: 741-745.

二级参考文献169

  • 1黄士科,陶琳,张天序.一种改进的基于光流的运动目标检测方法[J].华中科技大学学报(自然科学版),2005,33(5):39-41. 被引量:17
  • 2刘贵喜,邵明礼,刘先红,朱东波.真实场景下视频运动目标自动提取方法[J].光学学报,2006,26(8):1150-1155. 被引量:32
  • 3杜友田,陈峰,徐文立,李永彬.基于视觉的人的运动识别综述[J].电子学报,2007,35(1):84-90. 被引量:79
  • 4Appleton B, Talbot H. Globally Minimal Surfaces by Continuous Maximal Flows. IEEE Trans on Pattern Analysis and Machine Intelligence, 2006, 28(1) : 106 -118
  • 5Boykov Y, Jolly M P. Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in n-d Images//Proc of the 8th International Conference on Computer Vision. Vancouver, Canada, 2001, I : 105-112
  • 6Criminisi A, Cross G, Blake A, et al. Bilayer Segmentation of Live Video // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA, 2006: 53 -60
  • 7Kim M, Choi J G, Kim D. A VOP Generation Tool: Automatic Segmentation of Moving Objects in Image Sequences Based on Spario-Temporal Information. IEEE Trans on Circuits and Systems for Video Technology, 1998, 9(8): 1216-1226
  • 8Collins R T, Lipton A J, Kanade T, et al. A System for Video Surveillance and Monitoring: VSAM Report. Technical Report, CMURI-TR-00-12, Pittsburg, USA: Carnegie Mellon University. Robotics Institute, 2000
  • 9Migliore D A, Matteucci M, Naccari M. A Revaluation of Frame Difference in Fast and Robust Motion Detection// Proc of the 4th ACM International Workshop on Video Surveillance and Sensor Networks. Santa Barbara, USA, 2006:215 -218
  • 10Barton J L, Fleet D J, Beauchemin S S, et al. Performance of Optical Flow Techniques. International Journal of Computer Vision, 1994, 12(1) : 42 -77

共引文献112

同被引文献94

  • 1宋枫溪,杨静宇,刘树海,张大鹏.基于多类最大散度差的人脸表示方法[J].自动化学报,2006,32(3):378-385. 被引量:17
  • 2王向军,王研,李智.基于特征角点的目标跟踪和快速识别算法研究[J].光学学报,2007,27(2):360-364. 被引量:48
  • 3Guofeng Zou, Wang Kejun, Yuan Lei, et al.. New research advances in facial expression recognition [C]. the IEEE 25th Chinese Control and Decision Conference ( CCDC), Guiyang, 2013: 3403-3409.
  • 4H Abdi, L J Williams. Principal component analysis [J]. Wiley Interdisciplinary Reviews: Computational Statistics, 2010, 2(4): 433-459.
  • 5J B Tenenbaum, V De Silva, J C Langford. A global geometric framework for nonlinear dimensionality reduction [J]. Science, 2000, 290(5500): 2319-2323.
  • 6M Belkin, P Niyogi. Laplaeian eigenmaps and spectral techniques for embedding and clustering [C]. NIPS, 2001, 14: 585-591.
  • 7X He, D Cai, S Yah, el al.. Neighborhood preserving embedding [C]. Tenth IEEE International Conference on Computer Vision, 2005, 2: 1208-1213.
  • 8X He, S Yan, Y Hu, et al.. Face recognition using laplacianfaces [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(3) : 328-340.
  • 9Y Chang, C Hu, M Turk. Manifold of facial expression [C]. AMFG, 2003: 28-35.
  • 10C C Chang, C J Lin. LIBSVM:a library for support vector machines [J]. ACM Transactions on Intelligent Systems and Technology (TIST), 2011, 2(3): 1-30.

引证文献6

二级引证文献66

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部