基于时空语义信息的视频运动目标交互行为识别方法被引量：6

Moving-Objects Interaction Recognition Based on the Spatial-Temporal Semantic Information

导出

摘要提出一种融合时间及目标之间空间语义信息的视频运动目标交互行为识别方法,即基于目标之间空间语义的变化规律识别其交互行为类别。不同于传统的语义事件建模方法,首先根据运动目标跟踪结果,基于其运动方向以及建立目标之间的空间关系(拓扑关系和方向关系)模型,提出一种提取人目标之间空间语义(前面、后面、背对、面对以及左右)的方法;然后基于空间语义的变化规律建立随机文法规则;最后采用随机文法器识别九种常见的两人交互行为。该方法无需训练样本,实验结果验证了方法的有效性及优越性。 A method for recognizing human-human interaction is proposed based on the spatial-temporal semantic information.Different from traditional methods to model the interactions,this framework achieves recognizing activities based on the transformation of the spatial semantic meaning.First,with detection and tracking results,the spatial semantic meaning（front,back,face to face,back to back,and left or right） between the persons are extracted based on motion directions and spatial relationships（including topological and directional relations）.Then,stochastic context-free grammar is used to recognize interactions that the rules are learned based on the transformation of spatial semantics.Extensive experiments have been executed to validate the effectiveness of the proposed approach,and the method can recognize the interactions without additional training.

作者金标胡文龙王宏琦

机构地区中国科学院电子学研究所中国科学院空间信息处理与应用系统技术重点实验室中国科学院研究生院

出处《光学学报》 EI CAS CSCD 北大核心 2012年第5期145-151,共7页 Acta Optica Sinica

基金国家973计划(2010CB327900) 国家自然科学基金(61001176)资助课题

关键词机器视觉交互行为识别空间关系空间语义随机文法 machine vision interaction recognition spatial relationship spatial semantic meaning stochastic context-free grammar

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献24

1G. Lavee, E. Rivlin, M. Rudzsky. Understanding video events:a survey of methods for automatic interpretation of semantic occurrences in video [J]. IEEE Trans. Systems, Man, and Cybernetics, Part C : Applications and Reviews, 2009, 39 (5) : 4894504.
2M. S. Ryoo, J. K. Aggarwal. Human activity analysis: a review [J]. ACM Comhuter Surveys, 2011, 43(3): 16.
3黎洪松,李达.人体运动分析研究的若干新进展[J].模式识别与人工智能,2009,22(1):70-78. 被引量：38
4J. W. Davis, A. F. Bobick. The representation and recognition of human movement using temporal templates [ C]. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'97), Washington, DC, USA, 1997: 928.
5M. S. Ryoo, J. K. Aggarwal. Spatio-temporal relationship Match: video structure comparison for recognition of complex human activities[C]. In Proceedings of the IEEE International Conference on Computer Vision (ICCV'09), Kyoto, Japan, Oct 2009.
6S. Hongeng, R. Nevatia, F. Bremond. Video-based event recognition: activity representation and probabilistic recognition methods [ J]. Computer Vision and Image Understanding (CVIU), 2004, 96(2) : 129-162.
7N. M. Oliver, B. Rosario, A. P. Pentland. A Bayesian computer vision system for modeling human interactions [J]. IEEE Trans. Pattern Analysis and Machine Intelligence, 2000,22(8): 831-843.
8S. Park, J. K. Aggarwal. A hierarchical Bayesian network for event recognition of human actions and interactions [ J ]. Multimedia System, 2004, 10(2) : 164-179.
9P. Natarajan, R. Nevatia. Coupled hidden semi Markov models for activity recognition [C]. In IEEE Workshop on Motion and Video Computing (WMVC'07), Austin, TX, USA, 2007.
10A. Galata, A. G. Cohn, D. Mageeet a/. Modeling interaction using learned qualitative spatio-temporal relations and variable length Markov models [C]. Proceeding of the 15th European Conference on Artificial Intelligence (ECAI'02), 2002: 741-745.

二级参考文献169

1黄士科,陶琳,张天序.一种改进的基于光流的运动目标检测方法[J].华中科技大学学报（自然科学版）,2005,33(5):39-41. 被引量：17
2刘贵喜,邵明礼,刘先红,朱东波.真实场景下视频运动目标自动提取方法[J].光学学报,2006,26(8):1150-1155. 被引量：32
3杜友田,陈峰,徐文立,李永彬.基于视觉的人的运动识别综述[J].电子学报,2007,35(1):84-90. 被引量：79
4Appleton B, Talbot H. Globally Minimal Surfaces by Continuous Maximal Flows. IEEE Trans on Pattern Analysis and Machine Intelligence, 2006, 28(1) : 106 -118
5Boykov Y, Jolly M P. Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in n-d Images//Proc of the 8th International Conference on Computer Vision. Vancouver, Canada, 2001, I : 105-112
6Criminisi A, Cross G, Blake A, et al. Bilayer Segmentation of Live Video // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA, 2006: 53 -60
7Kim M, Choi J G, Kim D. A VOP Generation Tool: Automatic Segmentation of Moving Objects in Image Sequences Based on Spario-Temporal Information. IEEE Trans on Circuits and Systems for Video Technology, 1998, 9(8): 1216-1226
8Collins R T, Lipton A J, Kanade T, et al. A System for Video Surveillance and Monitoring: VSAM Report. Technical Report, CMURI-TR-00-12, Pittsburg, USA: Carnegie Mellon University. Robotics Institute, 2000
9Migliore D A, Matteucci M, Naccari M. A Revaluation of Frame Difference in Fast and Robust Motion Detection// Proc of the 4th ACM International Workshop on Video Surveillance and Sensor Networks. Santa Barbara, USA, 2006:215 -218
10Barton J L, Fleet D J, Beauchemin S S, et al. Performance of Optical Flow Techniques. International Journal of Computer Vision, 1994, 12(1) : 42 -77

共引文献112

1明瑞玲,李峰.一种面向互动投影的多目标跟踪方法[J].无线通信技术,2012,21(2):48-53.
2阮涛涛,姚明海,瞿心昱,楼中望.基于视觉的人体运动分析综述[J].计算机系统应用,2011,20(2):245-254. 被引量：26
3李刚,闫宗群,何永强,陆旭光.全向凝视红外多目标处理系统[J].激光与光电子学进展,2011,48(6):99-104. 被引量：3
4尹建芹,田国会,姜海涛,周风余.面向家庭服务的人体动作识别[J].四川大学学报（工程科学版）,2011,43(4):101-107. 被引量：7
5徐从富,郝春亮,苏保君,楼俊杰.马尔可夫逻辑网络研究[J].软件学报,2011,22(8):1699-1713. 被引量：8
6金标,胡文龙,王宏琦.基于多级跟踪队列的运动目标跟踪遮挡处理[J].光学学报,2011,31(8):211-218. 被引量：5
7吴联世,夏利民,罗大庸.人的交互行为识别与理解研究综述[J].计算机应用与软件,2011,28(11):60-63. 被引量：9
8孙锦红,刘卫东,马亮,杨伟蕾.基于机器视觉的3D人体动作识别研究[J].计算机与现代化,2011(11):86-89. 被引量：4
9汤泽胜,王兆仲.单帧图像人体姿态估计综述[J].计算机工程与科学,2011,33(11):89-97. 被引量：7
10Cai Limei Qian Jiansheng.A method for detecting miners based on helmets detection in underground coal mine videos[J].Mining Science and Technology,2011,21(4):553-556.

同被引文献94

1宋枫溪,杨静宇,刘树海,张大鹏.基于多类最大散度差的人脸表示方法[J].自动化学报,2006,32(3):378-385. 被引量：17
2王向军,王研,李智.基于特征角点的目标跟踪和快速识别算法研究[J].光学学报,2007,27(2):360-364. 被引量：48
3Guofeng Zou, Wang Kejun, Yuan Lei, et al.. New research advances in facial expression recognition [C]. the IEEE 25th Chinese Control and Decision Conference ( CCDC), Guiyang, 2013: 3403-3409.
4H Abdi, L J Williams. Principal component analysis [J]. Wiley Interdisciplinary Reviews: Computational Statistics, 2010, 2(4): 433-459.
5J B Tenenbaum, V De Silva, J C Langford. A global geometric framework for nonlinear dimensionality reduction [J]. Science, 2000, 290(5500): 2319-2323.
6M Belkin, P Niyogi. Laplaeian eigenmaps and spectral techniques for embedding and clustering [C]. NIPS, 2001, 14: 585-591.
7X He, D Cai, S Yah, el al.. Neighborhood preserving embedding [C]. Tenth IEEE International Conference on Computer Vision, 2005, 2: 1208-1213.
8X He, S Yan, Y Hu, et al.. Face recognition using laplacianfaces [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(3) : 328-340.
9Y Chang, C Hu, M Turk. Manifold of facial expression [C]. AMFG, 2003: 28-35.
10C C Chang, C J Lin. LIBSVM:a library for support vector machines [J]. ACM Transactions on Intelligent Systems and Technology (TIST), 2011, 2(3): 1-30.

引证文献6

1柯佳,詹永照,陈潇君,汪满容.基于格框架的视频事件时空相关描述分析方法[J].系统仿真学报,2015,27(4):770-778.
2李雅倩,李颖杰,李海滨,张强,张文明.融合全局与局部多样性特征的人脸表情识别[J].光学学报,2014,34(5):164-170. 被引量：29
3蔡加欣,冯国灿,汤鑫,罗志宏.基于局部轮廓和随机森林的人体行为识别[J].光学学报,2014,34(10):204-213. 被引量：28
4蔡加欣,冯国灿,汤鑫,罗志宏.基于姿势字典学习的人体行为识别[J].光学学报,2014,34(12):173-184. 被引量：9
5张旭光,刘春霞,左佳倩.基于因果网络分析的小规模人群行为识别[J].光学学报,2015,35(8):177-181. 被引量：5
6谭程午,夏利民,王嘉.基于融合特征的群体行为识别[J].计算机技术与发展,2018,28(1):17-22.

二级引证文献66

1尚雪莲,秦健勇.MEF融合HFF的戏剧视频关键情节自动提取[J].电视技术,2015,39(8):50-54.
2周琳,杨娜.基于自适应二叉树算法的图像划痕检测研究[J].激光与光电子学进展,2015,52(5):65-70. 被引量：4
3朱二莉,彭波,刘志中.基于自适应鲁棒在线度量学习的面部表情识别[J].电视技术,2015,39(11):77-82.
4谢昭,童昊浩,孙永宣,吴克伟.一种仿生物视觉感知的视频轮廓检测方法[J].自动化学报,2015,41(10):1814-1824. 被引量：5
5胡敏,程轶红,王晓华,任福继,许良凤,黄晓音.基于非对称局部梯度编码的人脸表情识别[J].中国图象图形学报,2015,20(10):1313-1321. 被引量：5
6褚龙现,刘建芳,马丽.面部表情识别中基于TTL的特定个体学习模型[J].电视技术,2015,39(21):99-103.
7蒋加伏,赵怡.局部证据RBF人体行为高层特征自相似融合识别研究[J].计算技术与自动化,2015,34(4):95-100.
8罗元,张天,张毅.一种改进的LDP面部表情特征提取方法[J].半导体光电,2016,37(1):122-125. 被引量：5
9王晓华,黄伟,金超,胡敏,任福继.多特征多分类器优化匹配的人脸表情识别[J].光电工程,2016,43(3):73-79. 被引量：5
10刘娟,胡敏,黄忠.基于最优支持度的证据融合表情识别方法[J].电子测量与仪器学报,2016,30(5):714-721. 被引量：8

1唐四薪,周勇,邹赛.基于词汇化随机文法模型的RNA二级结构预测[J].计算机工程与科学,2009,31(3):128-131. 被引量：4
2陈昌红,张杰,刘峰.双人交互行为的稀疏表征方法[J].模式识别与人工智能,2016,29(5):464-471. 被引量：3
3周思超,夏利民.基于稠密轨迹聚类的人体交互行为识别[J].采矿技术,2016,16(4):77-83.
4陈平.基于随机文法的骨架结构化表示[J].西昌学院学报（自然科学版）,2008,22(4):47-49.
5唐四薪,周勇,易胤.随机文法模型在RNA二级结构预测中的应用[J].生物数学学报,2008,23(4):735-742. 被引量：2
6王方石.L-系统在植物模拟中的应用[J].北方交通大学学报,1998,22(3):45-48. 被引量：14
7宋昭,李芬.基于专家系统的公式识别器的实现[J].计算机工程,2005,31(13):38-39. 被引量：1
8唐四薪,赵辉煌,周勇.RNA二级结构预测:基于半监督学习的随机文法模型方法[J].计算机与应用化学,2013,30(9):1038-1042. 被引量：1
9甘玲,谷伟庆.组合金字塔和多核学习的图像分类方法[J].小型微型计算机系统,2014,35(7):1642-1646. 被引量：2
10吴联世,夏利民,罗大庸.人的交互行为识别与理解研究综述[J].计算机应用与软件,2011,28(11):60-63. 被引量：9

光学学报

2012年第5期

浏览历史

内容加载中请稍等...

基于时空语义信息的视频运动目标交互行为识别方法被引量：6

参考文献24

二级参考文献169

共引文献112

同被引文献94

引证文献6

二级引证文献66

相关作者

相关机构

相关主题

浏览历史

基于时空语义信息的视频运动目标交互行为识别方法 被引量：6

参考文献24

二级参考文献169

共引文献112

同被引文献94

引证文献6

二级引证文献66

相关作者

相关机构

相关主题

浏览历史

基于时空语义信息的视频运动目标交互行为识别方法被引量：6