期刊文献+

融合HCRF和AAM的足球视频精彩事件检测 被引量:3

Fusion of HCRF and AAM Highlight Events Detection in Soccer Videos
下载PDF
导出
摘要 精彩事件检测在体育视频语义分析领域具有很高的学术研究价值和广泛的市场应用前景.利用隐条件随机场(hidden conditional random field,HCRF)模型在表达和识别语义事件方面的强大功能,创新性地提出了一种融合了HCRF和情感激励模型(affective arousal model,AAM)的精彩事件检测方法.首先,通过精彩事件视频结构语义分析,定义了13种多模态语义线索,以准确描述精彩事件富含的语义信息;其次,在基于概念格的多模态语义线索聚类基础上,添加时域特征信息,以构建特征值加权的情感激励模型,得到了各类精彩事件的情感激励值;最后,在小规模训练样本情况下,有效建立了各类精彩事件检测的HCRF模型,基于视频语义镜头序列、情感激励值序列和精彩事件之间的映射关系,从多模态语义线索、视频结构语义、情感语义等多个维度挖掘了精彩事件的潜在规律,实现了同一HCRF模型下各类精彩事件的同时检测.实验证明了该方法的有效性. Highlight event detection in soccer videos has high academic research value and wide market application prospect in the field of sport video semantic analysis. Based on the powerful expression o{ hidden conditional random field (HCRF) model in the expression and identification of semantic event, a fusion HCRF and affective arousal model (AAM) framework for highlight event detection is put forward. Firstly, through the analysis of the structural semantics of the wonderful event video, thirteen kinds of multi-modal semantic clues are defined to accurately describe the included semantic information of the wonderful events. Secondly, on the clustering foundation of the multi-modal semantic clues by concept lattice, time-domain features are added to establish an affective arousal model based on feature weight coefficient, and then the affective arousal value of the different kinds of highlights events is calculated. Finally, the above observed sequence is used as HCRF model input in the case of small-scale training samples, and a wonderful event detection HCRF model is effectively established based on the mapping relationship between the sequences of video semantic shots, affective arousal values and the highlight events. The inherent laws of the wonderful events are excavated from multiple dimensions like multi-modal semantic clues, video structure semantics, and affective semantics. The detection of wonderful events is simultaneously achieved by using the same HCRF model. Experimental results show the effectiveness of this paper.
出处 《计算机研究与发展》 EI CSCD 北大核心 2014年第1期225-236,共12页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61072110) 陕西省自然科学基金项目(SJ08F15)
关键词 视频语义分析 事件检测 隐条件随机场 情感语义 语义标注 概念格 video semantic analysis event detection hidden conditional random field affectivesemantic semantic annotation concept lattice
  • 相关文献

参考文献22

  • 1Lavee G, Rivlin E, Rudzsky M. Understanding video events a survey of methods for automatic interpretation of semantic occurrence in video[J].IEEE Trans on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 2009, 39 (5) : 489-504.
  • 2Shih H C, Huang Chunglin. MSN: Statistical understanding of broadcasted baseball video using multi-level semantic network [J]. IEEE Trans on Broadcasting, 2005, 51 (4): 449-459.
  • 3Hung M H, Hsieh C H. Event detection of broadcast baseball videos [J]. IEEE Trans on Circuits and Systems for Video Technology, 2008, 18(12): 1713-1726.
  • 4Wang Fei, Ma Yufei, Zhang Hongjiang, et al. A generic framework for semantic sports video analysis using dynamic Bayesian networks[C] //Proc of the llth Int Multimedia Modeling Conf. Piscataway, NJ: IEEE, 2005:115-122.
  • 5Huang Chunglin, Shih H C, Chao Chungyuan. Semantic analysis of soccer video using dynamic Bayesian network [J]. IEEE Trans on Multimedia, 2006, 8(4), 749-760.
  • 6Cheng C C, Hsu C T. Fusion of audio and motion information on HMM-based highlight extraction for baseball games [J]. IEEE Trans on Multimedia, 2006, 8(3): 585- 599.
  • 7Ding Yi, Fan Guoliang. Sports video mining via multichannel segmental hidden Markov models [J]. IEEE Trans on Multimedia, 2009, 11(7): 1301-1309.
  • 8张玉珍,丁思捷,王建宇,戴跃伟,陈钱.基于HMM的融合多模态的事件检测[J].系统仿真学报,2012,24(8):1638-1642. 被引量:4
  • 9Sadlier D A, O'Connor N E. Event detection in field sports video using audio-visual features and a support vector machine [J]. IEEE Trans on Circuits and Systems for Video Technology, 2005, 15(10): 1225-1233.
  • 10Xu Changsheng, Zhang Yifan, Zhu Guangyu, et al. Using webeast text :or semantic event detection in broadcast sports video [J]. IEEE Trans on Multimedia, 2008, 10(7) : 1342- 1355.

二级参考文献25

  • 1彭培华,曲波,陈荣胜.基于支持向量机的小波域视频字幕检测与提取[J].华南理工大学学报(自然科学版),2004,32(z1):63-66. 被引量:4
  • 2刘宇驰,吴玲达.基于HMM的足球视频语义结构分析[J].计算机工程与应用,2006,42(28):174-176. 被引量:1
  • 3刘宇驰,栾悉道,戴端辉,吴玲达.多模态体育视频语义分析[J].计算机科学,2007,34(1):109-111. 被引量:6
  • 4金国英,陶霖密,徐光,张翔.基于HHMM的多线索融合和事件推理方法[J].清华大学学报(自然科学版),2007,47(1):112-115. 被引量:4
  • 5Hanjalic A.Adaptive extraction of highlights from a sport video based on excitement modeling[J].IEEE Trans on Multimedia,2005,7(6):1114-1122.
  • 6Hanjalic A,Xu Liqun.Affective video content representation and modeling[J].IEEE Trans on Multimedia,2005,7(1):143-154.
  • 7Dietz R B,Lang A.Affective agents:Effects of agent affection on arousal,attention,liking and learning[C]//Proc of the 3rd Int Cognitive Technology Conf.Piscataway,NJ:IEEE,1999:151-156.
  • 8J Y Chen, Y H Li, L D Wu, S Y Lao. Semantic event detection in soccer video by integrating multi-features using Bayesian network [C]// Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speed Proceeding, 2004, Oct.
  • 9Y Yang, S X Lin, Y D Zhang, et al. Highlights extraction in soccer avideos based on goal_mouth detection [C]//IEEE Proc. ISSPA 2007. USA: IEEE, 2007: 1-4.
  • 10Hang Y, Fong S.An experimental comparison of decision trees in traditional data mining and data stream mining[C]// 6th Intemational Conference on Advanced Information Management and Service,2010.

共引文献11

同被引文献37

  • 1TRIPATHI V, MINU E. An Improved Algorithm (KPCA) For Face Recognition[J]. Digital Image Processing, 2012, 4 (1): 27-32.
  • 2VAN I M H, BAKERMANS-KRANENBURG M Jtrust: meta-analysis of the effects of intranasal oxytocin adminis- tration on face recognition, trust to in-group, and trust to out-group[J]. Psychoneuroendocrinology, 2012, 37(3): 438-443.
  • 3YANG A Y, ZHOU Z, BALASUBRAMANIAN A G, et al. Fast-minimization algorithms for robust face recognition[J]. IEEE Trans. Image Processing, 2013, 22(8): 3234-3246.
  • 4HANJALIC A, XU L Q. Affective video content representation and modeling[J]. IEEE Trans. Multimedia,2005, 7(1) : 143-154.
  • 5POTAPOV D, DOUZE M, HARCHAOUI Z, et al. Computer Vi- sion-ECCV [M]. [S.l.] : Springer International Publishing, 2014.
  • 6ACAR E. Learning representations for affective video understand- ing[C]// Proc. the 21st ACM International Conference on Multi- media. [S.l.]:ACM Press, 2013 : 1055-1058.
  • 7TIMMERS R, CROOK H. Affective priming in music listening: emotions as a source of musical expectation[J]. Music Percep- tion: An Interdisciplinary Journal, 2014, 31(5): 470-484.
  • 8LARTILLOT O, TOIVIAINEN P, EEROLa T. A matlab toolbox for music information retrieval[M]. Berlin:Springer, 2008.
  • 9HASAN H, ABDUL-KAREEM S. Fingerprint image enhance- ment and recognition algorithms: a survey[J]. Neural Computing and Applications, 2013, 23(6): 1605-1610.
  • 10CHAN C H, TAHIR M A, KITTLER J, et al. Muhiscale local phase quantization for robust component-based face recognition using kernel fusion of multiple descriptors[J]. IEEE Trans. Pat- tern Analysis and Machine Intelligence, 2013, 35 (5) : 1164-1177.

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部