Fusion of HCRF and AAM Highlight Events Detection in Soccer Videos (融合HCRF和AAM的足球视频精彩事件检测)

Cited by: 3

Abstract: Highlight event detection has high academic research value and broad market prospects in the field of sports video semantic analysis. Exploiting the strength of the hidden conditional random field (HCRF) model in representing and recognizing semantic events, this paper proposes a highlight event detection method that fuses HCRF with an affective arousal model (AAM). First, through an analysis of the structural semantics of highlight event videos, 13 kinds of multi-modal semantic clues are defined to accurately describe the semantic information that highlight events carry. Second, building on concept-lattice-based clustering of the multi-modal semantic clues, time-domain feature information is added to construct a feature-weighted affective arousal model, which yields the affective arousal values of each type of highlight event. Finally, with only a small number of training samples, an HCRF model for detecting the various highlight events is established. Based on the mapping among video semantic shot sequences, affective arousal value sequences, and highlight events, the model mines the underlying regularities of highlight events along several dimensions, including multi-modal semantic clues, video structure semantics, and affective semantics, and detects all types of highlight events simultaneously within a single HCRF model. Experiments demonstrate the effectiveness of the method.
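The abstract names a feature-weighted affective arousal model but does not give its features, weights, or normalization. The following is only a minimal sketch of the general idea of a weighted arousal score per shot, not the authors' formulation. The function names, the feature names (`audio_energy`, `motion_intensity`), the min-max normalization, and the weights are all assumptions made for illustration.

```python
from typing import Dict, List


def min_max_normalize(values: List[float]) -> List[float]:
    """Scale a feature sequence to [0, 1]; a constant sequence maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def arousal_sequence(
    features: Dict[str, List[float]],
    weights: Dict[str, float],
) -> List[float]:
    """Return one arousal value per shot as a weighted mean of normalized features.

    `features` maps a (hypothetical) feature name to its per-shot values;
    `weights` maps the same names to non-negative weights.
    """
    lengths = {len(vals) for vals in features.values()}
    if len(lengths) != 1:
        raise ValueError("all feature sequences must have the same length")
    total_weight = sum(weights[name] for name in features)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")

    normalized = {name: min_max_normalize(vals) for name, vals in features.items()}
    n_shots = lengths.pop()
    return [
        sum(weights[name] * normalized[name][t] for name in normalized) / total_weight
        for t in range(n_shots)
    ]


# Illustrative values only: four shots, two assumed features.
example_features = {
    "audio_energy": [0.2, 0.5, 0.9, 0.4],
    "motion_intensity": [1.0, 3.0, 5.0, 2.0],
}
example_weights = {"audio_energy": 2.0, "motion_intensity": 1.0}
example_arousal = arousal_sequence(example_features, example_weights)
```

In the method described above, a sequence of this kind would accompany the semantic shot sequence as input to the HCRF; how the two are combined is not specified in the abstract, so that step is left out of the sketch.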

Authors: Tong Ming (同鸣), Ding Liwei (丁力伟), Ji Chenglong (姬成龙)

Affiliation: School of Electronic Engineering, Xidian University (西安电子科技大学电子工程学院)

Source: Journal of Computer Research and Development (《计算机研究与发展》), 2014, No. 1, pp. 225-236 (12 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (61072110); Natural Science Foundation of Shaanxi Province (SJ08F15)

Keywords: video semantic analysis; event detection; hidden conditional random field; affective semantics; semantic annotation; concept lattice

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
