足球比赛精彩场景的自动分析与提取

Automatic Analysis and Extraction of Soccer Highlights

下载PDF

导出

摘要提出了基于MPEG压缩域音频流的足球比赛精彩场景自动分析与提取算法首先直接提取出压缩域音频特征 ;然后基于提取出来的压缩域特征实现解说音的检测和分割 ,并且分别识别足球比赛中解说员激动解说和观众激昂欢呼两种类型音频事件 ;最后通过概率融合生成最终结果 ,融合结果所对应的比赛片段就是提取出的足球比赛精彩场景 This paper presents an algorithm to automatically analyze and extract soccer program highlights scene based on MPEG compressed audio-track analysis. In this algorithm, audio compressed features are directly extracted first and then based on the features the algorithm detects and segments commentary speeches, to recognize in particular the events of excited commentary and crowd cheers respectively. Finally, the recognized results are integrated by probability fusion, and the corresponding video clips of fusion result are chosen as soccer highlights. Experimental data shows that the algorithm works well.

作者陈忠克郭振江刘骏伟吴飞庄越挺

机构地区浙江大学计算机学院浙江大学医学院附属邵逸夫医院

出处《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2004年第6期856-860,共5页 Journal of Computer-Aided Design & Computer Graphics

基金国家自然科学基金 ( 60 2 72 0 3 1) 教育部博士点科研基金( 2 0 0 10 3 3 5 0 49) 国家"十五"重大科技攻关项目 ( 2 0 0 1BA10 1A0 7 0 3 ) 浙江省科技计划项目重点科研项目 ( 2 0 0 3C2 10 10 ) 浙江省自然科学基金(M 60 3 2 0 2 )资助

关键词足球精彩场景 MPEG音频概率融合 soccer highlights MPEG audio probability fusion

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Xie L, Chang S-F, Divakaran A, et al. Structure analysis of soccer video with hidden Markov models[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Orlando, 2002. 345～350
2Chang P, Han M, Gong Y. Highlight detection and classification of baseball game video with hidden Markov models[A]. In: Proceedings of the International Conference on Image Processing, New York, 2002. 167～171
3Rui Yong, Gupta Anoop, Acero Alex. Automatically extracting highlights for TV baseball programs[A]. In: Proceedings of ACM Multimedia, Los Angeles, 2000. 105～115
4Tzanetakis George, Cook Perry. Sound analysis using MPEG compressed audio[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Istanbul, 2000. 757～761
5Rabiner L, Juang B-H. Fundamentals of Speech Recognition[M]. New Jersey: Prentice-Hall, 1993
6Huang L S, Yang C-H. A novel approach to robust speech endpoint detection in car environments[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Istanbul, 2000. 434～438
7Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2): 257～286
8庄越挺,毛祎,吴飞,潘云鹤.基于隐马尔可夫链的广播新闻分割分类[J].计算机研究与发展,2002,39(9):1057-1063. 被引量：7
9肖俊,庄越挺,吴飞.基于细节层次与最小生成树的三维地形识别与检索[J].软件学报,2003,14(11):1955-1963. 被引量：10
10Platt J C. Probabilistic Outputs for Support Vector Machines for Pattern Recognition[M]. In: Fayyad U, ed. Advances in Large Margin Classifiers. Boston: Kluwer Academic Publishers, 1999. 61～74

二级参考文献26

1[1]J T Foote. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1): 2～11
2[2]S John. Real time discrimination of broadcast speech/music. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing (ICASSP-96). Atlanta, GA, 1996. 993～996
3[3]E Scheirer, M Slaney. Construction and evaluation of a robust multifeature music/speech discriminator. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing (ICASSP-97). Munich, Germany, 1997. 1331～1334
4[4]M Spina, V Zue. Automatic transcription of general audio data: Preliminary analysis. In: Proc of Int'l Conf on Spoken Language Processing. Philadelphia, PA, 1996. 594～597
5[5]J T Foote. A similarity measure for automatic audio classification. In: Proc of AAAI 1997 Spring Symp on Intelligent Integration and Use of Text, Image, Video, and Audio Corpora. Palo Alto, CA: Stanford, 1997
6[6]S Savitha, D Petkovic, D Ponceleon. Towards robust features for classifying audio in the cuevideo system. In: Proc of ACM Multimedia 99. New York, USA, 1999. 393～400
7[7]Tong Zhang, C-C Jay Kuo. Heuristic approach for generic audio data segmentation and annotation. In: Proc of ACM Multimedia Conf. Orlando, 1999. 67～76
8[8]M Slaney, R F Lyon. A perceptual pitch detector. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing 1990 (ICASSP 90). Albuquerque, 1990. 357～360
9[9]L R Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proc of the IEEE, 1989, 77(2): 257～286
10[10]G Tzanetakis, P Cook. Multifeature audio segmentation for browsing and annotation. In: Proc of 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, 1999

共引文献15

1杨玉莲,谢磊.基于子词链的中文新闻广播故事自动分割[J].计算机应用研究,2009,26(2):583-586. 被引量：2
2毛祎,潘红,吴飞,庄越挺.基于深度加权法向映射的三维模型检索[J].计算机辅助设计与图形学学报,2005,17(2):247-252. 被引量：5
3闫丽颖,王欢,杨颖.模糊c均值聚类在wav格式音频检索中的研究[J].中国科技信息,2006(02A):15-15. 被引量：1
4张伟,陈立潮,侯娟,王宇.聚类分析及其在GIS中的应用研究[J].科技情报开发与经济,2007,17(18):178-180. 被引量：4
5万丽莉,赵沁平,郝爱民.一种基于部件空间分布的三维模型检索方法[J].软件学报,2007,18(11):2902-2913. 被引量：13
6宋晓慧.一种基于交互式分割的部分模型检索方法[J].计算机仿真,2009,26(3):244-246. 被引量：1
7张慧杰,孙吉贵,吕英华,吕楠,王远志.一种新的基于发散度函数的地形模型简化方法[J].计算机学报,2009,32(5):962-973. 被引量：9
8张瑞杰,李弼程,屈丹.基于可信度变化趋势的音频分割算法[J].计算机工程,2010,36(8):177-179. 被引量：3
9宋国明,王厚军,姜书艳,刘红.最小生成树SVM的模拟电路故障诊断方法[J].电子科技大学学报,2012,41(3):412-417. 被引量：9
10徐海峰,秦茂玲,刘辉.一种基于特征点分割的三维模型检索方法[J].计算机技术与发展,2013,23(1):71-74. 被引量：2

1曹德明,王永琳.多传感器系统的检测概率融合与性能比较[J].抗恶劣环境计算机,1992,6(3):17-29.
2于洋,王蕾.改进的遗传算法在图像边缘提取中的应用[J].火力与指挥控制,2002,27(5):67-70. 被引量：3
3MPEG技术与光盘[J].电脑爱好者,1996,0(3):58-59.
4耿军雪,谢陈.基于改进遗传算法的遥感图像边缘检测[J].测绘技术装备,2006,8(4):36-38. 被引量：2
5周莉,台正.异化的字幕[J].当代电视,2014(6):96-97.
6赵娟娟.数字图像边缘检测方法的对比分析及优化[J].甘肃科学学报,2012,24(3):143-146. 被引量：6
7凝固的色彩喷墨打印机大阅兵[J].新电脑,2003(8):142-145.
8石剑.重磅来袭情定Win8“超级本”[J].优品,2013,0(3):198-201.
9娄小平,戴军.集群通信中一种语音加密方法的研究与实现[J].湖南文理学院学报（自然科学版）,2008,20(4):75-78. 被引量：3
10潘继斌.基于SVM局部后验概率融合方法研究[J].数学的实践与认识,2006,36(2):182-185.

计算机辅助设计与图形学学报

2004年第6期

浏览历史

内容加载中请稍等...

足球比赛精彩场景的自动分析与提取

参考文献10

二级参考文献26

共引文献15

相关作者

相关机构

相关主题

浏览历史