期刊文献+

足球比赛精彩场景的自动分析与提取

Automatic Analysis and Extraction of Soccer Highlights
下载PDF
导出
摘要 提出了基于MPEG压缩域音频流的足球比赛精彩场景自动分析与提取算法 首先直接提取出压缩域音频特征 ;然后基于提取出来的压缩域特征实现解说音的检测和分割 ,并且分别识别足球比赛中解说员激动解说和观众激昂欢呼两种类型音频事件 ;最后通过概率融合生成最终结果 ,融合结果所对应的比赛片段就是提取出的足球比赛精彩场景 This paper presents an algorithm to automatically analyze and extract soccer program highlights scene based on MPEG compressed audio-track analysis. In this algorithm, audio compressed features are directly extracted first and then based on the features the algorithm detects and segments commentary speeches, to recognize in particular the events of excited commentary and crowd cheers respectively. Finally, the recognized results are integrated by probability fusion, and the corresponding video clips of fusion result are chosen as soccer highlights. Experimental data shows that the algorithm works well.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2004年第6期856-860,共5页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金 ( 60 2 72 0 3 1) 教育部博士点科研基金( 2 0 0 10 3 3 5 0 49) 国家"十五"重大科技攻关项目 ( 2 0 0 1BA10 1A0 7 0 3 ) 浙江省科技计划项目重点科研项目 ( 2 0 0 3C2 10 10 ) 浙江省自然科学基金(M 60 3 2 0 2 )资助
关键词 足球精彩场景 MPEG音频 概率融合 soccer highlights MPEG audio probability fusion
  • 相关文献

参考文献10

  • 1Xie L, Chang S-F, Divakaran A, et al. Structure analysis of soccer video with hidden Markov models[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Orlando, 2002. 345~350
  • 2Chang P, Han M, Gong Y. Highlight detection and classification of baseball game video with hidden Markov models[A]. In: Proceedings of the International Conference on Image Processing, New York, 2002. 167~171
  • 3Rui Yong, Gupta Anoop, Acero Alex. Automatically extracting highlights for TV baseball programs[A]. In: Proceedings of ACM Multimedia, Los Angeles, 2000. 105~115
  • 4Tzanetakis George, Cook Perry. Sound analysis using MPEG compressed audio[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Istanbul, 2000. 757~761
  • 5Rabiner L, Juang B-H. Fundamentals of Speech Recognition[M]. New Jersey: Prentice-Hall, 1993
  • 6Huang L S, Yang C-H. A novel approach to robust speech endpoint detection in car environments[A]. In: Proceedings of International Conference on Acoustic, Speech and Signal Processing, Istanbul, 2000. 434~438
  • 7Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2): 257~286
  • 8庄越挺,毛祎,吴飞,潘云鹤.基于隐马尔可夫链的广播新闻分割分类[J].计算机研究与发展,2002,39(9):1057-1063. 被引量:7
  • 9肖俊,庄越挺,吴飞.基于细节层次与最小生成树的三维地形识别与检索[J].软件学报,2003,14(11):1955-1963. 被引量:10
  • 10Platt J C. Probabilistic Outputs for Support Vector Machines for Pattern Recognition[M]. In: Fayyad U, ed. Advances in Large Margin Classifiers. Boston: Kluwer Academic Publishers, 1999. 61~74

二级参考文献26

  • 1[1]J T Foote. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1): 2~11
  • 2[2]S John. Real time discrimination of broadcast speech/music. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing (ICASSP-96). Atlanta, GA, 1996. 993~996
  • 3[3]E Scheirer, M Slaney. Construction and evaluation of a robust multifeature music/speech discriminator. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing (ICASSP-97). Munich, Germany, 1997. 1331~1334
  • 4[4]M Spina, V Zue. Automatic transcription of general audio data: Preliminary analysis. In: Proc of Int'l Conf on Spoken Language Processing. Philadelphia, PA, 1996. 594~597
  • 5[5]J T Foote. A similarity measure for automatic audio classification. In: Proc of AAAI 1997 Spring Symp on Intelligent Integration and Use of Text, Image, Video, and Audio Corpora. Palo Alto, CA: Stanford, 1997
  • 6[6]S Savitha, D Petkovic, D Ponceleon. Towards robust features for classifying audio in the cuevideo system. In: Proc of ACM Multimedia 99. New York, USA, 1999. 393~400
  • 7[7]Tong Zhang, C-C Jay Kuo. Heuristic approach for generic audio data segmentation and annotation. In: Proc of ACM Multimedia Conf. Orlando, 1999. 67~76
  • 8[8]M Slaney, R F Lyon. A perceptual pitch detector. In: Proc of Int'l Conf on Acoustic, Speech, and Signal Processing 1990 (ICASSP 90). Albuquerque, 1990. 357~360
  • 9[9]L R Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proc of the IEEE, 1989, 77(2): 257~286
  • 10[10]G Tzanetakis, P Cook. Multifeature audio segmentation for browsing and annotation. In: Proc of 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, 1999

共引文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部