期刊文献+

一种面向基于内容视频检索的音频场景分割方法

Audio Scene Segmentation Method for Content-based Video Retrieval
下载PDF
导出
摘要 视频数据中的音频流包含了丰富的语义信息.在基于内容的视频检索中,对音频信息的分析是不可分割的一部分.本文主要讨论基于内容的音频场景分割,分析各种音频特征及提取方法,并在此基础上提出一种新的音频流分割方法,根据六种音频类型(语音、音乐、静音、环境音、纯语音、音乐背景下的语音和环境音背景下的语音)的音频特征对视频数据中的音频流分割音频场景.实验证明该方法是有效的,在保证一定的分割精度的同时,准确率和查全率都得到了较大的提高. Audio streams in video contain a lot of semantic information. In content-based video retrieval, it is indivisible to analyze audio signals. Having discussed various audio features and their extracting methods, we bring forward a new method for audio scene segmentation, according to the features of six kinds of audio signal types (silence, music, environmental sound, pure speech, speech with music and speech with environmental sound) to segment audio stream. Experimental results show that this proposed approach not only ensures segmented precision, but also improves greatly the recall and precision.
出处 《小型微型计算机系统》 CSCD 北大核心 2008年第3期557-562,共6页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(60673122)资助 广东省自然科学基金项目(5301029)资助 深圳大学科研启动基金项目(200515)资助
关键词 音频场景分割 基于内容的音频分析 音频特征 音频分类 audio scene segmentation content-based audio analysis audio features audio classification
  • 相关文献

参考文献2

二级参考文献14

  • 1Chou W.,Gu L..Robust singing detection in speech/music discriminator design.In:Proceedings of the IEEE ICASSP,Salt Lake City,USA,2001,2:865~868
  • 2Ajmera J.,Mccowan I.A.,Bourlard H..Robust HMM-based speech/music segmentation.In:Proceedings of the IEEE ICASSP,Orlando,USA,2002,1:297~300
  • 3Sundaram H.,Chang S.F..Audio scene segmentation using multiple features,models and time scales.In:Proceedings of the IEEE ICASSP,Istanbul,Turkey,2000,4:2441~2444
  • 4Foote J..Automatic audio segmentation using a measure of audio novelty.In:Proceedings of the IEEE Multimedia and Expo,New York,USA,2000,1:452~455
  • 5Kemp T.,Schmidt M.,Waibel A..Strategies for automatic segmentation of audio data.In:Proceedings of the IEEE ICASSP,Istanbul,Turkey,2000,3:1423~1426
  • 6Zhang T.,Kuo C.J..Audio content analysis for online audiovisual data segmentation and classification.IEEE Transactions on Speech and Audio Processing,2000,9(4):441~457
  • 7Lu L.,Zhang H.J.,Jiang H..Content analysis for audio classification and segmentation.IEEE Transactions on Speech and Audio Processing,2002,10(7):504~516
  • 8Bobrek M.,Koch D.B..Music signal segmentation using tree-structured filter banks.Journal of the Audio Engineering Society,1998,46(5):412~427
  • 9Zhang Y.B.,Zhou J..A study on content-based music classification.In:Proceedings of the 7th IEEE International Symposium on Signal Processing and Its Applications,Paris,France,2003,2:113~116
  • 10Li D.G.,Sethi I.K.,Dimitrova N.,Mcgee T..Classification of general audio data for content-based retrieval.Pattern Recognition Letters,2001,22(5):533~544

共引文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部