期刊文献+

一种新的基于分类的音频流分割方法 被引量:10

A Novel Classification-Based Audio Segmentation Algorithm
下载PDF
导出
摘要 很多传统的音频流分割方法都是基于小尺度音频分类的,它们普遍存在虚假分割点过多的缺点,严重影响了实际应用的效果.我们的研究表明,大尺度音频片段的分类正确率明显高于小尺度音频片段的分类正确率.基于这个事实和减少虚假分割点的目的,我们提出了一种新的基于分类的音频流分割方法.首先,采用基于大尺度分类的分割方法对音频流进行粗分割,然后采用基于小尺度分类的细分割步骤在边界区域中进一步精确定位分割点.理论分析和实验结果均表明,当处理类别变换频率较低的音频流时,这种分割方法在保持真实分割点检测率的同时能够大幅降低虚假分割率. Content-based audio segmentation plays an important role in multimedia applications. Many conventional segmentation algorithms are based on small-scale classification and always result in a high false alarm rate. Our experimental results show that large-scale audio can be more easily classified than small ones, and this trend is irrespective of classifiers. According to this fact,we present a novel framework for audio segmentation to reduce the false seg- mentations. First,a rough segmentation step based on large-scale classification is taken to ensure the integrality of the content of segments. Then a subtle segmentation step based on small-scale classification is taken to further locate the segmentation points from the boundary areas computed by the rough segmentation step. Both theoretical analysis and ex- perimental results show that nearly 3/4 false segmentation points can be reduced comparing to the conventional audio segmentation method based on small-scale audio classification, while preserving a low missing rate, when infrequently type-changed audio streams are dealt. So it can be concluded that it is very suitable for the real tasks such as music broadcast segmentation or music video analysis.
出处 《电子学报》 EI CAS CSCD 北大核心 2006年第4期612-617,共6页 Acta Electronica Sinica
基金 国家自然科学基金(No.60573060No.60205002No.60332010) 北京市自然科学基金(No.4042020)
关键词 音频分类 音频分割 虚假分割 神经网络 audio classification audio segmentation false segmentation rate neural network
  • 相关文献

参考文献12

  • 1Chou W, Gu L. Robust singing detection in speech/music discriminator design[ A]. In. Proc ICASSP[ C ].Salt Lake City, USA : IEEE,2001,2:865 - 868.
  • 2Ajmera J, Mccowan I A, Bourlard H. Robust HMM-based speech/music segmentation [ A ]. In: Proc ICASSP[ C]. Orlando, USA: IEEE,2002 ,1:297 -300.
  • 3Sundaram H, Chang S F. Audio scene segmentation using multiple features, models and time scales [ A ]. In:IEEE Proc ICASSP [ C ]. Istanbul, Turkey: IEEE, 2000.4.2441 - 2444.
  • 4Foote J. Automatic audio segmentation using a measure of audio novelty [ A ]. In: IEEE Proc Multimedia and Expo [ C ]. New York, USA: IEEE, 2000.1. 452 - 455.
  • 5Kemp T, Schmidt M, Waibel A. Strategies for automatic segmentation of audio data [ A ]. In: IEEE Proc. ICASSP[ C ]. Istanbul, Turkey: IEEE,2000.3. 1423 - 1426.
  • 6Zhang T, Kuo C J. Audio content analysis for online audiovisual data segmentation and classification [ J ]. IEEE Trans Speech and Audio Processing ,2000,9 (4) :441 -457.
  • 7Lu L, Zhang H J, Jiang H. Content analysis for audio classification and segmentation [ J ]. IEEE Trans, Speech and Audio Processing,2002,10 ( 7 ) : 504 - 516.
  • 8Zhang Y B, Zhou J. A study on content-based music classification[ A ]. In. IEEE Proc Seventh International Symposium on Signal Processing and Its Applications[ C ]. Paris, France, IEEE, 2003.2.113 - 116.
  • 9Li D G, Sethi I K, Dimitrova N, Mcgee T. Classification of general audio data for content-based retrieval [ J ].Pattern Recognition Letters, 2001,22 (5) :533 - 544.
  • 10Ripley B D. Pattern recognition and neural networks[ M ]. London: Cambridge University Press, 1996.

同被引文献107

引证文献10

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部