期刊文献+

改进的音频混合分割方法 被引量:4

Research on the Improved Hybrid Segmentation Algorithm for Audio
下载PDF
导出
摘要 针对基于距离和贝叶斯信息准则的混合分割算法在候选分割点确认时存在过于激进、容易造成分割点丢失的问题,提出一种保守的分割点确认方法,使被否定的候选分割点有多次机会被检验;针对固定的惩罚因子无法兼顾准确率和查全率的问题,提出了基于可检测度的惩罚因子自适应算法,并在一个启发式规则的基础上对基于可检测度的惩罚因子自适应进行扩充,实现了基于可检测度和启发式规则的惩罚因子自适应方法.实验结果表明,文中算法明显优于已有算法,且在性能上得到了很大提升. DISTBIC is a typical hybrid audio segmentation algorithm, but its validation method for candidate changing points is too radical and may lose the true changing points. To address this problem, we develop a new BIC validation algorithm, which gives the candidate points several opportunities to be validated. For a fixed penalty factor value, it is difficult to set a proper value to achieve both high recall rate and high precision. Based on the detect ability and heuristic rule, an adaptive setting method of the penalty factor is designed to solve this problem. The experimental results indicate that the improved algorithm is superior to the original algorithms.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2010年第7期1174-1181,共8页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金(60703049) 武汉市青年科技晨光计划(200850731353))
关键词 音频分割 贝叶斯信息准则 惩罚因子 自适应 audio segmentation Bayesian information criterion penalty factor adaptive
  • 相关文献

参考文献14

  • 1Cheng S S, Wang H M. A sequential metric-based audio segmentation method via the Bayesian information eriterion [C] //Proceedings of Eurospeech, Geneva, 2003: 945-948.
  • 2Chen S S, Gopalakrishnan P. Speaker, environment and channel change detection and clustering via the Bayesian information criterion [C] //Proceedings of the DARPA Workshop, Lansdowne, 1998: 127-132.
  • 3Cettolo M, Vescovi M. Efficient audio segmentation algorithms based on the BIC [C] //Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, 2003:537-540.
  • 4Tritsehler A, Gopinath R. Improved speaker segmentation and segments clustering using the Bayesian information criterion [C]//Proceedings of the Eurospeech, Budapest, 1999 : 2997-3000.
  • 5Cettolo M, Vescovi M, Rizzi R. Evaluation of BIC based algorithms for audio segmentation [J]. Computer Speech and Language, 2005, 19(2) : 147- 170.
  • 6Sivakumaran P, Fortuna J, Ariyaeeinia A M. On the use of the Bayesian information criterion in multiple speaker detection[C] //Proceedings of the Eurospeech, Scandinavia, 2001:795-798.
  • 7Ajmera J, MeCowan I A, Bourlard H. Robust HMM based speech/music segmentation [C] //Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, 2002:297-300.
  • 8Gauvain J L, Lamel L, Adda G. The LIMSI broadcast news transcription system [J]. Speech Communication, 2002, 37 (1): 89-108.
  • 9Lu L, Li S Z, Zhang H J. Content-based audio segmentation using support vector machines [C] //Proceedings of International Conference on Multimedia and Expro, Tokyo, 2001 : 749-752.
  • 10张一彬,周杰,边肇祺,张大鹏.一种基于内容的音频流二级分割方法[J].计算机学报,2006,29(3):457-465. 被引量:7

二级参考文献24

  • 1Chou W.,Gu L..Robust singing detection in speech/music discriminator design.In:Proceedings of the IEEE ICASSP,Salt Lake City,USA,2001,2:865~868
  • 2Ajmera J.,Mccowan I.A.,Bourlard H..Robust HMM-based speech/music segmentation.In:Proceedings of the IEEE ICASSP,Orlando,USA,2002,1:297~300
  • 3Sundaram H.,Chang S.F..Audio scene segmentation using multiple features,models and time scales.In:Proceedings of the IEEE ICASSP,Istanbul,Turkey,2000,4:2441~2444
  • 4Foote J..Automatic audio segmentation using a measure of audio novelty.In:Proceedings of the IEEE Multimedia and Expo,New York,USA,2000,1:452~455
  • 5Kemp T.,Schmidt M.,Waibel A..Strategies for automatic segmentation of audio data.In:Proceedings of the IEEE ICASSP,Istanbul,Turkey,2000,3:1423~1426
  • 6Zhang T.,Kuo C.J..Audio content analysis for online audiovisual data segmentation and classification.IEEE Transactions on Speech and Audio Processing,2000,9(4):441~457
  • 7Lu L.,Zhang H.J.,Jiang H..Content analysis for audio classification and segmentation.IEEE Transactions on Speech and Audio Processing,2002,10(7):504~516
  • 8Bobrek M.,Koch D.B..Music signal segmentation using tree-structured filter banks.Journal of the Audio Engineering Society,1998,46(5):412~427
  • 9Zhang Y.B.,Zhou J..A study on content-based music classification.In:Proceedings of the 7th IEEE International Symposium on Signal Processing and Its Applications,Paris,France,2003,2:113~116
  • 10Li D.G.,Sethi I.K.,Dimitrova N.,Mcgee T..Classification of general audio data for content-based retrieval.Pattern Recognition Letters,2001,22(5):533~544

共引文献21

同被引文献30

  • 1张一彬,周杰,边肇祺,张大鹏.一种基于内容的音频流二级分割方法[J].计算机学报,2006,29(3):457-465. 被引量:7
  • 2ISHI C T, ISHIGURO H, HAGITA N. Automatic extraction of para- linguistic information using prosodic features related to F0, duration and voice quality[ J]. SCI, Speech Communication 50, 2008 : 531 - 543.
  • 3CHENG S S, WANG H M.A. Sequential metric to based audio segmen- tation method via the Bayesian information criterion [ C]// Proceedings of Eurospeech. Geneva: University of Geneva, 2003:945 -948.
  • 4CHEN S S, GOPLALAKRISHNAN P. Speaker, environment and channel change detection and clustering via the Bayesian information criterion [ C ]// proceedings of the DARPA workshop. Lansdowne : [ s. n. ] , 1988 : 127 - 132.
  • 5CETI'OLO M, VESCOVI M. Efficient audio segmentation algorithms based on the BIC [ C ]//Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Hang Kong : IEEE, 2003 : 537 - 540.
  • 6Cettolo M, Vescovi M, Rizzi R. Evaluation of BIC based algorithms for audio segmentation [ J]. Computer Speech and Language, 2005, 19f2) : 147 -170.
  • 7MAO QiRong, WANG XiaoJia, ZHAN YongZhao. Speech emotion recognition method based on improved decision tree and layered fea- ture selection [ J ]. International Journal of Humanoid Robotics, 2010:245 - 261.
  • 8Taras Butko,Climent Nadeu. Audio segmentation of broadcast news in the Albayzin-2010 evaluation: Overview, results, and discussion [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011 (1): 1-10.
  • 9Sebastien Lefevre, Nicole Vincent. A two level strategy for au- dio segmentation[J]. Journal of Digital Signal Processing, 2010, 21 (2): 270-277.
  • 10Dalibor Mitrovic, Matthias Zeppelzauer, Christian Breithene- der. Features for content-based audio retrieval [J]. Journal of Advances in Computer, 2010, 78 (10): 71-150.

引证文献4

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部