期刊文献+

基于凝聚信息瓶颈的音频事件聚类方法 被引量:7

Audio Events Clustering Based on Agglomerative Information Bottleneck
下载PDF
导出
摘要 为了进一步提高音频事件聚类算法性能,本文基于凝聚信息瓶颈理论提出一种音频事件聚类方法.首先,论述信息瓶颈原理及其推导过程;然后,详细论述一种基于凝聚信息瓶颈的音频事件聚类方法,包括源变量、相关变量和目标变量的定义,聚类的具体步骤,算法主要计算量分析等.采用取自两个数据库的音频事件样本进行测试,实验结果表明:与目前文献报道的方法相比,本文方法在多种实验条件下都获得了更高的K值(平均类纯度和平均音频纯度的几何平均值),而且运算速度更快. In order to further improve the performance of methods for audio events clustering, this paper proposes a method for audio events clustering based on the theory of agglomerative information bottleneck. First, the principles and derivations of information bottleneck are briefly introduced. Then, the proposed method is described in detail, including the definitions of source variables, relevance variables and destination variables, the steps of the proposed method and the analyses of main computational loads of all methods. The proposed method and two kinds of previous methods ( including the method based on spectral clustering, and the method based on both Bayesian information criterion and agglomerative hierarchical clustering) are evaluated on the experimental data extracted from two different corpora of audio events. The experimental results show that the proposed method obtains higher K values ( geometric mean of average clustering purity and average audio purity) and runs faster than the previous methods under several experimental conditions.
作者 李艳雄 王琴 张雪 邹领 LI Yan-xiong WANG Qin ZHANG Xue ZOU Ling(School of Electronic and Information Engineering, South China University of Technology, Guangzhou, Guangdong 510640, China)
出处 《电子学报》 EI CAS CSCD 北大核心 2017年第5期1064-1071,共8页 Acta Electronica Sinica
基金 国家自然科学基金(No.61101160) 中央高校基本科研业务费专项资金重点项目(No.2015ZZ102) 广州市珠江科技新星专项(No.2013J2200070)
关键词 凝聚信息瓶颈 音频事件聚类 音频内容分析 agglomerative information bottleneck audio events clustering audio content analysis
  • 相关文献

参考文献2

二级参考文献22

  • 1Stouten F,Duchateau J, Martens J P, et al. Coping with disfluencies in spontaneous speech recognition: acoustic detection and linguistic context manipulation [ J ]. Speech Communication ,2006,48 ( 11 ) : 1590-1606.
  • 2Cai R,Lu L,Zhang H J,et al. Highlight sound effects detection in audio stream [C]//Proceedings of IEEE International Conference on Multimedia and Expo. Baltimore: IEEE, 2003 : 37-40.
  • 3Kennedy L S, Ellis D P W. Laughter detection in meetings [ C ]//Proceedings of NIST International Conference on Acoustics, Speech and Signal Processing (Meeting Recognition Workshop ). Montreal: The National Institute of Standard and Technology ,2004:118-121.
  • 4Knox M T, Mirghafori N. Automatic laughter detection using neural networks [ C ]//Proceedings of InterSpeech. Antwerpen:International Speech Communication Association, 2007:2973-2976.
  • 5Laskowski K, Schuhz T. Detection of laughter-in-interaction in muhichannel close-talk microphone recordings of meetings [ C ] // Proceedings of the 5th International Workshop on Machine Learning for Muhimodal Interaction. Utrecht : Springer-Verlag ,2008 : 149-160.
  • 6Knox M T, Morgan N, Mirghafori N. Getting the last laugh: automatic laughter segmentation in meetings [ C ]//Proceedings of InterSpeech. Brisbane: International Speech Communication Association,2008:797-800.
  • 7Garg G, Ward N. Detecting filled pauses in tutorial dialogs [R]. EI Paso : Department of Computer Science, University of Texas at EI Paso,2006 : 1-9.
  • 8Audhkhasi K, Kandhway K, Deshmukh O D, et al. Formant-based technique for automatic filled-pause detection in spontaneous spoken English [C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Taipei : IEEE, 2009:4857-4860.
  • 9Li Y X, He Q H, Kwong S, et al. Characteristics-based effective applause detection for meeting speech [ J ]. Signal Processing ,2009,89 ( 8 ) : 1625-1633.
  • 10Carter A. Automatic acoustic laughter detection [ D ]. Staffordshire: Department of Electronic Engineering, Keele Universtiy ,2000.

共引文献4

同被引文献48

引证文献7

二级引证文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部