期刊文献+

基于变分贝叶斯改进的说话人聚类算法 被引量:2

Improved Algorithm of Speaker Clustering Based on Variation Bayesian
下载PDF
导出
摘要 说话人聚类是说话人分离中的一个重要过程,然而传统的以贝叶斯信息准则作为距离测度的层次聚类方式,会出现聚类误差向上传递的情况。本文提出了一种逐级算法增强处理机制。当片段之间的最小贝叶斯信息准则距离超过设定的门限值时,或者类别个数到达一定程度时,将当前聚类结果作为初始类中心,通过变分贝叶斯迭代法重新对每个类别中的片段调优,最后再依据概率线性判别分析得分门限确定说话人个数。实验表明,本文方法在美国国家标准技术署08summed测试集上,使得"类纯度"和"说话人纯度"比传统算法都有了一定提升,且使得说话人分离整体性能相对提升了27.6%。 The speaker clustering is an important process of speaker diarization,yet traditional method for hierarchical agglomerative clustering(HAC)with distance measurement based on Bayesian information criterion(BIC)can lead to the clustering error propagation.To solve this problem,step by step algorithm is proposed,when the minimum BIC distance between segments exceeds a predefined threshold,or the number of the categories on hierarchical clustering reaches a certain number.The current clustering result as the initial class center,and then variational Bayesian method will be exploited to tune the speaker segments among the categories iteratively.Finally,the number of speaker is determined according to the probabilistic linear discriminant analysis(PLDA)score threshold.Experiments on national institute of standards and technology(NIST)08summed test set show that this method improves the "class purity" and "speaker purity" compared with conventional algorithms.Moreover,performance of speaker diarization is relatively improved by 27.6%.
出处 《数据采集与处理》 CSCD 北大核心 2017年第1期54-61,共8页 Journal of Data Acquisition and Processing
基金 公安部应用创新计划(2014YYCXGAES048)资助项目
关键词 说话人聚类 贝叶斯信息准则 概率线性判别分析 变分贝叶斯 speaker clustering Bayesian information criterion probabilistic linear discriminant analysis variational Bayesian
  • 相关文献

参考文献3

二级参考文献87

  • 1付中华,张艳宁.在线无监督说话人检索中稳健的模型自举算法[J].软件学报,2007,18(3):608-616. 被引量:3
  • 2..http://www.itl.nist.gov/iad/mig/tests/rt/,.
  • 3S.E.Tranter,D.A.Reynolds.An overview of automatic speaker diarization systems[J].IEEE Tram on Audio,Speech,and Language for Processing.2006,14(5):1557-1565.
  • 4M.Kotti,V.Moschou,C.Kotropoulos.Speaker segmentation and clustering.Signal Processing 2008(88):1091-1124.
  • 5T.Stafylakis and V.Katsouros.A review of recent advances in speaker diarization with bayesian methods.Speech and Language Technologies[M].InTech pubhshing 2011:217-240.
  • 6X.Anguera,S.Bozonnet,N.Evans,C.Fredouille,G.Friedland,O.Vinyals.Speaker diarization:a review of recent research[J].IEEE Trans on Audio,Speech,and Language for Processing.2012,20(2):356-370.
  • 7J.Ramírez; J.M.G6rriz,J.C.Segura.Voice activity detection.Fundamentals and Speech Recognition System Robustness[M].In M.Grimm and K.Kroschel.Robust Speech Recognition and Understanding.2007:1-22.
  • 8D.Liu and F.Kubala,Fast speaker change detection for broadcast news transcription and indexing[C].In Proc.Eur Conf.Speech Commun Technol,1999(3):1031-1034.
  • 9Nwe,T.L,Sun,H.,Li.,H.,Rahardja,S.,Speaker diarization in meeting audio,In Proc.of ICASSP,2010:4073-4076.
  • 10V.Gupta,P.Kenny,P.Ouellet,G.Boulianne,and P.Dumouchel.Combining gaussianized/non-gaussianized features to improve speaker diarization of telephone conversations[J].IEEE Signal processing letters.2007,14(12):1040-1043.

共引文献14

同被引文献17

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部