
Research on Speaker Recognition Based on Model Clustering
Abstract: As speaker recognition technology is more widely deployed, the number of back-end speakers keeps growing. Identifying a speaker by comparing the input against every speaker model one by one is computationally expensive and hard to do in real time, which greatly reduces the performance and practicality of a speaker recognition system. This paper therefore proposes a speaker recognition method based on model clustering. Because the traditional K-L divergence is asymmetric, it is not a good distance measure for clustering and gives poor clustering results. The paper instead proposes a clustering method based on the Wasserstein distance, computed on an approximate model. Compared with the traditional speaker recognition method, recognition accuracy improves by nearly 4.7%, and recognition time is only 25.5% of that of the traditional method, which greatly improves the performance and practicality of the speaker recognition system.
Authors: CHEN Bingwo (陈秉沃); ZHANG Erhua (张二华); TANG Zhenmin (唐振民). School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094
Source: Computer & Digital Engineering (《计算机与数字工程》), 2023, No. 8, pp. 1745-1749, 1831 (6 pages)
Keywords: model clustering; earth mover's distance; Wasserstein distance; speaker recognition; Gaussian mixture model
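
The abstract's point about K-L divergence being asymmetric, and the Wasserstein distance being a better clustering measure, can be illustrated with a minimal sketch. This is not the paper's method: the abstract does not describe its approximate model, so the sketch assumes each speaker is summarized by a single diagonal-covariance Gaussian, for which both quantities have well-known closed forms. The function names and the toy parameters `P` and `Q` are hypothetical.

```python
import math

def w2_diag_gauss(mu_p, var_p, mu_q, var_q):
    """2-Wasserstein distance between two diagonal-covariance Gaussians.

    Closed form: W2^2 = ||mu_p - mu_q||^2 + ||sqrt(var_p) - sqrt(var_q)||^2.
    """
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_p, mu_q))
    std_term = sum((math.sqrt(a) - math.sqrt(b)) ** 2
                   for a, b in zip(var_p, var_q))
    return math.sqrt(mean_term + std_term)

def kl_diag_gauss(mu_p, var_p, mu_q, var_q):
    """K-L divergence KL(p || q) between two diagonal-covariance Gaussians."""
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total

# Toy "speaker models" (assumed values, for illustration only):
# each is a (mean vector, variance vector) pair.
P = ([0.0, 0.0], [1.0, 1.0])
Q = ([1.0, 0.0], [4.0, 1.0])

# The Wasserstein distance is the same in both directions ...
print("W2(P, Q) =", w2_diag_gauss(*P, *Q))
print("W2(Q, P) =", w2_diag_gauss(*Q, *P))

# ... while the K-L divergence depends on the argument order.
print("KL(P || Q) =", kl_diag_gauss(*P, *Q))
print("KL(Q || P) =", kl_diag_gauss(*Q, *P))
```

Because the Wasserstein distance is symmetric and satisfies the triangle inequality, it behaves like an ordinary metric in a clustering algorithm, whereas a K-L based measure gives different values depending on which model is treated as the reference.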
