期刊文献+

基于GMM-UBM的声纹识别技术的特征参数研究 被引量:15

Research of Feature Parameters in Voiceprint Recognition Technology Based on GMM-UBM
下载PDF
导出
摘要 声纹识别技术实现的关键点在于从语音信号中提取语音特征参数,此参数具备表征说话人特征的能力。基于GMM-UBM模型,通过Matlab实现文本无关的声纹识别系统,对主流静态特征参数MFCC、LPCC、LPC以及结合动态参数的MFCC,从说话人确认与说话人辨认两种应用角度进行性能比较。在取不同特征参数阶数、不同高斯混合度和使用不同时长的训练语音与测试语音的情况下,从理论识别效果、实际识别效果、识别所用时长、识别时长占比等多个方面进行了分析与研究。最终结果表明:在GMM-UBM模式识别方法下,三种静态特征参数中MFCC绝大多数时候具有最佳识别效果,同时其系统识别耗时最长;识别率与语音特征参数的阶数之间并非单调上升关系。静态参数在结合较佳阶数的动态参数时能够提升识别效果;增加动态参数阶数与提高系统识别效果之间无必然联系。 The key of the voiceprint recognition technology is to extract speech feature parameters from speech signals,which have the capacity of representing speakers’ features. Based on Gaussian mixture model-universal background model,the mainstream static feature parameters which are MFCC,LPCC,LPC and MFCC combining with dynamic parameters are compared in terms of speaker identification and speaker verification according to text-independent voiceprint recognition systems realized by Matlab. In the case of taking different feature parameter order,different Gaussian mixture degree and using different time length of training and testing speech,we analyze and research in several aspects such as theoretical recognition effect,actual recognition effect,time consumption for recognizing and its ratio of components etc. The final results show that in the pattern recognition method GMM-UBM,MFCC do have best recognition effect and largest time consumption for system recognizing in the most of time. There is no monotonically ascending relation between the recognition rate and the order of speech feature parameters. MFCC combining with better order of dynamic parameters can improve recognition effect. Increasing the order of dynamic parameters is not definitely associate with improving recognition effect.
作者 周玥媛 孔钦 ZHOU Yue-yuan;KONG Qin(Nanjing University Jinling College,Nanjing 210089,China)
出处 《计算机技术与发展》 2020年第5期76-83,共8页 Computer Technology and Development
基金 全国高校计算机基础教学研究与改革课题(AFCEC-2016-18) 南京大学金陵学院重点教改项目(0010521816,0010521806)。
关键词 GMM-UBM 声纹识别 特征参数性能 说话人确认 说话人辨认 GMM-UBM voiceprint recognition performance of feature parameters speaker identification speaker verification
  • 相关文献

参考文献4

二级参考文献21

  • 1宋淼,李敬阳.浅谈声纹检验鉴定技术[J].北京人民警察学院学报,2005(2):35-36. 被引量:6
  • 2吴尊敬,曹志刚.Improved MFCC-Based Feature for Robust Speaker Identification[J].Tsinghua Science and Technology,2005,10(2):158-161. 被引量:7
  • 3杨阳,陈永明.声纹识别技术及其应用[J].电声技术,2007,31(2):45-46. 被引量:22
  • 4陈魁.实验设计与分析[M].北京:清华大学出版社,1996,8.94.
  • 5赵力.语音信号处理[M].北京:机械工业出版社,2008.
  • 6Reynolds D A,Rose R C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing,1995,3(1):72-83.
  • 7Reynolds D A.Speaker identification and verification using Gaussian mixture speaker model[J].Speech Communication,1995,17:91-108.
  • 8You K H.Wang H C.Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification[J].Speech Communication,1999,28:227-241.
  • 9Jim Z C.Improvement of the K-means clustering filtering algorithm[J].Pattern Recognition,2008,41 (12):3677-3681.
  • 10Reynolds D A,Thomas F.Speaker verification using adapted Gaus-sian mixture models[J].Digital Signal Processing,2000,10 (1-3):19-41.

共引文献42

同被引文献335

引证文献15

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部