期刊文献+

一种新型的与文本相关的说话人识别方法研究

A new study on text-related speaker recognition
下载PDF
导出
摘要 在与文本相关的说话人识别研究中,既要包含说话人身份的识别,又要包含语音文本内容的识别.提出一种基于语音识别的与文本相关的说话人识别方法,从而建立说话人的声纹模型和语音文本模型,与传统的仅建立一种模型的方法相比,该方法能更精确地描述说话人身份信息和语音的文本信息,较好地解决了短时语音样本识别效果不佳的问题.测试实验表明,和传统与文本相关的说话人识别方法(如基于动态时间规整、高斯混合-通用背景模型)相比,由本方法建立的系统虚警概率降低了8.9%,识别性能得到了提高. In the study of text-related speaker recognition, it is to include the identity recognition as well as the speech text recog-nition. This paper proposes a new kind of text-related speaker recognition method based on the speech recognition. The model built by this method can describe both the identity information and the speech text information more accurately. Besides, it can also solve the problem that the short-term speech samples have poor recognition effect. The experiments show that compared with the traditional text-related speaker recognition system such as dynamic time warping ( DTW) and Gaussian mixture model-universal background model( GMM-UBM) ,the false alarm probability of the system established by the present method is reduced by 8.9% and the recognition performance is improved.
出处 《上海师范大学学报(自然科学版)》 2017年第2期224-230,共7页 Journal of Shanghai Normal University(Natural Sciences)
基金 上海高校青年教师培养计划(zzshsfl14026)
关键词 文本相关 说话人识别 语音识别 text-related speaker recognition speech recognition
  • 相关文献

参考文献5

二级参考文献39

  • 1吴尊敬,曹志刚.Improved MFCC-Based Feature for Robust Speaker Identification[J].Tsinghua Science and Technology,2005,10(2):158-161. 被引量:7
  • 2Reynolds D A,Rose R C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing,1995,3(1):72-83.
  • 3Reynolds D A.Speaker identification and verification using Gaussian mixture speaker model[J].Speech Communication,1995,17:91-108.
  • 4You K H.Wang H C.Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification[J].Speech Communication,1999,28:227-241.
  • 5Jim Z C.Improvement of the K-means clustering filtering algorithm[J].Pattern Recognition,2008,41 (12):3677-3681.
  • 6Reynolds D A,Thomas F.Speaker verification using adapted Gaus-sian mixture models[J].Digital Signal Processing,2000,10 (1-3):19-41.
  • 7Alvin F.Martin and Mark A.Przybockl.NIST 2003 Language Recognition Evaluation[A].In:Proceedings of Eurospeech[C].Geneva,Switzerland:Sept.2003,161-164.
  • 8P.A.Torres-Carrasquillo et al.Approaches to Language Identification Using Gaussian Mixture Model and Shifted Delta Cepstral Features[A].In:Proceedings of ICSLP[C].Colorado USA:Sept.2002,89-92.
  • 9D.A.Reynolds,T.F.Quatieri,and R.B.Dunn.Speaker Verification Using Adapted Gaussian Mixture Models[J].Digital Signal Processing,2000,Vol.10:19-41.
  • 10Y.K.Muthusamy,R.A.Cole,and B.T.Qshika.The OGI Multilanguage Telephone Speech Corpus[A].In:Proceedings of ICSLP[C],Oct.1992,895-898.

共引文献72

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部