期刊文献+

基于GMBM-UBBM模型的语言辨识研究

Automatic Language Identification Based on GMBM-UBBM
下载PDF
导出
摘要 高斯混合模型(GMM)是进行说话人无关的语言辨识的一种有效方法,高斯混合二元模型(GMBM)是GMM模型的二元时序扩展,该文在GMBM和GMM-UBM模型的基础上提出了一种基于GMBM-UBBM模型的语言辨识系统,并利用OGI-TS电话语音库对算法的性能进行了测试,然后给出了实验结果。实验结果表明,该算法也是进行语言辨识的一种有效方法,与传统的GMM-UBM算法相比,该算法最多可以获得4.378%的相对改善率。 Gaussian Mixture Model is an effective method for speaker -independent language identification.Gaussian Mixture Bigram Model integrates bigram time correlation to extend the GMM.In this paper,a language identification algorithm using GMBM-UBBM is proposed based on GMBM and GMM-UBM,and some experiments are conducted using OGI-TS multi-language telephone speech corpus.Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish among the three languages with maximal4.378%improvement accuracy superior to conventional GMM-UBM.
作者 屈丹 王炳锡
出处 《计算机工程与应用》 CSCD 北大核心 2004年第3期29-32,共4页 Computer Engineering and Applications
基金 国家自然科学基金资助项目(批准号:60372038)
关键词 高斯混合模型 高斯混合二元模型 全局背景模型 全局背景二元模型 贝叶斯自适应 语言辨识 Gaussian mixture model,Gaussian mixture bigram model,Universal background model,Universal background bigram model,Bayesian adaptation
  • 相关文献

参考文献1

二级参考文献11

  • 1Y. K. Muthusamy, E. Barnard and R. A. Cole, "Reviewing Automatic Language Identification", IEEE Signal Processing Magazine, October 1994.
  • 2Berkling, K.M., Arai, T., Barnard, E., Cole, R.A., 1994.Analysis of phoneme-based features for language identification. In: International Conference on Acoustics,Speech, and Signal Processing, Vol. 1, Aprikl 1994, pp.289-292.
  • 3M.A. Zissman. Language identification using phoneme recognition phonotactic language modeling. In Proceedings 1995 IEEE International Conference onAcoustics,Speech, and Signal Processing, pages 3503- 3506, May 1995.
  • 4J. Narvratil and Wemer Zuhlke. Double bigramdecoding in Phonotactic language identification. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 97, Munique,Germany, April 1997.
  • 5Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology, Portland,1993.
  • 6D.A. Reynolds, T. E Quaffed, and R. B. Dunn. Speaker verification using adapted Gaussian mixture models.Digital Signal Processing, Vol. 10, pp 19-41, 2000.
  • 7D.A. Reynolds, and R.C. Rose, Rosust text-independence speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, vol.3, No. 1, pp72-83.
  • 8A. E. Rosenberg and S. Parthasarathy, Speaker background models for connected digit password speaker verification. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing,pp 81-84, 1996
  • 9J. L. Gauvain and C.H. Lee, Maximum a postedori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Trans. Speech Audio Process.Vol.2, pp 291-298,1994.
  • 10M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech",IEEE Trans. Speech Audio Process. Vol. 4, pp 31-44.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部