期刊文献+

基于音素发生率的自动语言辨识

Automatic Language Identification Using the Frequencies of Occurrence of Phones
下载PDF
导出
摘要 不同语言的语音基元的种类和数量存在着差异,即使两种语言有相同的音素,它们的发生频率也存在差异。以前基于音素标识的语言辨识系统,难以引入新的语言。本文分别使用了GMM和VQ模型对音素符号发生率信息在语言辨识中的作用进行了研究,使用了音素符号发生率方法以及三种改进方法,各项实验结果表明音素符号发生率信息在语言辨识中具有一定的作用,可以作为语言辨识方法研究的一个方向。 Phonetic inventories differ from language to language. Even when languages have identical phones, the frequencies of occurrence of phones differ across languages. It' s difficult to introduce new languages when the language identification system used phones label. In this paper, we study the frequencies of occurrence of phones using Gaussian Mixture Model and Vector Quantization. The method of occurring of phones and three improved methods are provided in this paper. The experimental results show the frequencies of occurrence of phones are very effective in language identification.
出处 《信号处理》 CSCD 北大核心 2006年第2期285-288,共4页 Journal of Signal Processing
基金 国家自然科学基金委员会对“电话信道自然语音语言辨识研究”项目(批准号:No.60372038)的支持
关键词 高斯混合模型 矢量量化模型 混合训练模型 音素发生率 有效性 有效性对 Gaussian Mixture Model ( GMM ) Vector Quantization ( VQ ) Mixed Training Model ( MTM ) Occurring of Phones Usefulness Usefulness Pair
  • 相关文献

参考文献4

  • 1T. Nagarajan and Hema A. Murthy, “Language identification using spectral vector distribution across the languages”, in Proceedings of Int. Conf. Natural Language Processing, Dec. 2002.
  • 2T. Nagarajan and Hema A. Murthy, “A pairwise multiple codebook approach to implicit language identification”,Workshop on Spoken Language Processing, TIFR, India,Jan. 2003.
  • 3Y. K. Muthusamy, E. Barnard and R. A. Cole, “Reviewing Automatic Language Identification”, IEEE Signal Processing Magazine, October 1994.
  • 4Y. K. Muthusamy, R. A. Cole and B. T. Oshika, “The OGI Multi-Language telephone speech corpus”, Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology,Portland, 1993.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部