Automatic Language Identification Using the Frequencies of Occurrence of Phones (基于音素发生率的自动语言辨识)

Abstract: The types and number of phonetic units differ from language to language, and even when two languages share the same phones, the frequencies with which those phones occur differ. Earlier language identification systems that rely on phone labels make it difficult to add new languages. This paper uses Gaussian mixture models (GMM) and vector quantization (VQ) models to study the role that phone-symbol occurrence frequency information plays in language identification. It presents the basic phone-symbol occurrence frequency method together with three improved methods. The experimental results show that phone-symbol occurrence frequency information is of some use in language identification and can serve as one direction for research on language identification methods.
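The abstract does not describe the authors' scoring procedure, so the following is only a minimal sketch of the general idea that occurrence frequencies alone can help separate languages. It assumes that each utterance has already been converted to a sequence of integer symbols (for example, VQ codeword indices standing in for phone-like units), and it scores a test sequence with a smoothed unigram frequency model per language. The function names, the add-one smoothing, and the toy data are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def train_frequencies(sequences, num_symbols, smoothing=1.0):
    """Estimate smoothed occurrence frequencies for symbols 0..num_symbols-1."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq)
    # Additive smoothing keeps unseen symbols from getting zero probability.
    total = sum(counts.values()) + smoothing * num_symbols
    return [(counts[s] + smoothing) / total for s in range(num_symbols)]


def log_likelihood(sequence, freqs):
    """Log-probability of a symbol sequence under a unigram frequency model."""
    return sum(math.log(freqs[s]) for s in sequence)


def identify(sequence, models):
    """Return the language whose frequency model scores the sequence highest."""
    return max(models, key=lambda lang: log_likelihood(sequence, models[lang]))


# Toy training data (hypothetical): symbol 0 dominates lang_a, symbol 3 dominates lang_b.
training = {
    "lang_a": [[0, 0, 1, 0, 2, 0], [0, 1, 0, 0, 3]],
    "lang_b": [[3, 3, 2, 3, 1, 3], [3, 2, 3, 3, 0]],
}
models = {
    lang: train_frequencies(seqs, num_symbols=4)
    for lang, seqs in training.items()
}

result = identify([0, 0, 1, 0], models)
print(result)
```

Because the model stores only one frequency table per language, adding a language in this sketch requires nothing more than symbol sequences from that language; no phone-labelled transcriptions are involved.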

Authors: Dai Guannan (戴冠男), Wang Bingxi (王炳锡), Qu Dan (屈丹)

Affiliation: PLA Information Engineering University (解放军信息工程大学)

Source: Journal of Signal Processing (《信号处理》), indexed in CSCD and the Peking University Core Journals list, 2006, No. 2, pp. 285-288 (4 pages)

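Funding: Supported by the National Natural Science Foundation of China under the project "Research on Language Identification of Natural Speech over Telephone Channels" (Grant No. 60372038)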

Keywords: Gaussian Mixture Model (GMM); Vector Quantization (VQ); Mixed Training Model (MTM); Frequency of Occurrence of Phones; Usefulness; Usefulness Pair

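Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology]; S858.31 [Agricultural Science: Clinical Veterinary Medicine]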
