基于GMBM-UBBM模型的语言辨识研究

Automatic Language Identification Based on GMBM-UBBM

下载PDF

导出

摘要高斯混合模型(GMM)是进行说话人无关的语言辨识的一种有效方法,高斯混合二元模型(GMBM)是GMM模型的二元时序扩展,该文在GMBM和GMM-UBM模型的基础上提出了一种基于GMBM-UBBM模型的语言辨识系统,并利用OGI-TS电话语音库对算法的性能进行了测试,然后给出了实验结果。实验结果表明,该算法也是进行语言辨识的一种有效方法,与传统的GMM-UBM算法相比,该算法最多可以获得4.378%的相对改善率。 Gaussian Mixture Model is an effective method for speaker -independent language identification.Gaussian Mixture Bigram Model integrates bigram time correlation to extend the GMM.In this paper,a language identification algorithm using GMBM-UBBM is proposed based on GMBM and GMM-UBM,and some experiments are conducted using OGI-TS multi-language telephone speech corpus.Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish among the three languages with maximal4.378%improvement accuracy superior to conventional GMM-UBM.

作者屈丹王炳锡

机构地区解放军信息工程大学

出处《计算机工程与应用》 CSCD 北大核心 2004年第3期29-32,共4页 Computer Engineering and Applications

基金国家自然科学基金资助项目(批准号:60372038)

关键词高斯混合模型高斯混合二元模型全局背景模型全局背景二元模型贝叶斯自适应语言辨识 Gaussian mixture model,Gaussian mixture bigram model,Universal background model,Universal background bigram model,Bayesian adaptation

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1屈丹,王炳锡,魏鑫.基于GMM-UBM模型的语言辨识研究[J].信号处理,2003,19(1):85-88. 被引量：10

二级参考文献11

1Y. K. Muthusamy, E. Barnard and R. A. Cole, "Reviewing Automatic Language Identification", IEEE Signal Processing Magazine, October 1994.
2Berkling, K.M., Arai, T., Barnard, E., Cole, R.A., 1994.Analysis of phoneme-based features for language identification. In: International Conference on Acoustics,Speech, and Signal Processing, Vol. 1, Aprikl 1994, pp.289-292.
3M.A. Zissman. Language identification using phoneme recognition phonotactic language modeling. In Proceedings 1995 IEEE International Conference onAcoustics,Speech, and Signal Processing, pages 3503- 3506, May 1995.
4J. Narvratil and Wemer Zuhlke. Double bigramdecoding in Phonotactic language identification. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 97, Munique,Germany, April 1997.
5Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology, Portland,1993.
6D.A. Reynolds, T. E Quaffed, and R. B. Dunn. Speaker verification using adapted Gaussian mixture models.Digital Signal Processing, Vol. 10, pp 19-41, 2000.
7D.A. Reynolds, and R.C. Rose, Rosust text-independence speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, vol.3, No. 1, pp72-83.
8A. E. Rosenberg and S. Parthasarathy, Speaker background models for connected digit password speaker verification. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing,pp 81-84, 1996
9J. L. Gauvain and C.H. Lee, Maximum a postedori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Trans. Speech Audio Process.Vol.2, pp 291-298,1994.
10M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech",IEEE Trans. Speech Audio Process. Vol. 4, pp 31-44.

共引文献9

1张凡,贺苏宁.模糊判决支持向量机在自动语种辨识中的研究[J].计算机工程与应用,2004,40(21):69-71.
2屈丹,侯风雷,王炳锡,吴保民.基于说话人聚类和高斯混合模型的语言辨识研究[J].信号处理,2004,20(3):285-289.
3张强,屈丹,侯风雷,王炳锡.应用说话人聚类技术改善语言辨识系统识别率[J].电声技术,2007,31(3):44-48.
4顾明亮.一种新的汉语方言辨识特征[J].广西科学,2007,14(4):423-425.
5屈丹,闫红刚,唐晖,王炳锡.基于概率统计直方图的压缩域说话人识别[J].数据采集与处理,2009,24(5):594-599.
6陈业仙,张歆奕,毛杰.基于GMM-UBM的语言辨识算法研究[J].五邑大学学报（自然科学版）,2010,24(3):56-60.
7顾明亮,张彪.半监督矢量量化的汉语方言辨识[J].计算机工程与应用,2011,47(33):109-111. 被引量：1
8韩军.基于DBF的汉语方言自动辨识[J].电声技术,2017,41(4):120-124. 被引量：2
9周大春,邵玉斌,张昊阁,龙华,彭艺.应用于噪声环境下语种识别的GFCC改进算法[J].云南大学学报（自然科学版）,2024,46(2):246-254.

1陈业仙,张歆奕,毛杰.基于GMM-UBM的语言辨识算法研究[J].五邑大学学报（自然科学版）,2010,24(3):56-60.
2张凡,贺苏宁.基于支持向量机的多种语言话音识别研究[J].计算机应用,2004,24(S1):282-284. 被引量：3
3姜洪臣,郑榕,张树武,徐波.基于SDC特征和GMM-UBM模型的自动语种识别[J].中文信息学报,2007,21(1):49-53. 被引量：14
4陈业华,熊学发.穷举极限内的语言辨识[J].荆州师专学报,1990,13(2):25-29.
5雷维嘉.51系列单片机慢速读写的时序扩展[J].单片机与嵌入式系统应用,2003(6):78-80.
6龙望晨.基于虚拟化技术的计算机实验室管理模型[J].工业控制计算机,2016,29(7):120-121. 被引量：4
7热依曼.吐尔逊,依皮提哈尔.买买提,吾守尔.斯拉木.维吾尔语电话语音语料库的研发[J].新疆大学学报（自然科学版）,2013,30(2):199-203. 被引量：2
8戴冠男,王炳锡,屈丹.基于音素发生率的自动语言辨识[J].信号处理,2006,22(2):285-288.
9李金厚.FEBM模型中的一点不足与改进[J].安徽工业大学学报（自然科学版）,2002,19(2):145-147.
10张彩红,洪青阳,陈燕.基于GMM-UBM的说话人确认系统的研究[J].心智与计算,2007,0(4):420-425. 被引量：7

计算机工程与应用

2004年第3期

浏览历史

内容加载中请稍等...

基于GMBM-UBBM模型的语言辨识研究

参考文献1

二级参考文献11

共引文献9

相关作者

相关机构

相关主题

浏览历史