Research on Language Identification Based on the GMM-UBM Model (cited by: 10)

Automatic language identification based on GMM-UBM

Abstract: Compared with speaker recognition and continuous speech recognition, automatic language identification is a relatively new and comparatively difficult research problem. This paper presents a language identification system based on the GMM-UBM model, evaluates the algorithm on the OGI-TS telephone speech corpus, and reports the experimental results. The results show that GMM-UBM is also an effective method for language identification.

Authors: 屈丹 (Qu Dan), 王炳锡 (Wang Bingxi), 魏鑫 (Wei Xin)

Affiliation: PLA Information Engineering University (解放军信息工程大学)

Source: Journal of Signal Processing (《信号处理》), CSCD, 2003, No. 1, pp. 85-88 (4 pages)

Keywords: speech recognition; language identification; GMM-UBM model; computer; Gaussian mixture model; universal background model; Bayesian adaptation

Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]
