语言辨识系统中最佳线性融合技术的研究

Optimal Linear Combinations of Classifiers for Language Identification

下载PDF

导出

摘要本文利用不同参数提取方法对语言辨识系统中的线性融合技术进行了研究。融合系数的获取通过三个准则进行实现,CFM准则、MSE准则和CE准则。实验系统采用了区分性高斯混合模型,利用OGI-TS多语种电话语音语料库,对决策级融合性能进行了评估。实验表明,利用决策级融合技术,选择最佳融合系数,可以很好地改善语言辨识率。 This paper presents the fusions for optimally combining different language identification using different features. The optimal combining coefficients are obtained using three criterions. The criterions considered are; classification figure of merit （ CFM ）, mean square error（MSE） and cross entropy（CE）. The reference system uses the discriminative training algorithm to get each model parameters. The experiments are conducted using OGI Multi-language speech corpus. The experimental results show the optimal combination of different classifiers using different parameters is very effective in improving the language identification accuracy rates.

作者张强屈丹王炳锡戴冠男

机构地区济南市粟山路一号

出处《信号处理》 CSCD 北大核心 2006年第5期737-740,共4页 Journal of Signal Processing

基金国家自然科学基金 No.60372038

关键词语言辨识最佳线性融合CFM准则 MSE准则 CE准则 Language identification Optimal linear combination CFM Criterion MSE Criterion CE Criterion.

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献6

1Hakan Ahincay and Mubeccel Demirekler, Comparison of Different Objective Functions for Optimal Linear Combination of Classifiers For speaker Identification. In Proceedings of ICASSP' 2001, May 2001, Salt Lake City, USA.
2E. Wong and S. Sridharan, Fusion of Output Scores on Language Identification System, Workshop on Multilingual Speech and Language Processing,Aalborg Denmark,2001.
3J. B. Hampshire, A. H. Waibel. A novel objective function for improved phoneme recognition using time-delay neural networks. IEEE Trans. On neutral networks, Vol. 1, No. 2,pp 216-228,1990.
4Hermansky, H. "Perceptual linear predictive ( PLP ) analysis of speech", Journal of the Acoustical Society of America,Vol. 87 ,pp. 1738-1752,1990.
5Qu Dan, Wang Bingxi, Zhang Qiang, Two discriminative training schemes for language identification, ICSP 2004,Vol. 1, pp. p 630-633.
6Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology,Portland,1993.

1屈丹,侯风雷,王炳锡,吴保民.基于说话人聚类和高斯混合模型的语言辨识研究[J].信号处理,2004,20(3):285-289.
2屈丹,王波,王炳锡.语言辨识系统的决策级融合研究[J].电声技术,2003,27(11):55-59. 被引量：1
3赵丹.单天线OFDM系统的信道估计算法研究[J].信息技术,2013,37(6):180-182.
4张文林,李弼程,屈丹.基于SVM-UBM的语言辨识系统[J].计算机工程与应用,2007,43(10):41-43.
5屈丹,王炳锡,魏鑫.基于GMM-UBM模型的语言辨识研究[J].信号处理,2003,19(1):85-88. 被引量：10
6张强,屈丹,侯风雷,王炳锡.应用说话人聚类技术改善语言辨识系统识别率[J].电声技术,2007,31(3):44-48.
7杨旭东,王万良.基于改进的MSE准则的小波图像压缩[J].计算机辅助设计与图形学学报,2003,15(4):402-405. 被引量：10
8屈丹,王炳锡,藏传辉.基于GMM区分性训练方法的语言辨识系统[J].计算机工程与应用,2004,40(6):108-110. 被引量：4
9屈丹,王炳锡,魏鑫.语言辨识的矢量量化方法(VQ)[J].信息工程大学学报,2002,3(3):54-57.
10李楠.三星（SAMSUNG）高清视界新革命SCB-6000P[J].A&S（安防工程商）,2011(10):22-25.

信号处理

2006年第5期

浏览历史

内容加载中请稍等...

语言辨识系统中最佳线性融合技术的研究

参考文献6

相关作者

相关机构

相关主题

浏览历史