期刊文献+

语言辨识系统中最佳线性融合技术的研究

Optimal Linear Combinations of Classifiers for Language Identification
下载PDF
导出
摘要 本文利用不同参数提取方法对语言辨识系统中的线性融合技术进行了研究。融合系数的获取通过三个准则进行实现,CFM准则、MSE准则和CE准则。实验系统采用了区分性高斯混合模型,利用OGI-TS多语种电话语音语料库,对决策级融合性能进行了评估。实验表明,利用决策级融合技术,选择最佳融合系数,可以很好地改善语言辨识率。 This paper presents the fusions for optimally combining different language identification using different features. The optimal combining coefficients are obtained using three criterions. The criterions considered are; classification figure of merit ( CFM ), mean square error(MSE) and cross entropy(CE). The reference system uses the discriminative training algorithm to get each model parameters. The experiments are conducted using OGI Multi-language speech corpus. The experimental results show the optimal combination of different classifiers using different parameters is very effective in improving the language identification accuracy rates.
出处 《信号处理》 CSCD 北大核心 2006年第5期737-740,共4页 Journal of Signal Processing
基金 国家自然科学基金 No.60372038
关键词 语言辨识 最佳线性融合CFM准则 MSE准则 CE准则 Language identification Optimal linear combination CFM Criterion MSE Criterion CE Criterion.
  • 相关文献

参考文献6

  • 1Hakan Ahincay and Mubeccel Demirekler, Comparison of Different Objective Functions for Optimal Linear Combination of Classifiers For speaker Identification. In Proceedings of ICASSP' 2001, May 2001, Salt Lake City, USA.
  • 2E. Wong and S. Sridharan, Fusion of Output Scores on Language Identification System, Workshop on Multilingual Speech and Language Processing,Aalborg Denmark,2001.
  • 3J. B. Hampshire, A. H. Waibel. A novel objective function for improved phoneme recognition using time-delay neural networks. IEEE Trans. On neutral networks, Vol. 1, No. 2,pp 216-228,1990.
  • 4Hermansky, H. "Perceptual linear predictive ( PLP ) analysis of speech", Journal of the Acoustical Society of America,Vol. 87 ,pp. 1738-1752,1990.
  • 5Qu Dan, Wang Bingxi, Zhang Qiang, Two discriminative training schemes for language identification, ICSP 2004,Vol. 1, pp. p 630-633.
  • 6Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology,Portland,1993.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部