期刊文献+

模糊判决支持向量机在自动语种辨识中的研究

Automatic Language Identification Based on FDSVM
下载PDF
导出
摘要 支持向量机(SVM)是在统计学习理论的基础上发展起来的一种新的通用学习方法。自动语种辨识是语音信号处理中新出现的分支,也是一项较难的课题。该文提出的模糊判决支持向量机(FDSVM)是对支持向量机的判决结果的合理化改进,并应用于自动语种辨识系统。利用OGI-TS电话语音库对新算法的性能进行测试,然后给出实验结果。结果表明,该算法相对于传统算法是一种更有效的方法。 A support vector machines(SVM)is a new powerful classification machines from the theory of learning systems.Automatic language identification is a new and difficult embranchment of the speech signal processing.In this paper,Fussy Discrimination SVM(FDSVM)algorithm is provided which is an improving method based on SVM.Some experiments are conducted using OGI-TS telephone speech corpus.Then experiments results are described.It is shown that FDSVM is another more efficient method comparing with traditional ways.
作者 张凡 贺苏宁
出处 《计算机工程与应用》 CSCD 北大核心 2004年第21期69-71,共3页 Computer Engineering and Applications
基金 国家部委基金项目(编号:514950307)资助
关键词 模糊判决支持向量机 语种辨识 线性预测倒谱系数 Fussy Discrimination Support Vector Machines(FDSVM),Language Identification(LI ),Linear Prediction Cepstrum Coefficients(LPCC)
  • 相关文献

参考文献9

  • 1Yeshwant K Muthusamy. Reviewing Automatic Language Identification[J].IEEE Signal Processing Magazine, 1994
  • 2Pedro A Torres-Carrasquillo. Approaches to Language Identification using Gaussian Mixture Models and Shifted Delta Cepstral Features[C].In:Proc of Int`l Conf on Spoken Language Processing,Dever,2002-09
  • 3Marc A Zissman. Automatic Language Identification using GMM and HMM[J].IEEE, 1993
  • 4Marc A Zissman. Comparison of Four Approaches to Automatic Language Identification of Telephone Speech[J].IEEE Transactions on Speech and Audio Processing,1996;4
  • 5Massimiliano Pontil,Alessandro Verri. Properties of Support Vector Machines. A I Memo No1612,C B C L Paper,No152
  • 6Chih-Wei Hsu,Chih-Jen Lin.A Comparison of Methods for Multiclass Support Vector Machines[J].IEEE Transactions on neural networks,2002; 13(2)
  • 7J Weston,C Watkins. Support Vector Machines for Multi-Class Pattern Recognition
  • 8Eddie Wong,Sridha Sridharan.Comparison of Linear Prediction Cepstrum Coefficients and Mel-Frequency Cepstrum Coefficients for Language Identification[J].ISIMP, 2001
  • 9屈丹,王炳锡,魏鑫.基于GMM-UBM模型的语言辨识研究[J].信号处理,2003,19(1):85-88. 被引量:10

二级参考文献11

  • 1Y. K. Muthusamy, E. Barnard and R. A. Cole, "Reviewing Automatic Language Identification", IEEE Signal Processing Magazine, October 1994.
  • 2Berkling, K.M., Arai, T., Barnard, E., Cole, R.A., 1994.Analysis of phoneme-based features for language identification. In: International Conference on Acoustics,Speech, and Signal Processing, Vol. 1, Aprikl 1994, pp.289-292.
  • 3M.A. Zissman. Language identification using phoneme recognition phonotactic language modeling. In Proceedings 1995 IEEE International Conference onAcoustics,Speech, and Signal Processing, pages 3503- 3506, May 1995.
  • 4J. Narvratil and Wemer Zuhlke. Double bigramdecoding in Phonotactic language identification. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 97, Munique,Germany, April 1997.
  • 5Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology, Portland,1993.
  • 6D.A. Reynolds, T. E Quaffed, and R. B. Dunn. Speaker verification using adapted Gaussian mixture models.Digital Signal Processing, Vol. 10, pp 19-41, 2000.
  • 7D.A. Reynolds, and R.C. Rose, Rosust text-independence speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, vol.3, No. 1, pp72-83.
  • 8A. E. Rosenberg and S. Parthasarathy, Speaker background models for connected digit password speaker verification. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing,pp 81-84, 1996
  • 9J. L. Gauvain and C.H. Lee, Maximum a postedori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Trans. Speech Audio Process.Vol.2, pp 291-298,1994.
  • 10M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech",IEEE Trans. Speech Audio Process. Vol. 4, pp 31-44.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部