模糊判决支持向量机在自动语种辨识中的研究

Automatic Language Identification Based on FDSVM

下载PDF

导出

摘要支持向量机(SVM)是在统计学习理论的基础上发展起来的一种新的通用学习方法。自动语种辨识是语音信号处理中新出现的分支,也是一项较难的课题。该文提出的模糊判决支持向量机(FDSVM)是对支持向量机的判决结果的合理化改进,并应用于自动语种辨识系统。利用OGI-TS电话语音库对新算法的性能进行测试,然后给出实验结果。结果表明,该算法相对于传统算法是一种更有效的方法。 A support vector machines(SVM)is a new powerful classification machines from the theory of learning systems.Automatic language identification is a new and difficult embranchment of the speech signal processing.In this paper,Fussy Discrimination SVM(FDSVM)algorithm is provided which is an improving method based on SVM.Some experiments are conducted using OGI-TS telephone speech corpus.Then experiments results are described.It is shown that FDSVM is another more efficient method comparing with traditional ways.

作者张凡贺苏宁

机构地区西南电子电信技术研究所国家级重点实验室

出处《计算机工程与应用》 CSCD 北大核心 2004年第21期69-71,共3页 Computer Engineering and Applications

基金国家部委基金项目(编号:514950307)资助

关键词模糊判决支持向量机语种辨识线性预测倒谱系数 Fussy Discrimination Support Vector Machines(FDSVM),Language Identification(LI ),Linear Prediction Cepstrum Coefficients(LPCC)

分类号 TN192 [电子电信—物理电子学]

引文网络
相关文献

参考文献9

1Yeshwant K Muthusamy. Reviewing Automatic Language Identification[J].IEEE Signal Processing Magazine, 1994
2Pedro A Torres-Carrasquillo. Approaches to Language Identification using Gaussian Mixture Models and Shifted Delta Cepstral Features[C].In:Proc of Int`l Conf on Spoken Language Processing,Dever,2002-09
3Marc A Zissman. Automatic Language Identification using GMM and HMM[J].IEEE, 1993
4Marc A Zissman. Comparison of Four Approaches to Automatic Language Identification of Telephone Speech[J].IEEE Transactions on Speech and Audio Processing,1996;4
5Massimiliano Pontil,Alessandro Verri. Properties of Support Vector Machines. A I Memo No1612,C B C L Paper,No152
6Chih-Wei Hsu,Chih-Jen Lin.A Comparison of Methods for Multiclass Support Vector Machines[J].IEEE Transactions on neural networks,2002; 13(2)
7J Weston,C Watkins. Support Vector Machines for Multi-Class Pattern Recognition
8Eddie Wong,Sridha Sridharan.Comparison of Linear Prediction Cepstrum Coefficients and Mel-Frequency Cepstrum Coefficients for Language Identification[J].ISIMP, 2001
9屈丹,王炳锡,魏鑫.基于GMM-UBM模型的语言辨识研究[J].信号处理,2003,19(1):85-88. 被引量：10

二级参考文献11

1Y. K. Muthusamy, E. Barnard and R. A. Cole, "Reviewing Automatic Language Identification", IEEE Signal Processing Magazine, October 1994.
2Berkling, K.M., Arai, T., Barnard, E., Cole, R.A., 1994.Analysis of phoneme-based features for language identification. In: International Conference on Acoustics,Speech, and Signal Processing, Vol. 1, Aprikl 1994, pp.289-292.
3M.A. Zissman. Language identification using phoneme recognition phonotactic language modeling. In Proceedings 1995 IEEE International Conference onAcoustics,Speech, and Signal Processing, pages 3503- 3506, May 1995.
4J. Narvratil and Wemer Zuhlke. Double bigramdecoding in Phonotactic language identification. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 97, Munique,Germany, April 1997.
5Y. K. Muthusamy, R. A. Cole, and B. T. Oshika. The OGI Multi-language telephone speech corpus. Technical report,Center for Spoken Language Understanding Oregon Graduate Institute of Science and Technology, Portland,1993.
6D.A. Reynolds, T. E Quaffed, and R. B. Dunn. Speaker verification using adapted Gaussian mixture models.Digital Signal Processing, Vol. 10, pp 19-41, 2000.
7D.A. Reynolds, and R.C. Rose, Rosust text-independence speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, vol.3, No. 1, pp72-83.
8A. E. Rosenberg and S. Parthasarathy, Speaker background models for connected digit password speaker verification. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing,pp 81-84, 1996
9J. L. Gauvain and C.H. Lee, Maximum a postedori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Trans. Speech Audio Process.Vol.2, pp 291-298,1994.
10M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech",IEEE Trans. Speech Audio Process. Vol. 4, pp 31-44.

共引文献9

1屈丹,侯风雷,王炳锡,吴保民.基于说话人聚类和高斯混合模型的语言辨识研究[J].信号处理,2004,20(3):285-289.
2张强,屈丹,侯风雷,王炳锡.应用说话人聚类技术改善语言辨识系统识别率[J].电声技术,2007,31(3):44-48.
3顾明亮.一种新的汉语方言辨识特征[J].广西科学,2007,14(4):423-425.
4屈丹,闫红刚,唐晖,王炳锡.基于概率统计直方图的压缩域说话人识别[J].数据采集与处理,2009,24(5):594-599.
5陈业仙,张歆奕,毛杰.基于GMM-UBM的语言辨识算法研究[J].五邑大学学报（自然科学版）,2010,24(3):56-60.
6顾明亮,张彪.半监督矢量量化的汉语方言辨识[J].计算机工程与应用,2011,47(33):109-111. 被引量：1
7韩军.基于DBF的汉语方言自动辨识[J].电声技术,2017,41(4):120-124. 被引量：2
8周大春,邵玉斌,张昊阁,龙华,彭艺.应用于噪声环境下语种识别的GFCC改进算法[J].云南大学学报（自然科学版）,2024,46(2):246-254. 被引量：1
9屈丹,王炳锡.基于GMBM-UBBM模型的语言辨识研究[J].计算机工程与应用,2004,40(3):29-32.

1仲海兵,宋彦,戴礼荣.基于音素识别的语种辨识方法中的因子分析[J].模式识别与人工智能,2012,25(1):105-110. 被引量：1
2郑普亮,许刚.时频分布不同特性进行语音分类[J].计算机工程与应用,2005,41(22):48-50. 被引量：2
3屈丹,王炳锡,魏鑫.语言辨识的矢量量化方法(VQ)[J].信息工程大学学报,2002,3(3):54-57.
4屈丹,王炳锡,魏鑫.基于GMM-UBM模型的语言辨识研究[J].信号处理,2003,19(1):85-88. 被引量：10
5张军胜.北广TVU353型分米波发射机合理化改进[J].视听界（广播电视技术）,2007,0(2):51-53.
6张小燕,宿建军,薛化建,王磊.维吾尔语语音识别语料库中的OOV研究[J].计算机工程与设计,2012,33(2):772-776. 被引量：4
7杜利民.自动语言辨识研究(上)[J].电子科技导报,1996(4):16-19. 被引量：3
8王侠,顾明亮,高原,马勇.基于GMM区分性别的汉语方言识别系统[J].电声技术,2011,35(12):39-41.
9陈继旭,刘明辉,戴蓓蒨,李辉.文本无关说话人确认中的一种新的评分规整方法[J].信号处理,2006,22(4):545-549. 被引量：1
10曹敬春,贺永杰.WQ85-Ⅲ微球聚焦测井仪电子线路的改进[J].内江科技,2012,33(12):127-127.

计算机工程与应用

2004年第21期

浏览历史

内容加载中请稍等...

模糊判决支持向量机在自动语种辨识中的研究

参考文献9

二级参考文献11

共引文献9

相关作者

相关机构

相关主题

浏览历史