模糊矢量量化在语音情感识别中的应用

Improved Fuzzy VQ Algorithm in Speech Emotion Recognition

下载PDF

导出

摘要提出了一种将改进的模糊C均值聚类算法与矢量量化相结合的语音情感识别方法,实现了对4种情感的识别:高兴、生气、悲伤和惊奇。首先提取情感语句全局结构和时序结构特征参数并进行性别规整,再利用改进后的模糊矢量量化方法来设计码本,最后对待识别语音进行辩识。该算法不但解决了模糊C均值算法对初始值敏感、易陷入局部最优的问题,而且性别规整改善了特征参数的有效性,使识别率得以进一步提高。实验结果表明该算法能够有效改善识别率。 A method which combines improved fuzzy c-mean clustering and VQ（Veetor Quantization） is proposed. Four emotions, namely happiness, angry, sadness and surprise, are recognized. Firstly, globe and time sequence features are extracted from speech signals, and modified according to the gender difference. Then code book is designed by improved fuzzy VQ. Finally the emotion of the speech is recognized. The problem of sensitive to initial condition is settled, and the local optimization is also avoided. In addition, the features validity is improved by gender modification. The result shows the better recognition rate.

作者狄金海赵艳赵力

机构地区浙江工贸职业技术学院东南大学信息科学与工程学院

出处《电声技术》 2008年第10期49-51,55,共4页 Audio Engineering

基金国家自然基金(60472058) 教育部博士点基金(20050286001) 教育部"新世纪优秀人才支持计划"

关键词模糊C均值矢量量化语音情感识别性别规整 fuzzy c-mean VQ speech emotion recognition gender modification

分类号 TN912 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献8

1SHIRASAWA T, YAMAMURA T. Discriminating emotion intended in speech[C]// The Preprint of the Acoustical Society of Japan. [S.l.] : AS J, 1997, HIP: 38-96.
2BHATTI M W, WANG Yong-jin, GUAN Ling. A neural network approach for human emotion recognition in speech [C]// Proceedings of the 2004 International Symposium on Circuits and Systems. [S.l.] : IEEE Press, 2004,2 : 181-184.
3LEE C M, MARAYANAN S. Emotion recognition using a data-driven fuzzy inference system[C]// Proceedings of Eurospeech 2003. Geneva:[s.n.],2003:2521-2524.
4王治平,赵力,邹采荣.基于基音参数规整及统计分布模型距离的语音情感识别[J].声学学报,2006,31(1):28-34. 被引量：26
5PAO Tsang-long, CHEN Yu-te. Detecting emotions in mandarin speech[J]. Computational Linguistics and Chinese language processing,2005,10(3) :347-362.
6赵力,王治平,卢韦,邹采荣,吴镇扬.全局和时序结构特征并用的语音信号情感特征识别方法[J].自动化学报,2004,30(3):423-429. 被引量：15
7ZHAO L, KOBAYASHI Y, NIIMI Y. Tone recongintion of Chinese continuous speech using continuous HMMs[J]. Journal of the Acoustic Society of Japan,1997,53(12): 933-940.
8BEZDEK J C, A convergence theorem for the fuzzy ISODATA clustering algorithms[J]. IEEE Trans. on PAMI, 1990(2) : 1-8.

二级参考文献14

1Picard R W. Affective Computing. Cambridge: MIT Press,1997
2Yoshitom Y, KIM S, Kawano T et al. Effect of sensor fusion for recognition of emotional states using voice, face image and thermal image of face. In: Proceedings, 9th IEEE International Workshop on Robot and Human Interactive communication, Osaka, 2000; 1:178-183
3Dellaert F, Polzin T, Waibel A. Recognizing emotion in speech. In: 4th International Conference on Spoken Language Processing, Philadelphia; 1996:1970-1973
4Yacoub S, Simske S, Lin X et al. Recognition of emotions in interactive voice response systems. Hewlett-Pachard Labratories HPL-2003-136, 2003
5Lin X, Chen Y, Lira Set al. Recognition of emotional state from spoken sentenses. In: IEEE 3rd Workshop on Multimedia Signal Processing, Copenhagen, 1999:469-473
6Breazeal C. Regulation and Entrainment in Human-Robot Interaction. International Journal of Robotic Research,2002; 21(10-11): 883-902
7Pao T, Chen Y, Ych Jet al. An exploratory study on emotion recognition in mandarin speech. In: 1st Chinese Conference on Affectie Computing and Intelligent Interaction, Bcijing, 2003; 1:206-212
8Bosch L T. Emotions; What is possible in the ASR framework. In: ISCA Workshop on Speech and Emotion,Belfast, 2000
9Pereira C. Dimensions of emotionM meaning in speech. In:ISCA Workshop on Speech and Emotion, Belfast, Northern Ireland, 2000
10赵力,钱向民,邹采荣,吴镇扬.语音信号中的情感特征分析和识别的研究[J].通信学报,2000,21(10):18-24. 被引量：28

共引文献34

1张石清,刘瑞欣,赵小明.跨库语音情感识别研究进展[J].计算机系统应用,2022,31(11):31-48.
2韩文静,李海峰,韩纪庆.基于长短时特征融合的语音情感识别方法[J].清华大学学报（自然科学版）,2008,48(S1):708-714. 被引量：20
3赵腊生,张强,魏小鹏.语音情感识别研究进展[J].计算机应用研究,2009,26(2):428-432. 被引量：21
4韩文静,李海峰.基于韵律语段的语音情感识别方法研究[J].清华大学学报（自然科学版）,2009(S1):1363-1368. 被引量：8
5马希荣,刘琳,桑婧.基于情感计算的e-Learning系统建模[J].计算机科学,2005,32(8):131-133. 被引量：13
6苏庄銮,汪增福.基于统计方法的普通话情感语调模型[J].自动化学报,2007,33(7):673-677. 被引量：2
7余伶俐,蔡自兴,陈明义.语音信号的情感特征分析与识别研究综述[J].电路与系统学报,2007,12(4):76-84. 被引量：27
8丁辉,唐振民,钱博,李燕萍.易扩展小样本环境说话人辨认系统的研究[J].系统仿真学报,2008,20(10):2779-2781.
9陈雪勤,赵鹤鸣,俞一彪.蚁群聚类神经网络的耳语音声调识别[J].应用科学学报,2008,26(5):511-515.
10孟庆梅,吴伟国.Artificial emotional model based on finite state machine[J].Journal of Central South University of Technology,2008,15(5):694-699. 被引量：4

1吴宪,刘民航,范琨,陈牧原.基于模糊矢量量化的语音转换方法[J].信息化研究,2012,38(2):48-51. 被引量：1
2冯前进,陈武凡,林亚忠.一种新的模糊矢量量化算法[J].中国医学物理学杂志,2001,18(4):199-200.
3杨兵,谢维信.基于基因算法的隐马尔可夫模型参数估计[J].系统工程与电子技术,2002,24(7):74-76. 被引量：1
4陶阿加.即点即唱诺基亚5320 XpressMusic[J].移动信息,2008,0(7):68-70.
5孔祥维,李国平.一种基于高分辨率的模糊矢量量化算法[J].电子学报,2000,28(8):97-99. 被引量：3
6张基宏,谢维信.一种快速模糊矢量量化图像编码算法[J].电子学报,1999,27(2):106-108. 被引量：5
7杨彦,赵力.一种改进的模糊C-均值聚类算法在说话人识别中的应用[J].电声技术,2006,30(1):40-43. 被引量：4
8余华,徐开军.基于模糊集理论的语音情感识别[J].信息化研究,2011,37(2):53-55. 被引量：1
9张基宏,何振亚.一种指数型模糊学习矢量量化图像编码算法[J].通信学报,1998,19(10):1-6. 被引量：6
10肖楠.生物辩识电子护照争议中求发展[J].电子技术（上海）,2005,32(12):22-24. 被引量：2

电声技术

2008年第10期

浏览历史

内容加载中请稍等...

模糊矢量量化在语音情感识别中的应用

参考文献8

二级参考文献14

共引文献34

相关作者

相关机构

相关主题

浏览历史