Application of Fuzzy Vector Quantization in Speech Emotion Recognition

Improved Fuzzy VQ Algorithm in Speech Emotion Recognition
Abstract A speech emotion recognition method is proposed that combines an improved fuzzy c-means clustering algorithm with vector quantization (VQ). It recognizes four emotions: happiness, anger, sadness, and surprise. First, global-structure and temporal-structure feature parameters are extracted from the emotional utterances and normalized by gender. The improved fuzzy VQ method is then used to design the codebooks, and finally the speech to be recognized is classified. The algorithm addresses the sensitivity of fuzzy c-means to its initial values and its tendency to become trapped in local optima, and gender normalization makes the feature parameters more effective, which raises the recognition rate further. Experimental results show that the algorithm effectively improves the recognition rate.
Source Audio Engineering (《电声技术》), 2008, No. 10, pp. 49-51, 55 (4 pages)
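Funding National Natural Science Foundation of China (60472058); Doctoral Program Foundation of the Ministry of Education (20050286001); Ministry of Education "New Century Excellent Talents Support Program"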
Keywords fuzzy c-means; vector quantization (VQ); speech emotion recognition; gender normalization
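
The pipeline outlined in the abstract (gender normalization, fuzzy codebook design, recognition) can be sketched roughly as follows. This is a minimal illustration, not the authors' algorithm: the abstract does not say how fuzzy c-means was improved or how gender normalization was carried out, so the sketch assumes plain fuzzy c-means with seeded random initialization, a per-gender z-score, and a minimum-distortion decision rule. All function names (`normalize_by_gender`, `fuzzy_cmeans_codebook`, `distortion`, `classify`) and parameters are hypothetical.

```python
import numpy as np


def normalize_by_gender(features, genders):
    """Assumed normalization: z-score each feature within each gender group."""
    features = np.asarray(features, dtype=float)
    genders = np.asarray(genders)
    out = np.empty_like(features)
    for g in np.unique(genders):
        mask = genders == g
        mean = features[mask].mean(axis=0)
        std = features[mask].std(axis=0) + 1e-12  # avoid division by zero
        out[mask] = (features[mask] - mean) / std
    return out


def fuzzy_cmeans_codebook(x, n_codewords, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means; the cluster centres serve as the codebook."""
    x = np.asarray(x, dtype=float)
    rng = np.random.default_rng(seed)
    # Initial centres: randomly chosen training vectors (an assumption).
    centres = x[rng.choice(len(x), size=n_codewords, replace=False)]
    for _ in range(n_iter):
        # Squared distances, shape (n_samples, n_codewords).
        d2 = ((x[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2) + 1e-12
        # Memberships: proportional to d2^(-1/(m-1)), each row sums to 1.
        inv = d2 ** (-1.0 / (m - 1.0))
        u = inv / inv.sum(axis=1, keepdims=True)
        # Centre update: mean of the data weighted by u^m.
        w = u ** m
        new_centres = (w.T @ x) / w.sum(axis=0)[:, None]
        shift = np.abs(new_centres - centres).max()
        centres = new_centres
        if shift < tol:
            break
    return centres


def distortion(x, codebook):
    """Mean squared distance from each vector to its nearest codeword."""
    x = np.asarray(x, dtype=float)
    d2 = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).mean()


def classify(x, codebooks):
    """Return the label whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda label: distortion(x, codebooks[label]))
```

In use, one codebook would be trained per emotion from that emotion's normalized feature vectors, collected in a dictionary such as `{"happiness": ..., "anger": ...}`, and `classify` would be called on the feature vectors (a 2-D array) of the utterance to be recognized.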