Novel acoustic features for speech emotion recognition 被引量：2

Novel acoustic features for speech emotion recognition

导出

摘要 This paper focuses on acoustic features that effectively improve the recognition of emotion in human speech.The novel features in this paper are based on spectral-based entropy parameters such as fast Fourier transform(FFT) spectral entropy,delta FFT spectral entropy,Mel-frequency filter bank(MFB) spectral entropy,and Delta MFB spectral entropy.Spectral-based entropy features are simple.They reflect frequency characteristic and changing characteristic in frequency of speech.We implement an emotion rejection module using the probability distribution of recognized-scores and rejected-scores.This reduces the false recognition rate to improve overall performance.Recognized-scores and rejected-scores refer to probabilities of recognized and rejected emotion recognition results,respectively.These scores are first obtained from a pattern recognition procedure.The pattern recognition phase uses the Gaussian mixture model(GMM).We classify the four emotional states as anger,sadness,happiness and neutrality.The proposed method is evaluated using 45 sentences in each emotion for 30 subjects,15 males and 15 females.Experimental results show that the proposed method is superior to the existing emotion recognition methods based on GMM using energy,Zero Crossing Rate(ZCR),linear prediction coefficient(LPC),and pitch parameters.We demonstrate the effectiveness of the proposed approach.One of the proposed features,combined MFB and delta MFB spectral entropy improves performance approximately 10% compared to the existing feature parameters for speech emotion recognition methods.We demonstrate a 4% performance improvement in the applied emotion rejection with low confidence score. This paper focuses on acoustic features that effectively improve the recognition of emotion in human speech. The novel features in this paper are based on spectral-based entropy parameters such as fast Fourier transform (FFT) spectral entropy, delta FFT spectral entropy, Mel-frequency filter bank (MFB) spectral entropy, and Delta MFB spectral entropy. Spectral-based entropy features are simple. They reflect frequency characteristic and changing characteristic in frequency of speech. We implement an emotion rejection module using the probability distribution of recognized-scores and rejected-scores. This reduces the false recognition rate to improve overall performance. Recognized-scores and rejected-scores refer to probabilities of recognized and rejected emotion recognition results, respectively. These scores are first obtained from a pattern recognition procedure. The pattern recognition phase uses the Gaussian mixture model (GMM). We classify the four emotional states as anger, sadness, happiness and neutrality. The proposed method is evaluated using 45 sentences in each emotion for 30 subjects, 15 males and 15 females. Experimental results show that the proposed method is superior to the existing emotion recognition methods based on GMM using energy, Zero Crossing Rate (ZCR), linear prediction coefficient (LPC), and pitch parameters. We demonstrate the effectiveness of the proposed approach. One of the proposed features, combined MFB and delta MFB spectral entropy improves performance approximately 10% compared to the existing feature parameters for speech emotion recognition methods. We demonstrate a 4% performance improvement in the applied emotion rejection with low confidence score.

作者 ROH Yong-Wan KIM Dong-Ju LEE Woo-Seok HONG Kwang-Seok

机构地区 School of Information and Communication Engineering Development Division / DSP Development Team

出处《Science China(Technological Sciences)》 SCIE EI CAS 2009年第7期1838-1848,共11页 中国科学（技术科学英文版）

基金 Supported by MIC,Korea under ITRC IITA-2009-(C1090-0902-0046) the Korea Science and Engineering Foundation(KOSEF) funded by the Korea government(MEST)(Grant No.20090058909)

关键词 SPEECH EMOTION RECOGNITION MFB SPECTRAL ENTROPY ENTROPY EMOTION RECOGNITION REJECTION speech emotion recognition MFB spectral entropy entropy emotion recognition rejection

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献40

1Bhatti M W,Wang Y,Guan L.A neural network approach for human emotion recognition in speech. Proceedings of the2004Interna-tional Symposium on Circuits and Systems(ISCAS’04) . 2004
2Lee C M,Narayanan S.Towards detecting emotions in spoken dia-logs. IEEE Transactions on Speech and Audio Processing . 2004
3Dellaert F,Polzin T,Waibel A.Recognizing emotion in speech. Proceedings of Fourth International Conference on Spoken Language Processing(ICSLP’96) . 1996
4Amir N.Classifying emotions in speech.A comparison of methods. Proceedings of European Conference on Speech Communication and Technology(EUROSPEECH’01) . 2001
5Lee C M,Narayanan S,Pieraccini R.Recognition of negative emotions from the speech signal. Proceedings of IEEE Work-shop on Automatic Speech Recognition and Understanding . 2001
6Altun H,Polat G.New Frameworks to Boost Feature Selection Alg-orithms in Emotion Detection for Improved Human Computer Interac-tion. Lecture Notes in Computer Science . 2007
7Kim E H,Hyun K H,Kwak Y K.Improvement of emotion recogni-tion from voice by separation of obstruent. 15th IEEE International Symposium on Robut and Human Interactive Communication(RO-MAN06) . 2006
8Kim E H,Hyun K H,Kim S H,et al.Speech emotion recognition using Eigen-FFT in clean and noisy environments. 16th IEEE Inter-national Conference on Robot&Human Interactive Communication . 2007
9Borchert M,Dusterhoft A.Emotion in speech-experiments with prosody and quality features in speech for use in categorical and di-mensional emotion recognition environments. Natural Language Processing and Knowledge Engineering,IEEE NLP-KE′05.Pro-ceedings of2005IEEE International Conference on . 2005
10Noda T,Yano Y,Doki S,et al.Adaptive emotion recognition in speech by feature selection based on KL-divergence. IEEE Interna-tional Conference on System,Man,and Cybernetics . 2006

引证文献2

1Yang Lingzhi,Ban Xiaojuan,Michele Mukeshimana,Chen Zhe.Multiple feature fusion for unimodal emotion recognition[J].The Journal of China Universities of Posts and Telecommunications,2019,26(2):17-29. 被引量：1
2李琳,考希宾,万红.多源异构数据的情绪状态识别[J].人类工效学,2021,27(5):44-47.

二级引证文献1

1柴庆凤,史霖炎,梅珊,熊海涛,贺惠新.基于人工特征和机器特征融合的科技文献知识元抽取[J].数据分析与知识发现,2021,5(8):132-143. 被引量：11

1ZHANG Xiaodan,HUANG Chengwei,ZHAO Li,ZOU Cairong.Recognition of practical speech emotion using improved shuffled frog leaping algorithm[J].Chinese Journal of Acoustics,2014,33(4):441-456. 被引量：4
2STM32F102:MEMS eMotion解决方案[J].世界电子元器件,2014(1):19-20.
3Sarah Kadzomba.The Difference Between Males and Females Regarding the Effect of Children on Relationships[J].Psychology Research,2013,3(5):243-251.
4Zheng-wei HUANG,Wen-tao XUE,Qi-rong MAO.Speech emotion recognition with unsupervised feature learning[J].Frontiers of Information Technology & Electronic Engineering,2015,16(5):358-366. 被引量：1
5陈鹏.ZENBOOK Prime足以媲美Macbook Air 专访华硕全球副总裁许先越先生[J].微型计算机,2012(18):41-41.
6LIU Ye FU QiuFang FU XiaoLan.The interaction between cognition and emotion[J].Chinese Science Bulletin,2009,54(22):4102-4116. 被引量：8
7冯茜芦,潘金贵.一种基于句子的信息检索模型研究[J].计算机应用与软件,2010,27(3):162-164.
8郑艳.电力系统继电保护装置A/D采集系统的设计[J].工业设计,2015(3):92-93. 被引量：1
9程波,刘光远.基于小波变换的表面肌电信号的情感识别[J].计算机工程与应用,2007,43(35):216-218. 被引量：2
10汪浩,王朝坤,徐亚军,宁苑池.Dominant Skyline Query Processing over Multiple Time Series[J].Journal of Computer Science & Technology,2013,28(4):625-635.

Science China(Technological Sciences)

2009年第7期

浏览历史

内容加载中请稍等...

Novel acoustic features for speech emotion recognition 被引量：2

参考文献40

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史