Feature Optimization of Speech Emotion Recognition

Feature Optimization of Speech Emotion Recognition

下载PDF

导出

摘要 Speech emotion is divided into four categories, Fear, Happy, Neutral and Surprise in this paper. Traditional features and their statistics are generally applied to recognize speech emotion. In order to quantify each feature’s contribution to emotion recogni-tion, a method based on the Back Propagation (BP) neural network is adopted. Then we can obtain the optimal subset of the features. What’s more, two new characteristics of speech emotion, MFCC feature extracted from the fundamental frequency curve (MFCCF0) and amplitude perturbation parameters extracted from the short- time av-erage magnitude curve (APSAM), are added to the selected features. With the Gaus-sian Mixture Model (GMM), we get the highest average recognition rate of the four emotions 82.25%, and the recognition rate of Neutral 90%. Speech emotion is divided into four categories, Fear, Happy, Neutral and Surprise in this paper. Traditional features and their statistics are generally applied to recognize speech emotion. In order to quantify each feature’s contribution to emotion recogni-tion, a method based on the Back Propagation (BP) neural network is adopted. Then we can obtain the optimal subset of the features. What’s more, two new characteristics of speech emotion, MFCC feature extracted from the fundamental frequency curve (MFCCF0) and amplitude perturbation parameters extracted from the short- time av-erage magnitude curve (APSAM), are added to the selected features. With the Gaus-sian Mixture Model (GMM), we get the highest average recognition rate of the four emotions 82.25%, and the recognition rate of Neutral 90%.

作者 Chunxia Yu Ling Xie Weiping Hu Chunxia Yu;Ling Xie;Weiping Hu(GuangXi Key Lab of Multi-Source Information Mining and Security, GuangXi Normal University, Guilin, China)

机构地区 GuangXi Key Lab of Multi-Source Information Mining and Security

出处《Journal of Biomedical Science and Engineering》 2016年第10期37-43,共8页 生物医学工程（英文）

关键词 Speech Emotion Recognition Feature Selection Feature Extraction BP Neural Network GMM Speech Emotion Recognition Feature Selection Feature Extraction BP Neural Network GMM

分类号 TN9 [电子电信—信息与通信工程]

引文网络
相关文献

1LIN Long,TAN Liang.Multi-Distributed Speech Emotion Recognition Based on Mel Frequency Cepstogram and Parameter Transfer[J].Chinese Journal of Electronics,2022,31(1):155-167. 被引量：1
2Feng Baoguo.New Characteristics and Trend of International Oil and Gas M&A[J].China Oil & Gas,2021,28(4):46-51.
3Mohammad Shorfuzzaman,Mehedi Masud.On the Detection of COVID-19 from Chest X-Ray Images Using CNN-Based Transfer Learning[J].Computers, Materials & Continua,2020(9):1359-1381. 被引量：3
4Disne SIVALINGAM.An Approach to Speech Emotion Classification Using k-NN and SVMs[J].Instrumentation,2021,8(3):36-45.
5K. A. Mohamed Junaid.Classification Using Two Layer Neural Network Back Propagation Algorithm[J].Circuits and Systems,2016,7(8):1207-1212.
6Konstantinos Goulianas,Athanasios Margaris,Ioannis Refanidis,Konstantinos Diamantaras,Theofilos Papadimitriou.A Back Propagation-Type Neural Network Architecture for Solving the Complete n ×n Nonlinear Algebraic System of Equations[J].Advances in Pure Mathematics,2016,6(6):455-480. 被引量：1
7Jiangyan Wang,Zumin Wang,Dan Mao,Dan Wang.The development of hollow multishelled structure: from the innovation of synthetic method to the discovery of new characteristics[J].Science China Chemistry,2022,65(1):7-19. 被引量：1
8David A. E. Vares,Trevor N. Carniello,Michael A. Persinger.Quantification of the Diminishing Earth’s Magnetic Dipole Intensity and Geomagnetic Activity as the Causal Source for Global Warming within the Oceans and Atmosphere[J].International Journal of Geosciences,2016,7(1):78-90.
9LUO Hui,HAN Jiqing.Semi-supervised Robust Feature Selection with l_(q)-Norm Graph for Multiclass Classification[J].Chinese Journal of Electronics,2021,30(4):611-622.
10Splendid Beijing Olympic,Paralympic Winter Games for All,Including Gourmets[J].Women of China,2022(5):7-7.

Journal of Biomedical Science and Engineering

2016年第10期

浏览历史

内容加载中请稍等...

Feature Optimization of Speech Emotion Recognition

相关作者

相关机构

相关主题

浏览历史