归一化振幅商在语音情感识别中的应用被引量：1

Normalized Amplitude Quotient Feature in Emotion Recognition

下载PDF

导出

摘要提出了一种新的连续语音情感识别特征:语音元音段声门激励的时域参数归一化振幅商(the normalized amplitude quotient,NAQ)。该方法首先运用迭代自适应逆滤波器(Iterative Adaptive Inverse Filtering,IAIF)估计声门波,然后采用NAQ值来描述声门开启和闭合的特性。采用eNERFACE’05听视觉情感语音数据库中六种不同情感的语音为实验数据,以情感语音元音段的归一化振幅商值为特征,使用直方图和盒形图分析其特征的分布和对情感的区分能力;以情感语句元音段的NAQ值的均值、方差、最大值、最小值作为特征,用高斯混合模型(Gaussian Mixture Models,GMM)和k-近邻法进行了语音情感识别实验,结果表明NAQ特征对语音情感具有较强的区别能力。 A time - domain parameter of the glottal flow, the normalized amplitude quotient （NAQ） is presented as a new emotion feature in this paper. Six emotional speeches from the eNTERFACE＇05 audio -visual emotion database are inversely filtered using Iterative Adaptive Inverse Filtering （IAIF） to estimate the glottal flow and parameterized using NAQ. To evaluate the properties of the emotion features based on NAQ values, firstly, the histogram and boxplot of NAQ features are plotted to see their ability of distinguishing different emotions. Then, the mean, variance, maximum value and minimum value of NAQ features are used in speech emotion classification using Gaussian Mixture Models andk - nearest neighbor classifier. Experimental results show that NAQ value of vowel segments can be used as an effective emotion feature in emotion recognition from speech.

作者白洁蒋冬梅

机构地区西北工业大学计算机学院海军兵种指挥学院作战指挥系

出处《计算机仿真》 CSCD 北大核心 2009年第2期183-186,共4页 Computer Simulation

基金国家自然科学基金项目(60703104)

关键词归一化振幅商迭代自适应逆滤波高斯混合模型近邻法 Normalized amplitude quotient （ NAQ ） Iterative adaptive inverse filtering （IAIF） Ganssian mixture models （GMM） Nearest neighbor algorithm

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Tato Requel, Santos Bocio, Kompe Ralf, J M Pardo. Emotion space improves emotion recognition [ C ]. Proe. ICSLP. Denver, Colorado. 2002,3 : 2029 -2032.
2Laver John. The Phonetic Description of Voice Quality[ M]. Cambridge University Press, 1980.
3Klans R Seherer. Vocal affect expression: A review and a model for future research[ J]. Psychological Bulletin, 1986,99 ( 2 ) : 143 - 165.
4Gobl Christer, Chasaide Ailbhe Ni. The role of voice quality in communicating emotion, mood and attitude[ J]. Speech Communication,2003,40: 189 - 212.
5Alku Paavo, Backstrom Tom, Vilkman Erhhi. Normalized amplitude quotient for parameterization of the glottal flow[ J]. Journal of the Acoustical Society of America, 2002, 112(2) : 701 -710.
6Lehto Laura, et al. Comparison of two inverse filtering methods in parameterization of the glottal closing phase characteristics in different phonation types[J]. Journal Voice, 2007, 21 (2) : 138 - 150.
7Airas Matti, Alku Paavo. Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalized amplitude quotient[ J]. Phonetica,2006, 63 ( 1 ) : 26 - 46.
8O Martin, et al. The eNTERFACE' 05 audio - visual emotion database[ C]. Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006.
9Alku Paavo. Glottal wave analysis with pitch synchronous iterative adaptive, inverse filtering [ J ]. Speech Communication, 1992, 11 (2-3): 109-118.
10S J Young. The HTK Hidden Markov Model Toolkit: Design and Philosophy[ R]. Technical Report, CUED, Cambridge University, 1994.

同被引文献2

1张石清,李乐民,赵知劲.人机交互中的语音情感识别研究进展[J].电路与系统学报,2013,18(2):440-451. 被引量：30
2何凌,黄华,刘肖珩.基于声门特征参数的语音情感识别算法研究[J].计算机工程与设计,2013,34(6):2147-2151. 被引量：4

引证文献1

1李昊璇,师宏慧,乔晓艳.融合声门波信号频谱特征的语音情感识别[J].测试技术学报,2017,31(1):8-16.

1白洁,蒋冬梅,谢磊,付中华,任翠红.基于NAQ的语音情感识别研究[J].计算机应用研究,2008,25(11):3243-3245. 被引量：1
2林时来,刘光远,张慧玲.蚁群算法在呼吸信号情感识别中的应用研究[J].计算机工程与应用,2011,47(2):169-172. 被引量：5
3王秀,谢志成,张栋.一种基于特征差异度和SVM投票机制的数字音乐语音情感识别算法[J].福州大学学报（自然科学版）,2015,43(4):460-465. 被引量：2
4李杰,周萍.语音情感识别中特征参数的研究进展[J].传感器与微系统,2012,31(2):4-7. 被引量：2
5李昊璇.基于扩展卡尔曼滤波器的声门激励LF模型参数估计[J].测试技术学报,2013,27(5):425-430. 被引量：1
6李祖贺,樊养余.基于视觉的情感分析研究综述[J].计算机应用研究,2015,32(12):3521-3526. 被引量：6
7黄琪,李鼎权,施涛,孙菁,刘会刚.一种针对椒盐噪声的新型迭代自适应滤波器[J].南开大学学报（自然科学版）,2015,48(5):84-89. 被引量：4
8钱济国.机械故障的时域参数诊断法[J].煤矿机械,2006,27(9):192-193. 被引量：6
9龙清,罗炜.检察机关网络监测系统的数据采集与分析[J].电脑知识与技术,2016,12(10):20-22.
10YANG Ching-yu,LAI Chen-yuan.Use of quotient-embedding scheme with smart arrangement technique to hide gray-scale data[J].通讯和计算机（中英文版）,2008,5(12):23-27.

计算机仿真

2009年第2期

浏览历史

内容加载中请稍等...

归一化振幅商在语音情感识别中的应用被引量：1

参考文献10

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

归一化振幅商在语音情感识别中的应用 被引量：1

参考文献10

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

归一化振幅商在语音情感识别中的应用被引量：1