情感语音特征对语料库依赖性的统计分析被引量：3

Statistical Analysis for Database Dependence in Classification of Emotional Speech by using Different Features Extraction Approaches

下载PDF

导出

摘要简述线性预测倒谱系数(LPCC)、Teager能量算子(TEO)、梅尔频率倒谱系数(MFCC)和过零峰值幅度(ZCPA)特征提取方法,并将这四种方法应用于情感识别。设计两种实验,第一种是使用TYUT和Berlin语料库的单语言实验,这种实验证明,以上四种特征在单一的语料库单一语言条件下均能够有效地表征语音的情感特征,其中MFCC特征对情感的识别率最高。第二种实验是混合语料库的单一语言实验。之前大多数关于情感特征的研究都是基于某一种语料库中某种特定语言的,但在实际中,说话人的背景环境总是多种多样。因此,对特征的混合语料库研究是有现实意义的。第二种实验证明这四种特征都是语料库依赖性的,其中ZCPA特征的识别率下降最少。 Four approaches of feature extraction： the Linear Predictive Cepstral Coefficient （LPCC）, the Teager Energy Operator （TEO）, the Mel-Frequency Cepstral Coefficient （MFCC） and the Zero Crossings with Peak Amplitudes （ZCPA） are described in this paper. And these approaches are applied to emotional speech recognition. Two kinds of experiments are carded out. The first one is a kind of single language experiments with TYUT database and Berlin database. Its results show that these four approaches can represent speech emotion effectively by using single language of single database. MFCC has the best result of the four approaches. The second kind experiment is merge-database of single language. Most previous work on emotional feature extraction is based on a special language of single speech database. But in practice, the environment of the speaker is various. So the study of emotional feature extraction based on merge-database is signifieative. Experiments of the second kind indicate that the four features are all database dependent. ZCPA features are of the least database dependence of the four approaches.

作者孙颖张雪英

机构地区太原理工大学信息工程学院

出处《噪声与振动控制》 CSCD 北大核心 2011年第4期132-136,共5页 Noise and Vibration Control

基金国家自然科学基金(No.61072087) 山西省自然科学基金(No.2010011020-1) 山西省研究生创新基金(No.20093010)

关键词声学信号处理情感语音识别语料库依赖性情感特征混合语料库 acoustics signal analysis emotional speech recognition database dependence emotional features merge-database

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1Dimitrio Ververillis, Constantine Kotropoulos. Emotional speech recognition: resources, features, and methods [J]. Spoooh Cornmunitation, 2006, 48:1162-1181.
2罗跃嘉,吴健辉.情绪的心理控制与认知研究策略[J].西南师范大学学报（人文社会科学版）,2005,31(2):26-29. 被引量：24
3刘丽媛,严家明.一种孤立词语音识别的实现方法及改进[J].现代电子技术,2010,33(16):109-112. 被引量：3
4袁正午,肖旺辉.改进的混合MFCC语音识别算法研究[J].计算机工程与应用,2009,45(33):108-110. 被引量：18
5Doh-suk Kim, Soo-Yong Lee, maee M. Kil. Auditory processing of speech signal for robust speech recognition in real-world noisy envirroments [J]. IEEE Trasnsaotions Speoch andAuflio Processing, 1999,7(1) :55-58.
6Ying Sun, Xueying Zhang. A Study of Zero-Crossings with Peak-Amplitudes in Speech Emotion Classification [C]. The First International Conference on Pervasive Computing, Signal Processing and Applications, Harbin, China, Sep. 17-19, 20101 328 - 331.
7焦志平,张雪英,赵姝彦.一种基于听觉模型的抗噪语音识别特征提取方法[J].太原理工大学学报,2005,36(1):13-15. 被引量：8
8He L, Lech M, Maddage N, Allen N. Emotion recognition in speech of parents of depressed adolescents [C]. Proceedings of the Third International Conference on Bioinformatics and Biomedical Engineering (ICBBE 2009). Beijing, China, June 11-13, 2009, 1-4.
9F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, B. Weiss. A database of German emotional speech [J]. Proe. Interspeech, 2005: 1517-1520.
10W. M. Chmpbell, J. P. CampeU, D.A. Reynolds, E. Singer, P.A. Torres-Carrasquillo.. Support vector machines for speaker and language recognition [J]. Computex Speech and Language, 2006, 20: 210-229.

二级参考文献34

1黄宇霞,罗跃嘉.国际情绪图片系统在中国的试用研究[J].中国心理卫生杂志,2004,18(9):631-634. 被引量：99
2叶庆云,蒋佳.基于语音MFCC特征的改进算法[J].武汉理工大学学报,2007,29(5):150-152. 被引量：9
3Sandipan C,Anindya R,Sourav M,et al.Capturing complementary information via reversed filter bank and parallel implementation with MFCC for improved text-independent speaker identification[C]// IEEE International Conference on Computing:Theory and Application, India, 2007 : 463-467.
4GERVEN S,XIE FEI.A comparative study of speech detection method[C].EUROSPEECH,Greece,1997:1015-1020.
5GU H,TSENG C,LEE L.Isolated-utterance speech recognition using hidden markov models with bounded states durations[J].IEEE Transaction on SP,l991,39(8):1743-1752.
6易出克,田斌,付强.语音信号处理[M].北京:国防工业出版社,2003.
7韩利竹,王华.Matlab 电子仿真与应用[M].北京:国防工业出版社,2007.
8SILVERMAN H F,PMORGAN D.The application of dynamic programming to connected speech recognition[J].IEEE Assp Mag.,1990,17(7):6-25.
9Levenson R W. The Nature of Emotions[M]. New York: Oxford University Press, 1994, 123-126.
10Ekman P. An argument for basic emotions[J]. Cognition and Emotion, 1992b,6:169-200.

共引文献49

1耿柳娜,刘屈艳扬.负性情绪与工作记忆的关系:认知神经科学新取向[J].中国特殊教育,2009(3):85-89. 被引量：14
2梁五洲,张雪英.基于加权组合过零峰值幅度特征的抗噪语音识别[J].太原理工大学学报,2006,37(1):84-86. 被引量：3
3梁芳泉,张雪英.一种抗噪语音识别算法的DSP实现[J].电脑开发与应用,2006,19(4):12-14. 被引量：2
4罗跃嘉,黄宇霞,李新影,李雪冰.情绪对认知加工的影响:事件相关脑电位系列研究[J].心理科学进展,2006,14(4):505-510. 被引量：75
5孙颖,张雪英.基于高斯小波滤波器的语音识别特征提取方法[J].太原理工大学学报,2007,38(2):146-149. 被引量：2
6林崇德,陈英和.中国发展心理学30年的进展[J].北京师范大学学报（社会科学版）,2009(1):38-46. 被引量：17
7杨集梅,郑涌,徐莹.积极情绪研究述评:健全人格的视角[J].西南大学学报（社会科学版）,2009,35(2):149-152. 被引量：8
8王鹏,张雪英.改进的T-S模糊神经网络在语音识别中的应用[J].计算机工程与应用,2009,45(4):246-248. 被引量：7
9丁乃姝,石文典.试论情绪弹性[J].心理学探新,2009,29(3):18-21. 被引量：20
10孟秀艳,王志良,许鸣珠,张霞.一种非线性模糊情感模型的研究与仿真[J].系统仿真学报,2009,21(19):6232-6238. 被引量：1

同被引文献42

1CHIAVERINI S,SICILIANO B, VILLANI L. A survey of robot interaction control schemes with experimental com- parison [ J ]. IEEE/ASME Trans. Mechatronics, 1999, 4(3) :273-285.
2GASSERT R, MOSER R, BURDET E, et ah MRI/fMRI- compatible robotic system with force feedback for interac- tion with human motion [ J ]. IEEE/ASME Trans. Mecha- tronics, 2006,11 (2) :216-224.
3KULJI B ,JANOS S ,TIBOR S. Mobile robot controlled by voice [C ]. International Symposium on Intelligent Systems and Informatics ,2007:89-192.
4LIU P X,CHAN A D C,CHEN R, et al. Voice based ro- bot control [ C ]. International Conference on Information Acquisition. 2005:543-547.
5JEAN J H, HSIEH M J, LIN Z. Development of a house- keeping robot with visual servoing capabilities [ C ]. IC- CAS-SICE,2009:712-716.
6BUDIHARTO W, JAZIDIE A, PURWANTO D. Indoor navigation using adaptive neuro fuzzy controller for serv- ant robot[ C ]. International Conference on Computer En- gineering and Applications (ICCEA) ,2010:582-586.
7TI-I/ANG D W. Limited speech recognition for controlling movement of mobile robot implemented on ATmega162 mi- crocontmller[ C]. International Conference on Computer and Automation Engineering (ICCAE) ,2009:347-350.
8WEIGAND E. Emotions:The simple and the complex[ M].Amsterdam/Philadelphia:John Benjamins Publishing Com- pany ,2004.
9MURRAY I R, ARNOTr J L. Toward the simulation of emotion in synthetic speech:a review of the literature on human vocal emotion[ J]. Journal of the Acoustical Society of America, 1993,93 (2) : 1097-1108.
10GUYON I, GUNN S,NIKRAVESH M,et al. Feature extrac- tion, foundations and applications [ M ]. Springer,2006.

引证文献3

1李翔,李昕,胡晨,卢夏衍.面向智能机器人的Teager语音情感交互系统设计与实现[J].仪器仪表学报,2013,34(8):1826-1833. 被引量：10
2宋静,张雪英,孙颖,张卫.基于PAD情绪模型的情感语音识别[J].微电子学与计算机,2016,33(9):128-131. 被引量：10
3孙颖,吕慧芬,张雪英,马江河.情感维度下的深度情感关联模型[J].西安电子科技大学学报,2019,46(5):24-30. 被引量：8

二级引证文献27

1赵其杰,邵辉,卢建霞.基于头眼行为的交互意图检测方法[J].仪器仪表学报,2014,35(10):2313-2320. 被引量：8
2孙凌云,何博伟,刘征,杨智渊.基于语义细胞的语音情感识别[J].浙江大学学报（工学版）,2015,49(6):1001-1008. 被引量：2
3金银超,杨晖,李然,纪振发.基于Teager能量算子的语音端点检测算法研究[J].信息技术,2017,41(2):137-140. 被引量：5
4陈逸灵,程艳芬,陈先桥,王红霞,李超.PAD三维情感空间中的语音情感识别[J].哈尔滨工业大学学报,2018,50(11):160-166. 被引量：6
5李吉,黄微,郭苏琳,孙悦.网络口碑舆情情感强度测度模型研究——基于PAD三维情感模型[J].情报学报,2019,38(3):277-285. 被引量：24
6黎雨星,梁正友,孙宇.结合差分演化和逻辑回归的构音障碍自动识别方法[J].计算机与现代化,2019,0(8):1-5. 被引量：1
7孙颖,吕慧芬,张雪英,马江河.情感维度下的深度情感关联模型[J].西安电子科技大学学报,2019,46(5):24-30. 被引量：8
8褚钰,李田港,叶硕,叶光明.语音情感识别中的特征选择方法[J].应用声学,2020,39(2):216-222. 被引量：5
9叶硕,褚钰,王祎,李田港.语音识别中声学模型研究综述[J].计算机技术与发展,2020,30(3):181-186. 被引量：5
10孔冉,秦陈.受众情感在交互式广告设计中的建构研究[J].包装工程,2020,41(8):227-232. 被引量：10

1IFA2013前瞻:整合生态系统或成趋势[J].工业设计,2013(10):61-62.
2宋静,张雪英,孙颖,张卫.基于PAD情绪模型的情感语音识别[J].微电子学与计算机,2016,33(9):128-131. 被引量：10
3张子恒,孙颖,姚慧.基于混沌特性的情感语音非线性特征研究[J].微电子学与计算机,2017,34(4):65-68. 被引量：2
4黄丽霞,张雪英,刘雪艳.多种前端滤波器的ZCPA对语音多变性的鲁棒性研究[J].太原理工大学学报,2011,42(3):215-218.
5成凌飞,冯艳伟.“信息论与编码”课程教学探索与实践[J].电气电子教学学报,2012,34(5):25-26. 被引量：5
6姚慧,孙颖,张雪英.情感语音的非线性动力学特征[J].西安电子科技大学学报,2016,43(5):167-172. 被引量：14
7李宏松,苏健民,黄英来,于慧伶.基于声音信号的特征提取方法的研究[J].信息技术,2006,30(1):91-94. 被引量：25
8李哲军,周萍,景新幸.基于改进噪声估计的谱减法应用于说话人识别[J].计算机测量与控制,2016,24(4):155-158.
9舒若,李世宝,潘辛.SVAC音频编码的特征参数量化器改进[J].信息技术,2014,38(6):50-54.
10孙颖,姚慧,张雪英,张奇萍.基于混沌特性的情感语音特征提取[J].天津大学学报（自然科学与工程技术版）,2015,48(8):681-685. 被引量：12

噪声与振动控制

2011年第4期

浏览历史

内容加载中请稍等...

情感语音特征对语料库依赖性的统计分析被引量：3

参考文献11

二级参考文献34

共引文献49

同被引文献42

引证文献3

二级引证文献27

相关作者

相关机构

相关主题

浏览历史

情感语音特征对语料库依赖性的统计分析 被引量：3

参考文献11

二级参考文献34

共引文献49

同被引文献42

引证文献3

二级引证文献27

相关作者

相关机构

相关主题

浏览历史

情感语音特征对语料库依赖性的统计分析被引量：3