期刊文献+

有效频带多分辨率特征提取及说话人年龄识别 被引量:4

Multi Resolution Feature Extraction of Effective Frequency Bands for Age Recognition
下载PDF
导出
摘要 针对文本无关非特定说话人年龄识别,本文提出了一种基于有效频带多分辨率特征的统计分析识别方法。输入语音,通过小波包变换进行有效频带分解,然后将各有效频带的小波包系数连接构成一个整体计算美尔频率倒谱系数,得到有效频带多分辨率特征参数WPMFC(Wavelet Packet Mel-Frequency Cepstrum),说话人按年龄划分为儿童、青年、中年和老年四个阶段,并进一步按性别训练各年龄段语音得到8个高斯混合模型。测试语音依据最大似然准则进行识别判决。实验对本文提出的方法与传统的短时谱统计分析方法进行了比较,结果显示本文提出的方法有较好的识别性能,集内平均识别率达到65.17%。同时,实验结果也说明相对语音文本变化的影响,不同说话人发音特征的变化对识别性能的影响更大。 For speaker and text independent age recognition, a new multi-resolution feature extraction algorithm is pro- posed. The input speech is decomposed by wavelet packet transform, and then the wavelet packet coefficients of each effec- tive frequency band are connected to form a intermediate signal for further calculating of its Mel-frequency cepstrum coeffi- cients which is called Wavelet Packet Mel-Frequency Cepstrum Coefficient (WPMFC). The speaker age is divided into four age groups such as children, youths, adult and older, and totally eight Gaussian mixture models are trained for each age group and gender. Testing speech recognition decision is based on maximum likelihood criterion. The results of experi- mental prove that the performance of age recognition based on proposed feature extraction algorithm is successful compared with traditional short time spectral statistical analysis methods, the average recognition rate of outset speaker age reached 65.17%. What's more, comparing with the influence of the change of the voice content, the change of the characteristics of the speaker' s pronunciation has more influence on the recognition performance.
出处 《信号处理》 CSCD 北大核心 2016年第9期1101-1107,共7页 Journal of Signal Processing
关键词 说话人年龄识别 有效频带 多分辨率特征 小波包变换 speaker age recognition effective frequency bands multi-resolution features wavelet packet transform
  • 相关文献

参考文献11

  • 1Van Heerden C, Barnard E, Davel M, et al. Combining regression and classification methods for improving auto- matic speaker age recognition [ C ]//Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010:5174-5177.
  • 2Verma R, Sarkar P, Rao K S. Conversion of Neutral Speech to Storytelling Style Speech [ C ] ff Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on. IEEE, 2015:1-6.
  • 3Minematsu N, Sekiguchi M, Hirose K. Automatic esti- mation of one's age with his/her speech based upon a- coustic modeling techniques of speakers [ C ] ff Acoustics Speech and Signal Processing (ICASSP), 2002 IEEE In- ternational Conference on. IEEE, 2002:I-137-I-140.
  • 4Printz H, Gulati V. Method and Apparatus for Automati- cally Determining Speaker Characteristics for Speech-Di- rected Advertising or Other Enhancement of Speech-Con- trolled Devices or Services: US, US 20080103761 A1 [P]//2008.
  • 5Feld M, Barnard E, Van Heerden C, et al. Multilingual Speaker Age Recognition: Regression Analyses on the Lwazi Corpus [ C ]//Automatic Speech Recognition & Un- derstanding (ASRU), 2009 IEEE Workshop on. IEEE,2009:534-539.
  • 6Yue Mengdi, Chen Ling, Zhang Jie, et al. Speaker age recognition based on isolated words by using SVM [ C ]/// Cloud Computing and Intelligence Systems (CCIS), 2014 IEEE 3rd International Conference on. IEEE, 2014:282- 286.
  • 7Chen O T C, Gu J J. Improved gender/age recognition sys- tem using arousal-selection and feature-selection schemes [ C] JJDigital Signal Processing (DSP), 2015 IEEE Interna- tional Conference on. IEEE, 2015 : 148-152.
  • 8Hui Lin, Yu Yibiao. Acoustic feature analysis and con- version of age speech [ C ]//J6th IET International Confer- ence on Wireless, Mobile and Multimedia Networks (IC- WMMN) ,2015 : 147-151.
  • 9Zhang Lei, Han Jiqing, Wang Chengfa. A Novel Weigh- ted Likelihood Measure for Speech Recognition Under G- Force [ A ] ///Joint Conference on Information Science. USA: North Carolina. 2003:692-696.
  • 10Yang Z, Ling K, Wang J. Notice of Retraction Applica- tion of a new wavelet algorithm in hydrological periodic a- nalysis[ C] JJComputer Engineering and Technology ( IC- CET). 2010 2nd International Conference on. IEEE, 2010 : V6-17-V6-21.

二级参考文献8

共引文献9

同被引文献19

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部