
CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction (cited by: 11)
Abstract: To improve speech recognition accuracy in noisy environments, this paper proposes an improved speech feature extraction algorithm. A new cochlear filter cepstral coefficient (NCFCC) is extracted using a power-law nonlinear function that models the auditory characteristics of the human ear. Spectral subtraction is introduced at the front end of feature extraction to enhance the signal, and the new feature is combined with its first-order difference to form a hybrid feature parameter. Principal component analysis (PCA) is then applied to reduce the dimensionality of the hybrid feature, and the resulting feature is used in a speaker-independent, isolated-word, small-vocabulary speech recognition system. Experimental results show that, compared with the traditional cochlear filter cepstral coefficient (CFCC) feature, the feature extracted with the power-law nonlinear function significantly improves recognition accuracy; that the hybrid feature parameter achieves better recognition performance than a single feature; and that the feature set obtained after PCA can reach a recognition accuracy of 88.10% at a signal-to-noise ratio (SNR) of 0 dB.
Authors: BAI Jing; SHI Yanyan; XUE Peiyun; GUO Qianyan (College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China)
Source: Journal of Xidian University (indexed in EI, CAS, CSCD, PKU Core), 2019, No. 1, pp. 86-92 (7 pages)
Funding: Shanxi Province Science and Technology Research (Social Development) Project (20120313013-6); Shanxi Province Youth Science and Technology Research Fund (2013021016-1)
Keywords: speech recognition; power-law nonlinearity function; cochlear filter cepstral coefficients; spectral subtraction
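
The abstract names a chain of processing steps: spectral subtraction, a power-law nonlinearity in place of the usual logarithm, cepstral coefficients, and a first-order difference appended to form a hybrid feature. The following is a minimal Python sketch of that chain, not the authors' implementation. It assumes the input is already a list of per-band power values for each frame (the cochlear filter bank itself is not reproduced), that the first few frames contain only noise, and that the exponent, over-subtraction factor `alpha`, and spectral `floor` are placeholder values; none of these parameters or function names come from the paper. The PCA step is omitted to keep the example dependency-free.

```python
import math

def spectral_subtract(power_frames, noise_frames=2, alpha=1.0, floor=0.01):
    """Basic power spectral subtraction.

    Assumption: the first `noise_frames` frames are speech-free, so their
    per-band mean serves as the noise estimate.
    """
    n_bands = len(power_frames[0])
    noise = [sum(f[k] for f in power_frames[:noise_frames]) / noise_frames
             for k in range(n_bands)]
    # Subtract the noise estimate and clamp to a small fraction of the input
    # so that no band becomes negative.
    return [[max(f[k] - alpha * noise[k], floor * f[k]) for k in range(n_bands)]
            for f in power_frames]

def power_law(frames, exponent=1 / 15):
    """Power-law compression; the exponent is a placeholder, not the paper's value."""
    return [[v ** exponent for v in f] for f in frames]

def dct_ii(vec, n_coeffs):
    """Unnormalized DCT-II, keeping the first `n_coeffs` coefficients."""
    n = len(vec)
    return [sum(v * math.cos(math.pi * c * (i + 0.5) / n)
                for i, v in enumerate(vec))
            for c in range(n_coeffs)]

def delta(frames):
    """First-order difference: two-point central difference, edges replicated."""
    t = len(frames)
    out = []
    for i in range(t):
        prev = frames[max(i - 1, 0)]
        nxt = frames[min(i + 1, t - 1)]
        out.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return out

def hybrid_features(power_frames, n_coeffs=3):
    """Enhance, compress, take cepstra, then append the first-order difference."""
    enhanced = spectral_subtract(power_frames)
    compressed = power_law(enhanced)
    cepstra = [dct_ii(f, n_coeffs) for f in compressed]
    deltas = delta(cepstra)
    return [c + d for c, d in zip(cepstra, deltas)]

if __name__ == "__main__":
    # Toy input: two noise-only frames followed by two "speech" frames.
    frames = [[1.0, 1.0, 1.0, 1.0],
              [1.0, 1.0, 1.0, 1.0],
              [5.0, 9.0, 2.0, 1.0],
              [3.0, 4.0, 2.0, 1.0]]
    for row in hybrid_features(frames):
        print([round(x, 4) for x in row])
```

Each output row holds `n_coeffs` static coefficients followed by `n_coeffs` difference coefficients. In a fuller pipeline, the rows would then be projected onto their leading principal components before being passed to the recognizer.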