期刊文献+

基于改进CFCC特征提取的语种识别算法研究

Research on language recognition algorithm based on improved CFCC feature extraction
下载PDF
导出
摘要 针对在低信噪比下语种识别准确率低的问题,提出一种基于分数阶小波变换的语种识别算法。首先,在特征提取前端采用自适应滤波法对带噪信号进行噪声滤除,以减小噪声对特征提取的影响,提升系统对带噪信号的处理能力。其次,采用新型分数阶小波变换作为小波基函数来模拟信号在耳蜗基底膜上的传播过程,利用非线性幂函数对信号进行压缩处理。最后,通过模拟人耳听觉过程提取改进耳蜗滤波器倒谱系数(CFCC)。实验结果表明,改进CFCC与传统CFCC相比显著提升了语种识别准确率,在0dB信噪比下语种识别准确率平均提升了11.1%,充分验证了所提算法的有效性和稳健性。 Aiming at the problem of low language recognition rate under low signal-to-noise ratio, a language recognition method based on fractional wavelet transform was proposed. Firstly, the adaptive filtering algorithm was used to filter the noise of the noisy signal, so as to reduce the influence of noise on the feature extraction and improve the processing ability of the system for non-stationary signals. Secondly, the motion of the signal on the basilar membrane of the cochlea was simulated, and then the signal was compressed by a nonlinear power function. Finally, the improved CFCC were extracted by simulating the human hearing process. Experiments show that compared with the traditional CFCC, the language recognition rate is significantly improved, and the language recognition rate is increased by 11.1% on average under the 0 dB signal-to-noise ratio, which verifies the effectiveness and robustness of the proposed algorithm.
作者 龙华 黄张衡 邵玉斌 杜庆治 苏树盟 LONG Hua;HUANG Zhangheng;SHAO Yubin;DU Qingzhi;SU Shumeng(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China)
出处 《通信学报》 EI CSCD 北大核心 2022年第12期211-221,共11页 Journal on Communications
基金 国家自然科学基金资助项目(No.61761025)。
关键词 语种识别 自适应滤波 分数阶小波变换 神经网络 耳蜗滤波器倒谱系数 language recognition adaptive filtering fractional wavelet transform neural network cochlear filter cepstral coefficient
  • 相关文献

参考文献11

二级参考文献57

  • 1楼红伟,胡光锐.基于Teager能量算子和频率弯折小波变换的语音识别特征参数[J].上海交通大学学报,2003,37(z1):79-82. 被引量:8
  • 2岳倩倩,周萍,景新幸.基于非线性幂函数的听觉特征提取算法研究[J].微电子学与计算机,2015,32(6):163-166. 被引量:5
  • 3TAO Ran,DENG Bing,WANG Yue.Research progress of the fractional Fourier transform in signal processing[J].Science in China(Series F),2006,49(1):1-25. 被引量:99
  • 4Parham Aarabi(著),平利川.基于相位的语音信号处理[J].国外科技新书评介,2006(12):12-13. 被引量:1
  • 5Vijayasenan D, Valente F, Bourlard H. Multistream speaker dia- rization of meetings recordings beyond MFCC and TI)OA fea- tures [J]. Speech Communication, 2012,54 (1) : 55-67.
  • 6Wang L, Minami K, Yamamoto K, et al. Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions [J]. IEICE Transactions on Information and Systems, 2010, E93D(9) :2397-2406.
  • 7Li Q, Huang Y. An Auditory-Based Feature Extraction Algo- rithm for Robust Speaker Identification Under Mismatched Conditions [J]. IEEE Transactions on Audio Speech and Lan- guage Processing, 2011,19(6) : 1791-1801.
  • 8Li Qi. An auditory-based transfrom for audio signal processing [C]// 2009 IEEE Workshop on Applications of Signal Proces- sing to Audio and Acoustics. New Paltz, NY, United states, Oct. 2009 : 181-184.
  • 9Dimitriadis D,Maragos P,Potamianos A. On the Effects of Fil- terbank Design and Energy Computation on Robust Speech Recog-nition[J]. IEEE Transactions on Audio Speech and Lan- guage Processing, 2011,19(6) : 1504-1516.
  • 10Tu C-C,Juang C-F. Recurrent type-2 fuzzy neural network using Haar wavelet energy and entropy features for speech detection in noisy environments [J]. Expert Systems With Applications, 2012,39 (3): 2479-2488.

共引文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部