期刊文献+

Mel频率下语音信号深度频谱特征提取方法仿真 被引量:6

Simulation of Depth Spectrum Feature Extraction Method for Speech Signal under Mel Frequency
下载PDF
导出
摘要 采用当前方法提取语音信号频谱特征时,不能有效去除语音信号中存在的噪声信号,提取得到的特征误差较大,存在抗干扰性能差和特征提取结果准确率低的问题。针对上述问题,提出Mel频率下语音信号深度频谱特征提取方法。对噪声信号进行经验模态分解,将分解得到的IMF分量进行门限域处理,通过对应的滤波方案去除语音信号中存在的噪声信号。采用Mel滤波器处理去噪后的语音信号,得到Mel频率的语音信号。利用线性预测系数描述Mel频率下的语音信号,并对其进行微分处理,将微分向量进行加权处理,根据一定的加权比例重组微分向量,利用语音信号深度频谱特征参数,完成Mel频率下语音信号的深度频谱特征提取。仿真结果表明,所提方法的抗干扰性能高、特征提取结果准确率高。 The current method cannot effectively remove the noise signal in voice signal. The extracted feature error is large. The anti-interference performance is poor and the accuracy of feature extraction result is low. Therefore, a method to extract the depth spectrum feature of voice signal at Mel frequency was proposed. The empirical mode decomposition was performed on the noise signal. The decomposed IMF component was subjected to threshold domain processing. The noise signal existing in voice signal was removed by the corresponding filtering scheme. Mel filter was used to process the denoised voice signal, so as to obtain the Mel frequency. The linear prediction coefficient was used to describe the voice signal at Mel frequency and the voice signal was differentiated. The differential vector was weighted. According to a certain weighting ratio, the differential vector was reconstructed. Finally, the depth spectrum feature parameter of voice signal was used to complete the depth spectrum feature extraction of voice signal at Mel frequency. Simulation results show that the proposed method has high anti-interference performance and high accuracy of feature extraction result.
作者 张红兵 ZHANG Hong-bing(Criminal Investigation Police University of China,Shenyang Liaoning 110854,China)
出处 《计算机仿真》 北大核心 2020年第5期197-200,267,共5页 Computer Simulation
基金 辽宁省自然科学基金指导计划项目(201602810) 公安理论及软科学研究计划项目(2017LLYJXJXY040)。
关键词 语音信号 深度频谱特征 特征提取 Voice signal Depth spectrum feature Feature extraction
  • 相关文献

参考文献10

二级参考文献91

  • 1张鑫瑜,李雪耀,张汝波,于桂芬.基于语音抑制的飞机识别研究[J].华中科技大学学报(自然科学版),2011,39(S2):291-294. 被引量:1
  • 2王宏霞,范明泉.基于质心的混合域半脆弱音频水印算法[J].中国科学:信息科学,2010,40(2):313-326. 被引量:12
  • 3张汝波,刘冠群,吴俊伟,吕西宝.移动机器人语音控制技术研究与实现[J].华中科技大学学报(自然科学版),2013,41(S1):348-351. 被引量:8
  • 4林奕琳,韦岗,杨康才.语音情感识别的研究进展[J].电路与系统学报,2007,12(1):90-98. 被引量:33
  • 5ZUBAIR S, WANG W. Audio classification based on sparse coefficients[C]//Sensor Signal Processing for Defence (SSPD 2011. London, UK:The Institution of Engineering and Technology Press, 2011:1-5.
  • 6ZUBAIR S, YAN F, WANG W. Dictionary learning based sparse coefficients for audio classification with max and average pooling[J]. Digital Signal Processing, 2013, 23(3):960-970.
  • 7CHU S, NARAYANAN S, KUO C C J. Environmental sound recognition with time-frequency audio features[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2009, 17(6):1142-1158.
  • 8SIVASANKARAN S, PRABHU K M M. Robust features for environmental sound classification[C]//Proceedings of the 2013 IEEE International Conference on Electronics, Computing and Communication Technologies. Piscataway, NJ:IEEE, 2013:1-6.
  • 9WANG J C, LIN C H, CHEN B W, et al. Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation[J]. IEEE Transactions on Automation Science and Engineering, 2014, 11(2):607-613.
  • 10TAKEUCHI K, ISHIGURO K, KIMURA A, et al. Non-negative multiple matrix factorization[C]//Proceedings of the 23rd International Joint Conference on Artificial Intelligence. Beijing:AAAI, 2013:1713-1720.

共引文献64

同被引文献77

引证文献6

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部