期刊文献+

工业噪声环境下多麦状态空间模型语音增强算法 被引量:3

Speech enhancement using multi-microphone state space model under industrial noise environment
下载PDF
导出
摘要 在协同作业的工业环境中进行语音通信时,语音往往会淹没于工业噪声中,致使语音通信的有效性受到影响。针对这种工业噪声下的语音环境,提出了一种采用多麦克风的卡尔曼语音增强算法。该算法简化了状态空间模型(SSM)中的差分方程以降低复杂度,每个采样点实时得到去噪信号从而增强了实时性。另外,为了进一步简化复杂度,还利用最小二乘原则来对语音进行增强。实验中采用了公开数据库的语音信号和工厂噪声信号来模拟多麦下的带噪语音,将所提算法与传统算法进行了对比。实验结果表明,所提算法的输出语噪比(增强后的语音与残留噪声之比)优于传统算法约2 dB,而运行时间仅不到传统算法的2%,且延迟时间仅是毫秒级。 When a speech communication is performed in the industrial environment of collaborative operation,the speech is often submerged in the industrial noise.In this case,the effectiveness of speech communication is affected.For the speech environment with industrial noise,a Kalman speech enhancement algorithm using multi-microphone was proposed.In the algorithm,the difference equation in the State Space Model(SSM)was simplified to reduce the complexity,and the denoising signal was obtained in each sampling point to improve the real-time performance.In addition,to further simplify the complexity,the least square method was used to enhance the speech.In experiments,the speech signals and factory noise signals from a public database were used to simulate the noisy speech under multi-microphone environment,and the proposed algorithm was compared with the traditional algorithm.The experimental results show that the proposed algorithm has the output speech-to-noise ratio(a ratio of enhanced speech to residual noise)higher than the traditional algorithm by about 2 dB,and the running time less than 2%of that of the traditional algorithm.At the same time,the delay time of the algorithm is only several milliseconds.
作者 吴庆贺 吴海锋 沈勇 曾玉 WU Qinghe;WU Haifeng;SHEN Yong;ZENG Yu(School of Electric and Informative Engineering,Yunnan Minzu University,Kunming Yunnan 650504,China)
出处 《计算机应用》 CSCD 北大核心 2020年第5期1476-1482,共7页 journal of Computer Applications
基金 国家自然科学基金资助项目(61762093) 云南省应用基础研究重点项目(2018FA036) 云南省高校科技创新团队。
关键词 工业噪声 状态空间模型 多麦克风 语音增强 industrial noise State Space Model(SSM) multi-microphone speech enhancement
  • 相关文献

参考文献4

二级参考文献70

  • 1Kim G, Lu Y, Hu Y, Loizou P C. An algorithm that im- proves speech intelligibility in noise for normal-hearing lis- teners. The Journal of the Acoustical Society of America, 2009, 126(3): 1486-1494.
  • 2Dillon H. Hearing Aids. New York: Thieme, 2001.
  • 3Allen J B. Articulation and intelligibility. Synthesis Lectures on Speech and Audio Processing, 2005, 1(1): 1-124.
  • 4Seltzer M L, Raj B, Stern R M. A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition. Speech Communication, 2004, 43(4): 379-393.
  • 5Weninger F, Erdogan H, Watanabe S, Vincent E, Le Roux J, Hershey J R, Schuller B. Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR. In: Proceedings of the 12th International Conference on Latent Variable Analysis and Signal Separation. Liberec, Czech Republic: Springer International Publishing, 2015.91 -99.
  • 6Weng C, Yu D, Seltzer M L, Droppo J. Deep neural networks for single-channel multi-talker speech recognition. IEEE/ ACM Transactions on Audio, Speech, and Language Pro- cessing, 2015, 23(10): 1670-1679.
  • 7Boll S F. Suppression of acoustic noise in speech using spec- tral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(2): 113-120.
  • 8Chen J D, Benesty J, Huang Y T, Doclo S. New insights into the noise reduction wiener filter. IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(4): 1218 -1234.
  • 9Loizou P C. Speech Enhancement: Theory and Practice. New York: CRC Press, 2007.
  • 10Liang S, Liu W J, Jiang W. A new Bayesian method incor- porating with local correlation for IBM estimation. IEEE Transactions on Audio, Speech, and Language Processing, 2013, 21(3): 476-487.

共引文献70

同被引文献25

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部