期刊文献+

基于改进增益函数的电子耳蜗语音增强 被引量:1

Speech Enhancement for Cochlear Implant Based on Improved Gain Function
下载PDF
导出
摘要 目前在安静环境下电子耳蜗编码技术已取得较高的语音识别率,但在噪声条件下听觉感知性能下降明显。针对该问题,提出基于改进增益函数的电子耳蜗语音增强算法。以组合编码算法为基础,采用约束方差的噪声谱估计算法进行噪声功率谱估计并应用于信噪比估计。结合人耳掩蔽阈值在子频带中自适应调节增益函数,将改进的增益函数与通道选择相结合,实现电子耳蜗语音增强。实验结果表明,与采用基本谱减法前端去噪和传统增益函数的电子耳蜗语音增强算法相比,该算法的语音平均识别率分别提高了53%和22%,在保留更多语音信息的同时能有效消除背景噪声干扰。 Currently,the Cochlear Implant (CI) coding techniques achieve a high speech recognition rate in quiet environment,but the auditory perception performance significantly decreases in noisy conditions.In order to solve this problem,this paper proposes an enhancement method in CI on the basis of improved gain function.Based on the combined coding algorithm,this paper makes use of the spectrum estimation algorithm of constrained variance noise to calculate the noise power spectrum estimation and applies it into Signal to Noise Ratio (SNR) estimation,and combines it with human ears' masking threshold to adaptively adjust the gain function in sub-band.The speech enhancement in CI is achieved by combining the improved gain function with the channel selection.Experimental results show that comparing with the methods of front-end de-noising spectral subtraction algorithm and the traditional gain function algorithm of the speech enhancement for CI,the proposed algorithm keeps more voice information and greatly removes the background noise.The average recognition rate of this method is respectively improved by 53% and 22%.
出处 《计算机工程》 CAS CSCD 2014年第8期237-241,共5页 Computer Engineering
基金 国家自然科学基金资助项目(61271359) 苏州大学捷美生物医学工程仪器联合重点实验室基金资助项目
关键词 电子耳蜗 语音增强 组合编码算法 改进增益函数 噪声估计 人耳掩蔽阈值 Cochlear Implant (CI) speech enhancement combinational encoding algorithm improved gain function noise estimation human ears' masking threshold
  • 相关文献

参考文献12

  • 1黄雅婷,陶智,顾济华,赵鹤鸣,严冬明.基于人耳掩蔽效应的电子耳蜗语音增强方法[J].计算机工程,2008,34(10):280-282. 被引量:2
  • 2Yang Liping,Fu Qianjie.Spectral Subtraction-based Speech Enhancement for Cochlear Implant Patients in Background Noise[J].Journal of Acoustic Society of America,2005,117:1001-1004.
  • 3Loizou P C,Lobo A,Hu Y.Subspace Algorithms for Noise Reduction in Cochlear Implants[J].Journal of Acoustic Society of America,2005,118:2791-2793.
  • 4Loizou P.Speech Processing in Vocoder-centric Cochlear Implants[J].Advance in Oto-RhinoLaryngology,2006,64:109-143.
  • 5Hu Yi,Loizou P C,Li Ning,et al.Use of a Sigmoidalshaped Function for Noise Attenuation in Cochlear Implants[J].Journal of the Acoustical Society of America,2007,122:128-134.
  • 6Dawson P W,Mauger S J,Hersbach A A.Clinical Evaluation of Signal-to-Noise Ratio Based Noise Reduction in Nucleus Coch lear-implant Recipients[J].Ear Hear,2011,32(3):382-390.
  • 7Derakhshan N,Akbari A,Ayatollahi A.Noise Power Spectrum Estimation Using Constrained Variance Spectral Smoothing and Minima Tracking[J].Speech Communication,2009,51:1098-1113.
  • 8周成燕,周强,顾济华,赵鹤鸣,陶智.基于约束方差的噪声谱估计算法[J].计算机工程与应用,2012,48(18):127-131. 被引量:2
  • 9Martin R.Bias Compensation Methods for Minimum Statistics Noise Power Spectral Density Estimation[J].Signal Processing,2006,86 (6):1215-1229.
  • 10Hasan M K,Salahuddin S,Khan M R.A Modified a Priori SNR for Speech Enhancement Using Spectral Subtraction Rules[J].IEEE Signal Processing Letters,2004,11 (4):450-453.

二级参考文献21

  • 1邹霞,陈亮,张雄伟.甚低速率语音编码中的高效模拟退火算法研究[J].系统仿真学报,2004,16(10):2181-2184. 被引量:5
  • 2陶智,赵鹤鸣,龚呈卉.基于听觉掩蔽效应和Bark子波变换的语音增强[J].声学学报,2005,30(4):367-372. 被引量:39
  • 3王晓娣.多带谱相减结合感觉加权的语音增强方法研究[J].电力系统通信,2005,26(12):50-53. 被引量:6
  • 4李春晓,潘翔,刘琚,聂开宝.频域和时域信息对连续交替取样策略汉语声调识别的贡献[J].生物医学工程学杂志,2006,23(1):41-44. 被引量:2
  • 5Yang W, Dixon M, Yantorno R. A modified Bark spectral distortion measure with uses noise masking threshold [C]//Proc of IEEE Workshop on Speech Coding for Communications. Pocono Manor, USA:IEEE, 1997:55-56.
  • 6Wang S, Sekey A, Gersho A. An objective measure for predicting subjective quality of speech coders[J].IEEE J Select Areas Commun, 1992,10(5):819-829.
  • 7Yang W. Enhanced modified Bark spectral distortion(EMBSD): an objective speech quality measure based on audible distortion and cognition model [D].Philadelphia: Department of Electrical &. Computer Engineering of Temple University, 1999: 28-30, 63-75.
  • 8ITU T Group 12. P. 862 perceptual evaluation of speech quality (PESQ).. an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs [S].Geneva, Switzerland, 2001.
  • 9Wilson B,Finley C C,Lawson D,et al.Design and Evaluation of Continuous Intedeaved Sampling(CIS) Processing Strategy for Multichannel Cochlear Implants[J].Journal of Rehab.and Research and Development,1993,30(1):110-116.
  • 10Nathalie Virag.Single Channel Speech Enhancement Based on Masking Properties of Human Auditory System[J].IEEE Transactions on Speech and Audio Processing,1999,7(2):126-137.

共引文献7

同被引文献13

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部