Correlation Weighting Enhancement of Speech Based on Multitaper Spectrum (基于多窗谱相关加权语音增强)

Cited by: 7

Abstract: Compared with the traditional periodogram, multitaper spectral analysis is a low-variance, high-resolution method of spectral estimation. It is particularly well suited to analyzing weak signals, and signals whose amplitude and frequency vary with time, against a strong noise background in nonlinear systems. Exploiting the low variance of the multitaper estimate, this paper proposes a correlation-weighting speech enhancement algorithm based on the multitaper spectrum. The noise spectrum and the noise-to-noisy-signal ratio (NNSR) are first estimated from the multitaper spectrum of the noisy signal. A pre-enhanced speech signal is then obtained by spectral amplitude subtraction whose gain is a function of the NNSR, and the final enhanced speech is obtained by applying a correlation weighting rule. Objective and subjective tests show that, under the same experimental conditions, the multitaper correlation-weighting algorithm suppresses background noise and musical noise better, while also preserving the intelligibility and naturalness of the speech well.
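The low-variance property that the abstract relies on can be illustrated with a small sketch. This is not the authors' implementation: it uses sine tapers instead of the discrete prolate spheroidal (Slepian) sequences usually associated with the multitaper method, only so that it runs on the Python standard library. The frame length, taper count, and white-noise test signal are assumptions chosen for the demonstration, and the NNSR estimation and correlation weighting steps are not modeled.

```python
import cmath
import math
import random


def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n."""
    scale = math.sqrt(2.0 / (n + 1))
    return [
        [scale * math.sin(math.pi * (j + 1) * (i + 1) / (n + 1)) for i in range(n)]
        for j in range(k)
    ]


def power_spectrum(x):
    """Squared DFT magnitude of x at bins 0..n//2 (direct O(n^2) DFT)."""
    n = len(x)
    spectrum = []
    for f in range(n // 2 + 1):
        acc = sum(x[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def periodogram(x):
    """Single rectangular-window estimate, normalized by the frame length."""
    n = len(x)
    return [p / n for p in power_spectrum(x)]


def multitaper(x, k=5):
    """Average of k eigenspectra, one per unit-energy taper."""
    tapers = sine_tapers(len(x), k)
    spectra = [power_spectrum([w * s for w, s in zip(taper, x)]) for taper in tapers]
    return [sum(column) / k for column in zip(*spectra)]


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


FRAME_LEN = 128  # assumed frame length, not taken from the paper
NUM_TAPERS = 5   # assumed taper count, not taken from the paper

# Unit-variance white noise: its true spectrum is flat, so the scatter of the
# estimates across bins and frames reflects the variance of the estimator.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(8 * FRAME_LEN)]
frames = [noise[i:i + FRAME_LEN] for i in range(0, len(noise), FRAME_LEN)]

# Pool the interior bins (DC and Nyquist excluded) over all frames.
pooled_p = [v for fr in frames for v in periodogram(fr)[1:-1]]
pooled_m = [v for fr in frames for v in multitaper(fr, NUM_TAPERS)[1:-1]]
var_p = variance(pooled_p)
var_m = variance(pooled_m)
print(f"periodogram variance: {var_p:.3f}, multitaper variance: {var_m:.3f}")
```

Averaging K roughly independent eigenspectra is expected to cut the variance of the estimate by about a factor of K, at the cost of a wider main lobe. A production implementation would normally use an FFT and Slepian tapers rather than the direct DFT and sine tapers used here.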

Authors: Peng Jun (彭军), Wang Zhong (王忠), Liu Xingtao (刘兴涛), Hu Jianchao (胡建超)

Affiliation: School of Electrical Engineering and Information, Sichuan University (四川大学电气信息学院)

Source: Computer Simulation (《计算机仿真》), indexed in CSCD and the Peking University Core Journals list, 2011, No. 3, pp. 142-145 (4 pages)

Keywords: Speech enhancement; Multitaper spectrum; Correlation weighting; Music noise

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
