Evaluating single-channel speech separation performance in transform-domain 被引量：1

Evaluating single-channel speech separation performance in transform-domain

导出

摘要 Single-channel separation (SCS) is a challenging scenario where the objective is to segregate speaker signals from their mixture with high accuracy. In this research a novel framework called subband perceptually weighted transformation (SPWT) is developed to offer a perceptually relevant feature to replace the commonly used magnitude of the short-time Fourier transform (STFT). The main objectives of the proposed SPWT are to lower the spectral distortion (SD) and to improve the ideal separation quality. The performance of the SPWT is compared to those obtained using mixmax and Wiener filter methods. A comprehensive statistical analysis is conducted to compare the SPWT quantization performance as well as the ideal separation quality with other features of log-spectrum and magnitude spectrum. Our evaluations show that the SPWT provides lower SD values and a more compact distribution of SD,leading to more acceptable subjective separation quality as evaluated using the mean opinion score. Single-channel separation （SCS） is a challenging scenario where the objective is to segregate speaker signals from their mixture with high accuracy. In this research a novel framework called subband perceptually weighted transformation （SPWT） is developed to offer a perceptually relevant feature to replace the commonly used magnitude of the short-time Fourier transform （STFT）. The main objectives of the proposed SPWT are to lower the spectral distortion （SD） and to improve the ideal separation quality. The performance of the SPWT is compared to those obtained using mixmax and Wiener filter methods. A comprehensive statistical analysis is conducted to compare the SPWT quantization performance as well as the ideal separation quality with other features of log-spectrum and magnitude spectrum. Our evaluations show that the SPWT provides lower SD values and a more compact distribution of SD, leading to more acceptable subjective separation quality as evaluated using the mean opinion score.

作者 Pejman MOWLAEE Abolghasem SAYADIYAN Hamid SHEIKHZADEH

机构地区 Department of Electronic Engineering

出处《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2010年第3期160-174,共15页 浙江大学学报C辑（计算机与电子（英文版）

关键词 Single-channel separation （SCS） Magnitude spectrum Vector quantization （VQ） Subband perceptually weightedtransformation （SPWT） Spectral distortion （SD）

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献47

1Bach, F.R., Jordan, M.I., 2006. Learning spectral clustering, with application to speech separation. J. Mach. Learn. Res., 7(1): 1963-2001.
2Barker, J., Shao, X., 2007. Audio-Visual Speech Fragment Decoding. Proc. Int. Conf. on Auditory-Visual Speech Processing, p.37-42.
3Barker, J., Cooke, M., Ellis, D., 2005. Decoding speech in the presence of other sources. Speech Commun., 45(1):5-25. [doi:10.1016/j.specom.2004.05.002].
4Barker, J., Coy, A., Ma, N., Cooke, M., 2006. Recent Advances in Speech Fragment Decoding Techniques. 9th Int. Conf. on Spoken Language Processing, p.85-88.
5Benaroya, L., Bimbot, F., Gribonval, R., 2006. Audio source separation with a single sensor. IEEE Trans. Audio Speech Lang. Process., 14(1):191-199. [doi:10.1109FrSA. 2005.854110].
6Bishop, C.M., 2006. Pattern Recognition and Machine Learning. Information Science and Statistics Series. Springer, New York, USA, p.2-3. [doi:10.10071978-0- 387-45528-0].
7Chatterjee, S., Sreenivas, T.V., 2008. Predicting VQ performance bound for LSF coding. IEEE Signal Process. Lett., 15(1): 166-169. [doi:l 0.1109/I-SP.2007.914786].
8Chhikara, R., Folks, L., 1989. The Inverse Gaussian Distribution: Theory, Methodology and Applications. CRC Press, Marcel Dekker Inc., New York, USA, p.39-52.
9Christensen, M.G., Jakobsson, A., 2009. Multi-Pitch Estima- tion. Synthesis Lectures on Speech and Audio Processing. Morgan and Claypool Publishers, San Rafael, CA, USA, p.1-24. [doi:10.2200/S00178EDIV01Y200903SAP005].
10Cooke, M.E, Barker, J., Cunningham, S.E, Shao, X., 2006. An audiovisual corpus for speech perception and automatic speech recognition. J. Acoust. Soc. Am., 120(5):2421- 2424; [doi:]0.1121/1.2229005].

引证文献1

1Pejman MOWLAEE,Abolghasem SAYADIAN,Hamid SHEIKHZADEH.Split vector quantization for sinusoidal amplitude and frequency[J].Journal of Zhejiang University-Science C(Computers and Electronics),2011,12(2):140-154.

1GUO Haiyan,YANG Zhen,ZHU Weiping,YE Lei.Single-channel Speech Separation by l0 Optimization Using Quasi-KLT Bases[J].Chinese Journal of Electronics,2012,21(3):535-540. 被引量：1
2YAO Wen-po,WU Min,LIU Tie-bing,WANG Jun,SHEN Qian.Speech Separation Based on Robust Independent Component Analysis[J].Chinese Journal of Biomedical Engineering(English Edition),2013,22(4):169-177. 被引量：1
3HuangXiuxuan WeiGang.SPEECH SEPARATION ALGORITHM FOR AUDITORY SCENE ANALYSIS[J].Journal of Electronics(China),2004,21(3):261-264. 被引量：1
4郑畅.意法半导体推出先进的单通道栅驱动器芯片STGAP1S[J].半导体信息,2014(6):24-24.
5ZhangXichun LiYunjie ZhangJun WeiGang.CONCURRENT SPEECHES SEPARATION USING WRAPPED DISCRETE FOURIER TRANSFORM[J].Journal of Electronics(China),2005,22(4):427-430.
6贾楠,李唐军,钟康平,王目光,陈明,李晶,池剑锋.A Clock Enhanced Loop for Simultaneous Error-Free Demultiplexing and Clock Recovery of 160Gb/s OTDM Signal Single-Channel Transmission over 100km[J].Chinese Physics Letters,2010,27(11):121-124.
7卢宇潇,孙麓,李哲,周健军.A single-channel 10-bit 160 MS/s SAR ADC in 65 nm CMOS[J].Journal of Semiconductors,2014,35(4):138-145.
8应英子,马力,郭圣明.Adaptive and optimal detection of elastic object scattering with single-channel monostatic iterative time reversal[J].Chinese Physics B,2011,20(5):300-304.
9郑铮.Experimental studies on the impact of ASE noise of single-channel optical amplifiers in central office applications[J].Chinese Optics Letters,2004,2(6):311-313. 被引量：3
10易航,黄晓涛,李杨寰.一种单通道SAR对多个地面动目标定位的方法[J].计算机仿真,2010,27(6):6-9. 被引量：1

Journal of Zhejiang University-Science C(Computers and Electronics)

2010年第3期

浏览历史

内容加载中请稍等...

Evaluating single-channel speech separation performance in transform-domain 被引量：1

参考文献47

引证文献1

相关作者

相关机构

相关主题

浏览历史