Binaural Speech Separation Algorithm Based on Long and Short Time Memory Networks

Abstract: Speaker separation in complex acoustic environments is one of the challenging tasks in speech separation. In practice, speakers are very often stationary or moving slowly during normal communication. In this case, the spatial features of consecutive speech frames become highly correlated, which helps speaker separation by providing additional spatial information. To fully exploit this information, we design a separation system based on a recurrent neural network (RNN) with long short-term memory (LSTM), which effectively learns the temporal dynamics of spatial features. In detail, an LSTM-based speaker separation algorithm is proposed that extracts the spatial features in each time-frequency (TF) unit and forms the corresponding feature vector. Then, we treat speaker separation as a supervised learning problem, where a modified ideal ratio mask (IRM) is defined as the training target during LSTM learning. Simulations show that the proposed system achieves attractive separation performance in noisy and reverberant environments. Specifically, in tests under untrained acoustic conditions with limited priors, e.g., unmatched signal-to-noise ratio (SNR) and reverberation, the proposed LSTM-based algorithm still outperforms the existing DNN-based method in terms of PESQ and STOI. This indicates that our method is more robust in untrained conditions.
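The abstract names a modified ideal ratio mask as the learning target but does not define the modification. As a rough illustration only, the sketch below computes the standard energy-based IRM for two speakers in each TF unit. The function name `ideal_ratio_mask`, the exponent `beta`, the stabilizer `eps`, and the toy magnitudes are all assumptions made for this example, not details taken from the paper.

```python
def ideal_ratio_mask(target_mag, interferer_mag, beta=1.0, eps=1e-12):
    """Standard energy-based IRM per time-frequency unit.

    This is NOT the paper's modified IRM, which the abstract does not define.
    Inputs are magnitude spectrograms as nested lists: [frame][frequency bin].
    beta is an assumed compression exponent (0.5 is also common in the literature).
    """
    mask = []
    for target_frame, interferer_frame in zip(target_mag, interferer_mag):
        row = []
        for t, i in zip(target_frame, interferer_frame):
            t_energy, i_energy = t * t, i * i
            # Share of the unit's energy that belongs to the target speaker.
            row.append((t_energy / (t_energy + i_energy + eps)) ** beta)
        mask.append(row)
    return mask


# Toy magnitudes (made-up values): 2 frames x 2 frequency bins per speaker.
speaker_a = [[1.0, 0.0], [3.0, 2.0]]
speaker_b = [[1.0, 2.0], [4.0, 0.0]]

mask_a = ideal_ratio_mask(speaker_a, speaker_b)
mask_b = ideal_ratio_mask(speaker_b, speaker_a)
```

With `beta=1.0` the two masks sum to approximately one in every TF unit that contains energy, so each unit's energy is split between the speakers. A network trained on such a target would predict `mask_a` from the per-unit spatial feature vectors and apply it to the mixture spectrogram.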

Authors: Lin Zhou, Siyuan Lu, Qiuyue Zhong, Ying Chen, Yibin Tang, Yan Zhou

Affiliations: School of Information Science and Engineering; Department of Psychiatry; College of Internet of Things Engineering

Source: Computers, Materials & Continua (SCIE, EI), 2020, Issue 6, pp. 1373-1386 (14 pages)

Funding: This work is supported by the National Natural Science Foundation of China (NSFC) under Grant Nos. 61571106, 61501169, and 41706103, and by the Fundamental Research Funds for the Central Universities under Grant No. 2242013K30010.

Keywords: binaural speech separation; long and short time memory networks; feature vectors; ideal ratio mask

Classification: TN9 [Electronics and Telecommunications: Information and Communication Engineering]
