
Speech enhancement algorithm based on a dual-generator and frequency-domain discriminator GAN
Abstract Under low signal-to-noise ratio (SNR) conditions, generative adversarial network (GAN) speech enhancement algorithms struggle to capture the time-domain distribution of noisy speech: the speech signal is swamped by noise, which degrades the enhancement and can distort the enhanced speech. To address this, a new GAN speech enhancement algorithm based on dual generators and a frequency-domain discriminator is proposed. First, the algorithm uses two generators with the same parameters and improves speech quality through multistage enhancement mapping. Second, a self-attention layer is added to each generator on top of the original model to improve performance and enhancement quality. Finally, the discriminator adopts a frequency-domain structure and uses frequency-domain distribution information as the basis for judging how similar the enhanced speech is to clean speech. Experimental results show that, on speech enhancement tasks in low-SNR environments, the proposed method outperforms the comparison methods, improving PESQ and STOI by 0.18 and 1.67% on average, respectively.
Authors JI Pengwei; QUAN Haiyan (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China)
Source Journal of Yunnan University (Natural Sciences Edition), 2024, No. 5, pp. 871-880 (10 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list (北大核心).
Funding National Natural Science Foundation of China (61861023).
Keywords speech enhancement; generative adversarial network; dual generator; self-attention; frequency domain
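
The data flow described in the abstract (two enhancement stages, followed by a frequency-domain comparison against clean speech) can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration and is not taken from the paper: the "generator" is a fixed moving-average filter standing in for a trained network with self-attention, "same parameters" is read as reusing one set of parameters in both stages, and `spectral_distance` is a hand-written stand-in for the learned frequency-domain discriminator. The function names, frame length, and hop size are all hypothetical.

```python
import numpy as np


def generator(noisy, kernel):
    """Stand-in generator: a fixed smoothing filter, NOT the paper's network."""
    return np.convolve(noisy, kernel, mode="same")


def two_stage_enhance(noisy, kernel):
    """Apply the same generator parameters twice (multistage mapping)."""
    stage1 = generator(noisy, kernel)
    stage2 = generator(stage1, kernel)
    return stage1, stage2


def log_magnitude_spectrogram(signal, frame_len=256, hop=128):
    """Frame, window, and FFT a waveform; returns (frames, bins) log-magnitudes."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))


def spectral_distance(enhanced, clean):
    """Toy stand-in for a discriminator score: mean squared log-spectral gap."""
    gap = log_magnitude_spectrogram(enhanced) - log_magnitude_spectrogram(clean)
    return float(np.mean(gap ** 2))


# Synthetic low-SNR example: a 220 Hz tone buried in white noise (about -3 dB SNR).
rng = np.random.default_rng(0)
t = np.arange(4096) / 16000.0
clean = np.sin(2 * np.pi * 220 * t)
noisy = clean + rng.normal(scale=1.0, size=t.shape)

kernel = np.ones(9) / 9.0  # hypothetical shared "parameters" for both stages
stage1, stage2 = two_stage_enhance(noisy, kernel)

print("noisy  vs clean:", spectral_distance(noisy, clean))
print("stage1 vs clean:", spectral_distance(stage1, clean))
print("stage2 vs clean:", spectral_distance(stage2, clean))
```

In an actual GAN, both stages would be trained networks and the frequency-domain score would come from a discriminator trained adversarially on spectral features. The sketch only shows where the time-domain and frequency-domain representations enter the pipeline.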
