期刊文献+

基于平均模型和误差削减网络的语声转换系统

A voice conversion system based on average model and error reduction network
下载PDF
导出
摘要 现阶段用于语声转换的深度学习方法多是通过使用大量的训练数据来生成高质量的语声。该文提出了一种基于平均模型和误差削减网络的语声转换框架,可用于有限数量的训练数据。首先,基于CBHG网络的平均模型使用排除源说话人和目标说话人的多说话人语声数据进行训练;然后,在有限数量的目标语声数据下对平均模型执行自适应训练;最后,提出一种误差削减网络,可以进一步改善转换后语声的质量。实验表明,所提出的语声转换框架可以灵活地处理有限的训练数据,并且在客观和主观评估方面均优于传统框架。 So far,many of the deep learning approaches for voice conversion produce good quality speech by using a large amount of training data.This paper presents an average model and error reduction network-based voice conversion framework that can work with a limited amount of training data.We propose to implement a CBHG based average model that is trained with data from many speakers excluding source and target speakers;then,we propose to perform adaptation with a limited amount of target data;last,we propose an error reduction network that can improve the voice conversion quality even further.The experiments show that the proposed voice conversion framework is flexible to work with limited training data and outperforms the traditional frameworks in both objective and subjective evaluations.
作者 王媛媛 王新宇 张明阳 周锋 赵力 WANG Yuanyuan;WANG Xinyu;ZANG Mingyang;ZHOU Feng;ZHAO Li(School of Information Technology,Yancheng Institute of Technology,Yancheng 224051,China;Department of Electrical and Computer Engineering,National University of Singapore,Singapore 117583,Singapore;School of Information Science and Engineering,Southeast University,Nanjing 210096,China)
出处 《应用声学》 CSCD 北大核心 2023年第3期620-626,共7页 Journal of Applied Acoustics
基金 国家自然科学基金项目(61673108,62076215) 江苏省高等学校自然科学研究重大项目(19KJA110002)。
关键词 语声转换 CBHG 平均模型 误差削减网络 Voice conversion CBHG Average model Error reduction network
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部