期刊文献+

采用联合字典优化的噪声鲁棒性语音转换算法 被引量:1

A noise robust voice conversion algorithm based on joint dictionary optimization
下载PDF
导出
摘要 针对含噪语音难以实现有效的语音转换,本文提出了一种采用联合字典优化的噪声鲁棒性语音转换算法。在联合字典的构成中,语音字典采用后向剔除算法(Backward Elimination algorithm,BE)进行优化,同时引入噪声字典,使得含噪语音与联合字典相匹配。实验结果表明,在保证转换效果的前提下,后向剔除算法能够减少字典帧数,降低计算量。在低信噪比和多种噪声环境下,本文算法与传统NMF算法和基于谱减法消噪的NMF转换算法相比具有更好的转换效果,噪声字典的引入提升了语音转换系统的噪声鲁棒性。 A noise robust voice conversion algorithm based on joint dictionary optimization is proposed in this paper to solve the problem that it is difficult to effectively convert noisy source speech into the target one.In the composition of the joint dictionary,the speech dictionary is optimized using a backward elimination algorithm.At the same time,a noise dictionary is introduced to match the noisy speech with the joint dictionary.The experimental results show that the backward elimination algorithm can decrease the number of dictionary frames and reduce the amount of calculation while ensuring the conversion effect.In low SNR and multiple noise environments,the algorithm has better conversion effect than the traditional NMF algorithm and the NMF conversion algorithm plus spectral subtraction de-noising.The proposed algorithm improves the robustness of the voice conversion system.
作者 张石磊 简志华 孙闽红 钟华 刘二小 ZHANG Shilei;JIAN Zhihua;SUN Minhong;ZHONG Hua;LIU Erxiao(School of Communication Engineering,Hangzhou Dianzi University,Hangzhou 310018)
出处 《声学学报》 EI CSCD 北大核心 2019年第6期1074-1082,共9页 Acta Acustica
基金 国家自然科学基金项目(61201301,61271214,61301248,41704154,61772166) 浙江省科技计划项目(LGG18F010009)资助
  • 相关文献

参考文献4

二级参考文献21

  • 1左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量:32
  • 2康永国,双志伟,陶建华,张维.基于混合映射模型的语音转换算法研究[J].声学学报,2006,31(6):555-562. 被引量:13
  • 3苏庄銮,汪增福.基于统计方法的普通话情感语调模型[J].自动化学报,2007,33(7):673-677. 被引量:2
  • 4Stylianou Y,Toda T,Wu C H,Kain A,Rosec O.Introduction to the special section on voice transformation.IEEE Audio,Speech,and Language Processing,2010;18(5):909
  • 5Abe M,Nakamura S,Shikano K,Kuwabara H.Voice Conversion through vector quantization.In:Proc.ICASSP,1988:655—658
  • 6Krendranath M,Murthy H,Barnwelt T,Nielsen A.Perceptual relevance of objectively measured descriptors for speaker characterization.In:Proc.ICASSP,1998:869—872
  • 7Valbret H,Moulines E,Tubach J.Voice Transformation Using PSOLSA Technique.In:Proc.ICASSP,1992:145-148
  • 8Kain A,Macon M.Spectral voice conversion for text-tospeech synthesis.In:Proc.ICASSP,1998:285—288
  • 9Elina Helander,Hanna Silen,Tuomas Virtanen,Moncef Gabbouj.Voice conversion using dynamic kernel partial least squares regression.IEEE Audio,Speech,and Language Processing,2012;20(3):806—817
  • 10Athanasios Mouchtaris,Jan Van der Spiegel,Paul Mueller.Nonparallel training for voice conversion based on a parameter adaptation approach.IEEE Transactions on Audio,Speech,and Language Processing,2006;14:952—963

共引文献17

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部