期刊文献+

语声转换技术发展及展望 被引量:3

An Overview of Voice Conversion
下载PDF
导出
摘要 语声转换通过改变语音信号的声学特征参数来调整语音的个性特征,从而使得转换后的源说话人语音听起来就像是目标说话人的声音一样。系统地介绍了当前语声转换技术的发展状况,在描述语声转换技术的应用场景和系统框架的基础上,着重阐述了系统的转换模块,即声道特性的转换和韵律转换,特别是重点介绍了声道特性的转换算法。简要地介绍了系统性能的测试方法,最后对全文进行了总结,并针对当前语声转换技术还存在的一些问题,对未来的发展进行了展望。 Voice conversion attempts to transform the personal characteristics of speech through adapting the acoustic parameters. The object is to make the speech uttered by a particular source speaker sound as if spoken by a designed target speaker. This paper introduces the development of voice conversion techniques in details. Firstly, the application of voice conversion and its system framework are described. Then, current conversion algorithms for the characteristics of vocal tract and prosody are presented, which is the core process of voice conversion. After that, the system performance evaluation methods, including subjective and objective measure, are introduced. Finally, the summary is given with a discussion of some existing problems in the current proposed algorithms.
作者 简志华 杨震
出处 《南京邮电大学学报(自然科学版)》 2007年第6期88-94,共7页 Journal of Nanjing University of Posts and Telecommunications:Natural Science Edition
基金 江苏省青蓝工程(QL003YZ)资助项目
关键词 语音处理 语声转换 声道特性 韵律信息 Speech processing Voice conversion Vocal tract characteristic Prosody information
  • 相关文献

参考文献41

  • 1GHILDERS D G,WU K,HICKS D M,et al. Voice conversion[ J]. Speech Communication, 1989,8 : 147 - 158.
  • 2KUWABARA H,SAGISAKA Y. Acoustic characteristics of speaker individuality : control and conversion [ J ]. Speech Communication, 1995,16 : 165 - 173.
  • 3QUATIERI T F,MCAULAY R J. Speech transformation based on a sinusoidal representation[ J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1986,34 ( 6 ) : 1449 - 1464.
  • 4GEORGE E B,SMITH M J T. Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model [ J ]. IEEE Trans on Speech and Audio Processing, 1997,5(5) :389 -406.
  • 5MACON M W,CLEMENTS M A. Sinusoidal modeling and modification of unvoiced speech[J]. IEEE Trans on Speech and Audio Processing, 1997,5(6) :557 - 560.
  • 6SHIKANO K, NAKAMURA S, ABE M. Speaker adaptation and voice conversion by codebook mapping [ C ]// ICASSP. Toronto, Canada, May 14 - 17,1991,1:594 - 597.
  • 7MOUCHTARIS A, NARAYANAN S S, KYRIAKAKIS C. Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation [ J ]. IEEE Trans on Speech and Audio Processing, 2005,13 (2) :263 - 274.
  • 8KAIN A,MACON M W. Spectral voice conversion for text-to-speech synthesis [ C ]// IEEE ICASSP. Seattle, USA, 1998:285 - 288.
  • 9NIELSEN A S, BROCK D P. Speaker recognizability testing for voice coders[ C ]// IEEE ICASSP. Atlanta, 1996,2 : 1149 - 1152.
  • 10LEE C L, CHANG W W, CHIANG Y C. Spectral and prosodic transformations of hearing-impaired Mandarin speech[J]. Speech Communication, 2006,48:207 - 219.

同被引文献18

引证文献3

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部