期刊文献+

基于分类线性加权的源-目标话者声音转换算法的研究 被引量:1

Voice conversion from source speaker to target speaker based on classified linearly weighted transformation
下载PDF
导出
摘要 源-目标话者的声音转换是一种变换说话人声音特性的技术,它将源说话人的声音转换成另一个指定的目标说话人的声音。对源话者声道谱特性的修改是声音转换的关键之一。为了克服一般分类线性转换算法中分类不准确所带来的误差,本文引入了分类线性加权转换的策略,根据不同子类的转换函数对谱特性的贡献,赋予不同的加权系数,给出了一种基于GMM后验概率加权的线性转换算法。在微软汉语普通话语音数据库上做的四组对比实验表明,该算法在谱转换性能上均有不同程度的提高。 voice conversion technique aims to modify the source speaker's speech to make it sound like a designated target speaker's speech, of which the spectral envelope mapping algorithm is the key part. A classified linearly transformation is introduced to reduce transformation error caused by inaccurate classification. Different weighted values are added based on the contribution of each class to the whole spectral envelope, and a weighted linearly transformation based on the GMM posterior probability is presented. Experimental results show the proposed algorithm can improve the performance of converted spectral envelope.
出处 《电路与系统学报》 CSCD 北大核心 2008年第3期106-110,105,共6页 Journal of Circuits and Systems
关键词 声音转换 源-目标话者 声道谱转换 高斯混合模型 分类线性转换 分类线性加权转换 voice conversion the source-target speaker spectral envelope transformation Gauss mixture model classified linearly transformation classified linearly weighted transformation
  • 相关文献

参考文献8

  • 1E Moulines, et al. Voice conversion: state of the art and perspectives [J]. Elsevier, 1995-02, 16(2): 125-126.
  • 2M Abe, et al. Voice conversion through vector quantization [A]. Proceedings of ICASSP [C]. 1988, 1: 655-658.
  • 3左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量:32
  • 4H Valbret, et al. Voice transformation using PSLOA technique [J]. Speech Communication, 1992, 11:175-187.
  • 5Ye Hui, Young Steve. Perceptually Weighted Linear Transformation for Voice Conversion [A]. Proceedings of Eurospeech [C]. 2003. 2409-2412.
  • 6Erie Chang, Y Shi, J Zhou, C Huang. Speech lab in a box: a mandarin speech toolbox to jumpstart speech related research [A]. Proceedings of Eurospeech [C]. 2001. 2799-2802.
  • 7Athanaslos Monchtaris, el aL Non-parallel training for voice conversion by maximum likelihood constrained adaptation [A]. Proceedings of ICASSP [C]. 2004-05, 1: 1-4.
  • 8A Kain, M. Macon. Spectral voice conversion for text-to-speech synthesis [A]. Proceedings of ICASSP [C]. 1998-05, 1 : 285-288.

二级参考文献56

  • 1H Kuwabara and Y Sagisaka.Acoustic characteristics of speaker individuality:control and conversion[J].Speech Communication.1995,16(2):165-173.
  • 2D Klatt and L C Klatt.Analysis,synthesis,and perception of voice quality variations among female and male talkers[J].J Acoust Soc Am,1990,87(2):820-857.
  • 3P H Milenkovic.Voice source model for continuous control of pitch period[J].J Acoust Soc Am,1993,93(2):1087-1096.
  • 4H Matsumoto,et al.Multidimensional representation of personal quality of vowels and its acoustical correlates[J].IEEE Trans Audio and Electroacoustics,1973,21(5):428-436.
  • 5S Furui.Research on individuality features in speech waves and automatic speaker recognition techniques [J].Speech Communication,1986,5(2):183-197.
  • 6K S Lee,et al.A new voice transformation based on both linear and nonlinear prediction[A].Proc ICSLP[C].Philadelphia,USA:ESCA,1996.1401-1404.
  • 7L M Arslan.Speaker transformation algorithm using segmental codebooks (STASC)[J].Speech Communication,1999,28(3):211-226.
  • 8H Mizuno and M Abe.Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt[J].Speech Communication.1995,16(2):165-173.
  • 9T Yoshimura,et al.Speaker interpolation in HMM-based speech synthesis system[A].Proc.Eurospeech [C].Rhodes,Greece:ESCA,1997.2523-2526.
  • 10D G Childers.Glottal source modeling for voice conversion [J].Speech Communication.1995,16 (2):127-138.

共引文献31

同被引文献5

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部