期刊文献+

基于混合线性变换的语声转换算法 被引量:2

An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation
下载PDF
导出
摘要 针对在没有对称语音库的情况下,该文提出了一种基于混合线性变换的语声转换算法,在最大似然估计准则下,使用EM迭代算法计算变换函数的参量。为了减小线性加权对语音谱包络的平滑作用,使用线性调频Z变换来调节语音信号的LPC系数。客观评测和主观感受的实验结果都表明,基于混合线性变换的语声转换算法也可以取得与传统语声转换技术相当的转换效果,解除了传统语声转换技术需要对称语音库的要求。 This paper proposes an algorithm for voice conversion based on mixtures of linear transformation which avoids the need for parallel training corpus inherent in conventional approaches. In maximum likelihood framework the EM algorithm is used to compute the parameters of the transfer function. And the chirp Z-transform is utilized to enhance the smoothed spectral envelop due to the linear weighted averaging. The proposed voice conversion system is evaluated using both objective and subjective measures. The experiment results demonstrate that the proposed approach is capable of effectively transforming speaker identity and can achieve comparable results of the conventional methods where a parallel corpus is needed.
作者 简志华 杨震
出处 《电子与信息学报》 EI CSCD 北大核心 2007年第7期1700-1702,共3页 Journal of Electronics & Information Technology
基金 江苏省青蓝工程项目(QL003YZ)资助课题
关键词 语声转换 混合线性变换 最大期望算法 线性调频Z变换 Voice conversion Ms-LT EM algorithm Chirp Z-transform
  • 相关文献

参考文献12

  • 1Childers D G,Wu K,and Hicks D M,et al..Voice conversion.Speech Communication,1989,8(2):147-158.
  • 2Abe M,Nakamura S,Shikano K,and Kuwabara H.Voice conversion through vector quantization.IEEE Proceedings of ICASSP,New York,USA,Apr.11-14,1988:565-568.
  • 3Arslan L M.Speaker transformation algorithm using segmental codebooks.Speech Communication,1999,28(3):211-226.
  • 4Narendranath M,Murthy H A,and Rajendran S,et al..Transformation of formants for voice conversion using artificial neural networks.Speech Communication,1995,16(2):207-216.
  • 5Iwahashi N and Sagisaka Y.Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks.Speech Communication,1995,16(2):139-151.
  • 6Stylianou Y,Cappe O,and Moulines E.Continuous Probabilistic Transform for Voice Conversion.IEEE Trans on Speech and Audio Processing,1998,6(2):131-142.
  • 7Kain A and Macon M W.Spectral voice conversion for text-to-speech synthesis.IEEE Proceedings of ICASSP,Seattle,USA,May 12-15,1998:285-288.
  • 8Smits R and Yegnanarayana B.Determination of instants of significant excitation in speech using group delay function.IEEE Trans.on Speech and Audio Processing,1995,3(5):325-333.
  • 9Diakoloukas V D and Digalakis V V.Maximum likelihood stochastic transformation adaptation of hidden Markov models.IEEE Trans.on Speech and Audio Processing,1999,7(2):177-187.
  • 10Wang T T.The segmented chirp z-transform and its application in spectrum analysis.IEEE Trans.on Instrumentation and Measurement,1990,39(2):318-324.

同被引文献19

引证文献2

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部