Voice Conversion Based on Linear Prediction Model with Sinusoidal Excitation (cited by: 2)
Abstract: Building on linear prediction (LP) residual conversion with a sinusoidal excitation model, this paper proposes a voice conversion method that improves how well the converted speech carries the target speaker's characteristics. The method works within an LP analysis/synthesis framework. On one side, the spectral envelope estimation vocoder (SEEVOC) extracts the source speaker's LP coding (LPC) cepstral envelope, which is then converted with a bilinear transform function. On the other side, a harmonic sinusoidal model is used to model and decompose the LP residual signal, and pitch (fundamental frequency) modification converts the source speaker's residual into an approximation of the target speaker's residual. Finally, the modified residual excites a time-varying filter to produce the converted speech, with the filter parameters updated in real time from the converted LPC cepstral envelope. Experimental results indicate that the method performs well in both subjective and objective tests, converts speaker characteristics effectively, and yields converted speech with high similarity to the target speaker.
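The analysis/synthesis chain described in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: it assumes the autocorrelation method with Levinson-Durbin recursion for LP analysis, a simple projection onto harmonics for the sinusoidal fit, and a fixed pitch ratio of 1.5. The SEEVOC envelope estimation and the bilinear-transform envelope conversion are omitted, so the filter is left unconverted. All function names and the toy signal are hypothetical.

```python
import math

def autocorr(x, order):
    """Autocorrelation r[0..order] of one frame."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(order + 1)]

def levinson_durbin(r, order):
    """Return (a, err) with a[0] = 1, so A(z) = 1 + sum_k a[k] z^-k."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def lp_residual(x, a):
    """Analysis (inverse) filter A(z): signal -> LP residual."""
    p = len(a) - 1
    return [sum(a[k] * x[n - k] for k in range(p + 1) if n - k >= 0)
            for n in range(len(x))]

def lp_synthesize(e, a):
    """Synthesis filter 1/A(z): excitation -> signal."""
    p = len(a) - 1
    y = []
    for n in range(len(e)):
        y.append(e[n] - sum(a[k] * y[n - k] for k in range(1, p + 1) if n - k >= 0))
    return y

def fit_harmonics(e, f0, fs, n_harm):
    """Amplitude and phase of each harmonic of f0, by projection.
    Exact only when the frame spans a whole number of pitch periods."""
    n = len(e)
    amps, phases = [], []
    for h in range(1, n_harm + 1):
        w = 2 * math.pi * h * f0 / fs
        c = 2.0 / n * sum(v * math.cos(w * i) for i, v in enumerate(e))
        s = 2.0 / n * sum(v * math.sin(w * i) for i, v in enumerate(e))
        amps.append(math.hypot(c, s))
        phases.append(math.atan2(-s, c))
    return amps, phases

def harmonic_excitation(amps, phases, f0, fs, n):
    """Harmonic sinusoidal model: sum of harmonics of f0."""
    return [sum(A * math.cos(2 * math.pi * (h + 1) * f0 * i / fs + ph)
                for h, (A, ph) in enumerate(zip(amps, phases)))
            for i in range(n)]

# Toy "source speaker" frame: harmonic excitation through a known all-pole filter.
fs, f0_src, n = 8000, 100.0, 400
x = lp_synthesize(harmonic_excitation([1.0] * 30, [0.0] * 30, f0_src, fs, n),
                  [1.0, -1.3, 0.8])

a, err = levinson_durbin(autocorr(x, 2), 2)      # LP analysis
e = lp_residual(x, a)                            # source residual
amps, phases = fit_harmonics(e, f0_src, fs, 20)  # sinusoidal model of residual
new_exc = harmonic_excitation(amps, phases, 1.5 * f0_src, fs, n)  # pitch-modified
converted = lp_synthesize(new_exc, a)            # excite the LP filter
print(len(converted), [round(c, 3) for c in a])
```

Feeding the unmodified residual `e` back through `lp_synthesize` reproduces `x` up to rounding error, which is the property that lets the residual and the envelope be modified independently. In a full system the coefficients `a` would be recomputed for every frame from the converted envelope, which is what makes the filter time-varying.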
Authors: 尹伟 (Yin Wei), 易本顺 (Yi Benshun)
Source: 《数据采集与处理》 (Journal of Data Acquisition and Processing), indexed in CSCD and the Peking University Core Journals list, 2010, No. 2, pp. 218-222 (5 pages)
Keywords: voice conversion; sinusoidal model; linear prediction residual analysis

References (9)

  • 1 左国玉 (Zuo Guoyu), 刘文举 (Liu Wenju), 阮晓钢 (Ruan Xiaogang). 声音转换技术的研究与进展 [Research and progress in voice conversion technology] [J]. 电子学报 (Acta Electronica Sinica), 2004, 32(7): 1165-1172. Cited by: 32
  • 2 Kain A. High resolution voice transformation [D]. Portland, OR: OGI School of Science and Engineering, Oregon Health and Science University, 2001.
  • 3 Toda T, Black A W, Tokuda K. Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory voice synthesis [C] // Proceedings of 5th ISCA Speech Synthesis Workshop. Pittsburgh, PA, USA: ISCA Press, 2004: 31-36.
  • 4 Percybrooks W, Moore II E. New algorithm for LPC residual estimation from LSF vectors for a VC system [C] // Proceedings of 8th International Conference on INTERSPEECH. Antwerp, Belgium: ISCA Press, 2007: 1977-1980.
  • 5 Paul B. The spectral envelope estimation vocoder [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1981, 29(1): 786-794.
  • 6 Smith J O, Abel J S. Bark and ERB bilinear transform [J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(6): 697-708.
  • 7 Quatieri T F, McAulay R J. Pitch estimation and voicing detection based on a sinusoidal model [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1990, 4(1): 249-252.
  • 8 Arslan L M. Speaker transformation algorithm using segmental codebooks [J]. Speech Communication, 1999, 28(3): 211-226.
  • 9 Sreenivasa R K, Yegnanarayana B. Voice conversion by prosody and vocal tract modification [C] // Proceedings of 9th International Conference on Information Technology. Bhubaneswar, Orissa, India: IEEE Press, 2006: 111-116.
