期刊文献+

ON USING NON-LINEAR CANONICAL CORRELATION ANALYSIS FOR VOICE CONVERSION BASED ON GAUSSIAN MIXTURE MODEL

ON USING NON-LINEAR CANONICAL CORRELATION ANALYSIS FOR VOICE CONVERSION BASED ON GAUSSIAN MIXTURE MODEL
下载PDF
导出
摘要 Voice conversion algorithm aims to provide high level of similarity to the target voice with an acceptable level of quality.The main object of this paper was to build a nonlinear relationship between the parameters for the acoustical features of source and target speaker using Non-Linear Canonical Correlation Analysis(NLCCA) based on jointed Gaussian mixture model.Speaker indi-viduality transformation was achieved mainly by altering vocal tract characteristics represented by Line Spectral Frequencies(LSF).To obtain the transformed speech which sounded more like the target voices,prosody modification is involved through residual prediction.Both objective and subjective evaluations were conducted.The experimental results demonstrated that our proposed algorithm was effective and outperformed the conventional conversion method utilized by the Minimum Mean Square Error(MMSE) estimation. Voice conversion algorithm aims to provide high level of similarity to the target voice with an acceptable level of quality. The main object of this paper was to build a nonlinear relationship between the parameters for the acoustical features of source and target speaker using Non-Linear Canonical Correlation Analysis (NLCCA) based on jointed Gaussian mixture model. Speaker indi- viduality transformation was achieved mainly by altering vocal tract characteristics represented by Line Spectral Frequencies (LSF). To obtain the transformed speech which sounded more like the target voices, prosody modification is involved through residual prediction. Both objective and subjective evaluations were conducted. The experimental results demonstrated that our proposed algorithm was effective and outperformed the conventional conversion method utilized by the Minimum Mean Square Error (MMSE) estimation.
出处 《Journal of Electronics(China)》 2010年第1期1-7,共7页 电子科学学刊(英文版)
基金 Supported by the National High Technology Research and Development Program of China (863 Program,No.2006AA010102)
关键词 Speech processing Voice conversion Non-Linear Canonical Correlation Analysis(NLCCA) Gaussian Mixture Model(GMM) Speech processing Voice conversion Non-Linear Canonical Correlation Analysis (NLCCA) Gaussian Mixture Model (GMM)
  • 相关文献

参考文献10

  • 1Jian Zhihua,Yang Zhen.Voice conversion using canonical correlation analysis based on Gaussian mixture model[].th ACIS International Conference on Software EngineeringArtificial IntelligenceNet-workingand Parallel/Distributed Computing.2007
  • 2E.Moulines,et al.Voice conversion: state of the art and perspectives[].Speech Communication.1995
  • 3Arslan L M.Speaker Transformation Algorithm Using Segmental Codebooks (STASC)[].Speech Communication.1999
  • 4Narendranath M,Murthy H M,Rajendran S,et al.Transformation of formants for voice conversion using artificial neural networks[].Speech Communication.1995
  • 5Stylianou Y,Cappe O,Moulines E.Continuous probabilistic transform for voice conversion[].IEEE Transactions on Speech and Audio Processing.1998
  • 6Kain A,Macon M.Spectral voice conversion for text-to-speech synthesis[].Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing.1998
  • 7C. H. Wu,C. C. Hsia,T. H. Liu,,and J. F. Wang.Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis[].IEEE Trans on Audio Speech and Language Processing.2006
  • 8O. Turk,and L. M. Arslan.Robust processing tech- niques for voice conversion[].Computer Speech and Language.2006
  • 9LEE C L,CHANG W W,CHIANG Y C.Spectral and prosodictransformations of hearing-impaired Mandarin speech[].Space Communications.2006
  • 10K. Shikano,,S. Nakamura,and M. Abe.Speaker ad- aptation and voice conversion by codebook mapping[].IEEE Proceeding of ISCAS.1991

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部