期刊文献+

A Study of Bilinear Models in Voice Conversion

A Study of Bilinear Models in Voice Conversion
下载PDF
导出
摘要 This paper presents a voice conversion technique based on bilinear models and introduces the concept of contextual modeling. The bilinear approach reformulates the spectral envelope representation from line spectral frequencies feature to a two-factor parameterization corresponding to speaker identity and phonetic information, the so-called style and content factors. This decomposition offers a flexible representation suitable for voice conversion and facilitates the use of efficient training algorithms based on singular value decomposition. In a contextual approach (bilinear) models are trained on subsets of the training data selected on the fly at conversion time depending on the characteristics of the feature vector to be converted. The performance of bilinear models and context modeling is evaluated in objective and perceptual tests by comparison with the popular GMM-based voice conversion method for several sizes and different types of training data. This paper presents a voice conversion technique based on bilinear models and introduces the concept of contextual modeling. The bilinear approach reformulates the spectral envelope representation from line spectral frequencies feature to a two-factor parameterization corresponding to speaker identity and phonetic information, the so-called style and content factors. This decomposition offers a flexible representation suitable for voice conversion and facilitates the use of efficient training algorithms based on singular value decomposition. In a contextual approach (bilinear) models are trained on subsets of the training data selected on the fly at conversion time depending on the characteristics of the feature vector to be converted. The performance of bilinear models and context modeling is evaluated in objective and perceptual tests by comparison with the popular GMM-based voice conversion method for several sizes and different types of training data.
机构地区 不详
出处 《Journal of Signal and Information Processing》 2011年第2期125-139,共15页 信号与信息处理(英文)
关键词 Line Spectral Frequencies (LSF) Gaussian Mixture Model (GMM) BILINEAR Models (BL) SINGULAR Value DECOMPOSITION (SVD) Temporal DECOMPOSITION (TD) Factor Analysis Line Spectral Frequencies (LSF) Gaussian Mixture Model (GMM) Bilinear Models (BL) Singular Value Decomposition (SVD) Temporal Decomposition (TD) Factor Analysis
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部