期刊文献+

基于GMM和概率修正码本的源-目标说话人声门波转换 被引量:2

Glottal Flow Transformation from Source Speaker to Target Speaker Based on GMM and Probability Correct Codebook
下载PDF
导出
摘要 提出了一种用于源-目标说话人声门波导数参数转换的、基于勒让德正交分解的声门波导数波形参数提取方法。该方法将声门波导数波形在6维正交勒让德坐标系中的投影构成了描述其形状的特征矢量,并采用基于GMM的概率分类加权转换算法,使每个特征矢量的转换规则可由多个类所对应的规则的线性加权组合得到,可以使转换性能得到较大的提高。在此基础上,又给出了一种基于GMM的声门波导数波形的码本修正算法,以弥补声门波导数波形参数化而损失的含有说话人个性特征的高频送气分量和波纹分量。实验结果表明,本文方法转换性能明显好于基于矢量量化(VQ)的码本映射算法。 For high quality voice transformation, a novel parameter extraction scheme for glottal flow derivative is proposed based on Legendre orthogonal decomposition. The algorithm uses the six-dimensional Legendre orthogonal coefficients to form a vector for describing the shape of glottal flow derivative. Moreover, this paper utilizes probability weighted transformation algorithm based on Gaussian mixture model (GMM), which linearly combines a few rules derived improve be lost codeboo from each subclass transformation, thus the transformation accuracy is significantly d. Furthermore, to model high frequency aspirated and ripple information, which may in the procedure of parameterization for glottal flow derivative, a probability correct k is used to compensate such information. Experimental results are proved to be effective.
出处 《数据采集与处理》 CSCD 北大核心 2007年第1期19-24,共6页 Journal of Data Acquisition and Processing
关键词 声音转换 声门波导数 勒让德正交分解 高斯混合模型(GMM) 概率加权修正码本 voice transformation glottal flow derivative Legendre orthogonal decomposition Gaussian mixture model probability correct codebook
  • 相关文献

参考文献10

  • 1Childers D G,Ahn C.Modeling the glottal volume velocity waveform for three voice types[J].J Acoust Soc Amer,1995,97:505-519.
  • 2Plumpe M D,Quatieri T F,Reynolds D A.Modeling of the glottal flow derivative waveform with application to speaker identification[J].IEEE Transactions on Speech and Audio Processing,1999,5:221-234.
  • 3Moore E,Clements M.Algorithm for automatic glottal wave form estimation without the reliance on precise glottal closure information[C]//IEEE Proceedings of the International Conference on Acoustics,Speech,and Signal Processing.USA:IEEE,2004,I:101-104.
  • 4Childers D G,Lee C K.Vocal quality factors:analysis,synthesis and perception[J].J Acoust Soc Amer,1991,90:2390-2410.
  • 5Rosenberg A.Effect of glottal pulse shape on the quality of natural vowels[J].J Acoust Soc Amer,1971,8:583-590.
  • 6Milenkovic P.Voice source model for continuous control of pitch period[J].J Acoust Sot Amer,1993,6:1087-1096.
  • 7Fant G,Liljencrants J,Lin Q.A four parameter model of glottal flow[C]//STL-QPSR 4.Franch-Swedish Symp.Grenoble:[s.n],1985:1-13.
  • 8Skoglund J.Analysis and quantization of glottal pulse shapes[J].Speech Communication,1998,24(4):133-152.
  • 9Childers D G.Glottal source modeling for voice conversion[J].Speech Communication,1995,16(2):127-138.
  • 10Chang E,Shi Y,Zhou J L,et al.Speech lab in a box:a mandarin speech toolbox to jumpstart speech related research[C]//Eurospeech 2001.Aalborg,Denmark:[s.n],2001:333-336.

同被引文献18

  • 1左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量:32
  • 2张晓洲,黄德智,蔡莲红.考虑帧间动态特征的音色变换算法[J].清华大学学报(自然科学版),2006,46(10):1767-1770. 被引量:1
  • 3Abe M, Nakamura S, Shikano K. Voice Conversion Through Vector Quantization, Proc. Of ICASSP, 1988, ( 1 ) :655-658.
  • 4Arslan L Speaker transformation algorithm using segment codebook. Speech Communication. 1999,28 (3) :211-226.
  • 5Stylianou Y, Cappe o, Moulines E. Continuous Probabilistic Transformation for Voice Conversion. IEEE Tran. on Speech and Audio Processing, 1998,6 (2) : 131-142.
  • 6Kain A. , Macon M.W. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. Proc. Of ICASSP,2001, (2) :813- 816.
  • 7Quafiefi T.F.离散时间语音信号处理-原理与应用.北京:电子工业出版社,2004.
  • 8Toda T, Saruwatari H, Shikano K. Voice conversion algo-rithm based on Gaussian mixture model with dynamic frequency warping of straight spectrum . Proc. Of ICASSP, 2001, (2) : 841- 844.
  • 9CHEN Yining, CHU Min, Chang E. Voice conversion with smoothed GMM and MAP adaptation [ C ] Proc of Euro-speech, Geneva, Switzerland, 2003, ( 1 ) : 2413- 2416.
  • 10Toda T, Black A W , Tokuda K. Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter, Proc. Of ICASSP, 2005, (1) : 9-12.

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部