期刊文献+

一种改进高斯混合模型均值项的语音转换方法

A method to improve the performence of GMM voice conversion by modifying mean value items
下载PDF
导出
摘要 语音转换技术主要应用于计算机语音合成、计算机语音翻译、语音编辑、广播及多媒体等方面。高斯混合模型(GMM)是目前语音转换的主流方法,但它的最大不足是会导致转换频谱的过平滑。其中GMM转换函数中的均值项和相关项共同导致了过平滑现象,并且均值项的影响更大。为此提出了结合码本映射法和GMM方法的修正均值法,实验表明,使用修正均值法能够有效抑制过平滑问题,改善转换性能。 Voice conversion has application in text to speech synthesis, voice editing, broadcasting and multimedia voice applications. GMM is a mostly used algorithm in the applications of voice conversion. However it causes overfitting in the converted voice spectrum which affects the transformed voice's quality. This paper analyzed this problem and found that it is caused by both of the mean value and covariance items in transformation function. To improve the performance of voice conversion, this paper proposed a new method combined codebook mapping method and GMM. Objective evaluations show that this method reduces the effect of overfitting, and improves the converted voice's quality.
作者 赵义正
出处 《微型机与应用》 2012年第19期68-70,共3页 Microcomputer & Its Applications
关键词 语音转换 高斯混合模型 码本映射法 过平滑 voice conversion GMM codebook mapping method overfitting
  • 相关文献

参考文献6

  • 1BENISTY H, MALAH D. Voice conversion using GMM with enhanced global variance [C]. INTERSPEECH 2011: 669-672.
  • 2HELANDER E, VIRTANEN T, NURMINEN J, et al. Voice conversion using partial least squares regression[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2010, 18(5): 912-921.
  • 3DESAI S, BLACK A W, YEGNANARAYANA B, et al. Voice conversion using artificial neural networks [C]. ICASSP 2009: 3893-3896.
  • 4吕声,尹俊勋,黄建成.基于高斯混合模型和残差预测的说话人转换系统[J].电声技术,2004,28(6):33-36. 被引量:4
  • 5Chen Yining, Chu Min. Voice conversion with smoothed GMM and MAP adaption[C]. Geneva, Switzerland: Proceedings of Eurospeech. 2003: 2413"2416.
  • 6康永国,双志伟,陶建华,张维.基于混合映射模型的语音转换算法研究[J].声学学报,2006,31(6):555-562. 被引量:13

二级参考文献27

  • 1左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量:32
  • 2Kain A., Macon M.W. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In IEEE International Conference on Acoustics, Speech, and Signal Processing, Proceedings,2001,2:813-816.
  • 3Arslan L. Speaker transformation algorithm using segment codebook. Speech Communication Journal. 1999, 28:211-226.
  • 4Y. Stylianou, O. Cappe, E. Moulines. Statistical method for voice quality transformation. In Proc. EUROSPPECH, 1995.
  • 5Y. Stylianou, O. Cappe, E. Moulines. Continuous probabilistic transform for voice conversion In IEEE Transaction on speech and audio processing, 1998,6 (2):131-142.
  • 6Arslan L, Talkin D. Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum. In:Proceedings of Eurospeech, Rhodes, Greece, 1997: 1347-1350
  • 7SHUANG Zhiwei, WANG Zixiang, LING Zhenhua, WANG Renhua. A novel voice conversion system based on codebook mapping with phoneme-tied weighting. In: Proceedings of ICSLP, Jeju, 2004:1197-1200
  • 8Stylianou Y et al. Continuous probabilistic transform for voice conversion. IEEE Transactions on Speech and Audio Processing, 1998; 6(2): 131-142
  • 9Alexander Blouke Kain. High resolution voice transformation. Ph.D. dissertation, Oregon Health and Science University, October 2001
  • 10Valbret H et al. Voice transformation using PSOLA technique. Speech Communication, 1992; 11(2-3): 175-187

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部