一种改进高斯混合模型均值项的语音转换方法

A method to improve the performence of GMM voice conversion by modifying mean value items

下载PDF

导出

摘要语音转换技术主要应用于计算机语音合成、计算机语音翻译、语音编辑、广播及多媒体等方面。高斯混合模型(GMM)是目前语音转换的主流方法,但它的最大不足是会导致转换频谱的过平滑。其中GMM转换函数中的均值项和相关项共同导致了过平滑现象,并且均值项的影响更大。为此提出了结合码本映射法和GMM方法的修正均值法,实验表明,使用修正均值法能够有效抑制过平滑问题,改善转换性能。 Voice conversion has application in text to speech synthesis, voice editing, broadcasting and multimedia voice applications. GMM is a mostly used algorithm in the applications of voice conversion. However it causes overfitting in the converted voice spectrum which affects the transformed voice＇s quality. This paper analyzed this problem and found that it is caused by both of the mean value and covariance items in transformation function. To improve the performance of voice conversion, this paper proposed a new method combined codebook mapping method and GMM. Objective evaluations show that this method reduces the effect of overfitting, and improves the converted voice＇s quality.

作者赵义正

机构地区合肥电子工程学院

出处《微型机与应用》 2012年第19期68-70,共3页 Microcomputer & Its Applications

关键词语音转换高斯混合模型码本映射法过平滑 voice conversion GMM codebook mapping method overfitting

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献6

1BENISTY H, MALAH D. Voice conversion using GMM with enhanced global variance [C]. INTERSPEECH 2011: 669-672.
2HELANDER E, VIRTANEN T, NURMINEN J, et al. Voice conversion using partial least squares regression[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2010, 18(5): 912-921.
3DESAI S, BLACK A W, YEGNANARAYANA B, et al. Voice conversion using artificial neural networks [C]. ICASSP 2009: 3893-3896.
4吕声,尹俊勋,黄建成.基于高斯混合模型和残差预测的说话人转换系统[J].电声技术,2004,28(6):33-36. 被引量：4
5Chen Yining, Chu Min. Voice conversion with smoothed GMM and MAP adaption[C]. Geneva, Switzerland: Proceedings of Eurospeech. 2003: 2413"2416.
6康永国,双志伟,陶建华,张维.基于混合映射模型的语音转换算法研究[J].声学学报,2006,31(6):555-562. 被引量：13

二级参考文献27

1左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量：32
2Kain A., Macon M.W. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In IEEE International Conference on Acoustics, Speech, and Signal Processing, Proceedings,2001,2:813-816.
3Arslan L. Speaker transformation algorithm using segment codebook. Speech Communication Journal. 1999, 28:211-226.
4Y. Stylianou, O. Cappe, E. Moulines. Statistical method for voice quality transformation. In Proc. EUROSPPECH, 1995.
5Y. Stylianou, O. Cappe, E. Moulines. Continuous probabilistic transform for voice conversion In IEEE Transaction on speech and audio processing, 1998,6 (2):131-142.
6Arslan L, Talkin D. Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum. In:Proceedings of Eurospeech, Rhodes, Greece, 1997: 1347-1350
7SHUANG Zhiwei, WANG Zixiang, LING Zhenhua, WANG Renhua. A novel voice conversion system based on codebook mapping with phoneme-tied weighting. In: Proceedings of ICSLP, Jeju, 2004:1197-1200
8Stylianou Y et al. Continuous probabilistic transform for voice conversion. IEEE Transactions on Speech and Audio Processing, 1998; 6(2): 131-142
9Alexander Blouke Kain. High resolution voice transformation. Ph.D. dissertation, Oregon Health and Science University, October 2001
10Valbret H et al. Voice transformation using PSOLA technique. Speech Communication, 1992; 11(2-3): 175-187

共引文献14

1夏菁,尹俊勋,黄建成,黄锋.基于正弦加噪声模型的说话人转换方法[J].电声技术,2005,29(2):49-52. 被引量：1
2张凯,朱立新,赵义正.基于重训练高斯混合模型的语音转换方法[J].声学技术,2010,29(1):52-55. 被引量：3
3赵义正.改进GMM谱包络转换性能的语音转换算法研究[J].科学技术与工程,2010,10(17):4172-4174. 被引量：3
4赵义正.一种新的分维高斯混合模型语音转换方法[J].计算机与现代化,2010(9):82-84.
5李燕萍,张玲华,丁辉.基于音素分类的汉语语声转换算法[J].南京邮电大学学报（自然科学版）,2011,31(1):10-15. 被引量：1
6陈雪勤,赵鹤鸣.有效高斯分量通用背景模型下耳语音声道系统转换研究[J].声学学报,2013,38(2):195-200. 被引量：5
7CHEN Xueqin,ZHAO Heming.Research of whispered speech vocal tract system conversion based on universal background model and effective Gaussian components[J].Chinese Journal of Acoustics,2013,32(4):400-410. 被引量：1
8解伟超,张玲华.基于自组织聚类和改进粒子群算法的语音转换方法[J].声学学报,2014,39(1):130-136. 被引量：1
9简志华,王向文.采用压缩感知的改进的语音转换算法[J].声学学报,2014,39(3):400-406. 被引量：5
10JIAN Zhihua,WANG Xiangwen.A modified voice conversion algorithm using compressed sensing[J].Chinese Journal of Acoustics,2014,33(3):323-333. 被引量：8

1祁玉生,阮永红.DS-CDMA系统多址干扰的改进高斯近似法分析[J].南京邮电学院学报,1998,18(5):33-36. 被引量：1
2徐波,刘洋.移动互联时代的语音识别技术[J].微电脑世界,2001(12):85-91.
3王民,王明明,王燕妮,王稚慧,赵伟.基于KLD改进高斯混合模型的语音转换技术[J].科教导刊（电子版）,2015,0(21):147-148.
4陈国亮.一种多用语音卡的设计与应用[J].电子技术（上海）,1995,22(2):9-10.
5张蓉.语音技术在客服中心的应用研究[J].中国信息化,2014(17):50-53. 被引量：2
6林道发,杨家沅.连续语音识别和语音翻译[J].计算机应用与软件,1994,11(2):15-19.
7无线通信设备[J].个人电脑,2003,9(8):141-141.
8苗新法,范春晓.依赖OSYNO6188的SMS TTS系统的实现[J].电子技术（上海）,2005,32(10):68-70.
9能说会听海尔V76手机[J].数字生活,2008,0(7):92-93.
10李哲学.改进高斯混合模型的遥感图像增强方法[J].激光杂志,2016,37(7):31-34. 被引量：4

微型机与应用

2012年第19期

浏览历史

内容加载中请稍等...

一种改进高斯混合模型均值项的语音转换方法

参考文献6

二级参考文献27

共引文献14

相关作者

相关机构

相关主题

浏览历史