基于音素HMM模型语音转换

Voice Conversion Using Phoneme-dependent HMMS

下载PDF

导出

摘要通过对语音转换的研究，提出了一种把源说话人特征转换为目标说话人特征的方法。语音转换特征参数分为两类：（1）频谱特征参数；（2）基音和声调模式。分别描述信号模型和转换方法。频谱特征用基于音素的2维HMMS建模，F0轨迹用来表示基音和音调。用基音同步叠加法对基音厨期、声调和语速进行变换。 This paper presents a voice conversion method based on transformation of characteristic features of source speaker towards a target.Voice characteristic features are grouped into two main categories：（1）the spectral features at formants;（2）the pitch and intonation patterns. Signal modeling and transformation methods for each group of voice features are outlined.The spectral features at formants are modeled using a set of two-dimension phoneme-dependent HMMS.F0 contour is used for modeling the pitch and intonation patterns of speech.A PSOLA based method is employed for transformation of pitch ,intonation patterns and speaking rate.

作者钱开华 QIAN Kai-hua （Nanjing Umversity of Posts and Telecoms Signal and Information Processing,Nanjing 210003,China）

机构地区南京邮电大学信号与信息处理

出处《电脑知识与技术》 2008年第4期132-134,共3页 Computer Knowledge and Technology

关键词语音转换语音频谱基频曲线声门激励 voice conversion speech spectrum pitch contour glottal excitation

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1左国玉,刘文举,阮晓钢.声音转换技术的研究与进展[J].电子学报,2004,32(7):1165-1172. 被引量：32
2李波,王成友,蔡宣平,唐朝京,张尔扬.语音转换及相关技术综述[J].通信学报,2004,25(5):109-118. 被引量：34

二级参考文献82

1初敏.韵律研究与合成语音的自然度[A].第五届全国现代语音学学术会议.新世纪的现代语音学[C].北京: 清华大学出版社,2001.295-301.
2H Kuwabara and Y Sagisaka.Acoustic characteristics of speaker individuality:control and conversion[J].Speech Communication.1995,16(2):165-173.
3D Klatt and L C Klatt.Analysis,synthesis,and perception of voice quality variations among female and male talkers[J].J Acoust Soc Am,1990,87(2):820-857.
4P H Milenkovic.Voice source model for continuous control of pitch period[J].J Acoust Soc Am,1993,93(2):1087-1096.
5H Matsumoto,et al.Multidimensional representation of personal quality of vowels and its acoustical correlates[J].IEEE Trans Audio and Electroacoustics,1973,21(5):428-436.
6S Furui.Research on individuality features in speech waves and automatic speaker recognition techniques [J].Speech Communication,1986,5(2):183-197.
7K S Lee,et al.A new voice transformation based on both linear and nonlinear prediction[A].Proc ICSLP[C].Philadelphia,USA:ESCA,1996.1401-1404.
8L M Arslan.Speaker transformation algorithm using segmental codebooks (STASC)[J].Speech Communication,1999,28(3):211-226.
9H Mizuno and M Abe.Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt[J].Speech Communication.1995,16(2):165-173.
10T Yoshimura,et al.Speaker interpolation in HMM-based speech synthesis system[A].Proc.Eurospeech [C].Rhodes,Greece:ESCA,1997.2523-2526.

共引文献57

1岳振军,王浩,张雄伟.基于正弦谐波模型和BP神经网络的语音变换算法及实现[J].信号处理,2005,21(z1):208-211. 被引量：7
2孙健,贾永兴,陈向东.一种基于DCT和PSOLA的语音变换方法[J].军事通信技术,2008,29(2):23-26.
3吴梅,冯瑞杰.试论一种语音转换系统的设计与实现[J].中亚信息,2010(S1):61-63.
4左国玉,刘文举,阮晓钢.语音转换技术在电话语音识别中的应用研究(英文)[J].系统仿真学报,2005,17(2):448-452.
5夏菁,尹俊勋,黄建成,黄锋.基于正弦加噪声模型的说话人转换方法[J].电声技术,2005,29(2):49-52. 被引量：1
6左国玉,刘文举,阮晓钢.一种使用声调映射码本的汉语声音转换方法[J].数据采集与处理,2005,20(2):144-149. 被引量：4
7李元良,李波,王成友.语音转换中基于系统单位冲激响应的频谱搬移方法[J].矿业研究与开发,2005,25(5):59-61. 被引量：1
8陆静芳,李波,王成友.语音转换中系统单位冲激响应的频谱搬移方法研究[J].现代电子技术,2005,28(24):40-42.
9王浩,苏巨诗,许胜华,岳振军.基于正弦谐波模型的语音变换算法及实现[J].解放军理工大学学报（自然科学版）,2005,6(6):525-530.
10符敏,程德福.支持向量回归在声音转换中的应用[J].电声技术,2006,30(3):45-48. 被引量：1

1曾超.汉语语音实时声调模式识别研究（上）[J].电子技术参考,1996(2):1-12.
2曾超.汉语语音实时声调模式识别研究（下）[J].电子技术参考,1996(3):15-24.
3孙燕,姜占才,王得芳.语音频谱分析与应用[J].计算机与现代化,2010(4):200-202. 被引量：7
4李强,明艳.语音频谱分析仿真系统的实现[J].科学咨询,2009(23):91-91. 被引量：1
5卢一男,单宝钰,关超.声纹识别技术现状与发展应用[J].信息系统工程,2017,30(2):11-11. 被引量：5
6李昊璇.基于扩展卡尔曼滤波器的声门激励LF模型参数估计[J].测试技术学报,2013,27(5):425-430. 被引量：1
7俞振利,张礼和.从任意连续语音中实时提取说话人特征及三维显示[J].杭州大学学报（自然科学版）,1992,19(4):390-397.
8宋凌.基于主成分分析的说话人特征变换研究[J].电子技术与软件工程,2013(17):241-243. 被引量：1
9新型计算机语音识别系统功能接近大脑[J].测绘技术装备,2006,8(2):9-9.
10谢崇文,柴佩琪.中文文语转换系统中基于决策树的基频模型提取[J].微型电脑应用,2007,23(7):4-7.

电脑知识与技术

2008年第4期

浏览历史

内容加载中请稍等...

基于音素HMM模型语音转换

参考文献2

二级参考文献82

共引文献57

相关作者

相关机构

相关主题

浏览历史