
Smoothing algorithm for contextual phone concatenation in speech synthesis (cited by: 1)
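Abstract (translated from the Chinese): To improve the performance of speech synthesis systems and produce natural, fluent synthesized speech, this paper combines several smoothing algorithms for concatenation-point transitions and proposes an English concatenative synthesis system based on hidden Markov models (HMMs) that uses context-dependent phones as its basic units. The method combines the advantages of concatenative synthesis and parametric synthesis, offering relative flexibility and a reasonable degree of speech naturalness. Using the phone as the basic unit keeps the number of concatenation points as low as possible and reduces concatenation distortion. Experimental results show that applying multiple smoothing algorithms keeps the transitions at concatenation boundaries smooth and coherent and improves the final concatenation quality.

Abstract (published English version, truncated in the source): A hidden Markov model (HMM)-based English speech synthesis system was built using contextual phone concatenation to improve speech synthesis quality for natural speech. The smoothing algorithm is the key module to ensure both spectrum and prosody continuity between segments. This method flexibly concatenates the segments to generate relatively natural speech. The basic synthesis unit is the phone, which reduces the number of concatenative points and the distortion. A smoothing algorithm is used to smooth the phon...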
Source: Journal of Tsinghua University (Science and Technology), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2008, No. S1, pp. 640-644 (5 pages)
Keywords: speech synthesis; HMM-based concatenative synthesis; contextual phone; concatenative point smoothing

References (7)

  • 1 Tokuda K, Zen H, Black A. An HMM-based speech synthesis system applied to English. Proc of IEEE Workshop on Speech Synthesis. 2002
  • 2 Tokuda K, Yoshimura T, Masuko T, et al. Speech parameter generation algorithms for HMM-based speech synthesis. Proc of ICASSP. 2000
  • 3 Zen H, Masuko T, Tokuda K, et al. A hidden semi-Markov model based speech synthesis system. IEICE Transactions on Information and Systems. 2007
  • 4 Toda T, Tokuda K. A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Transactions on Information and Systems. 2007
  • 5 Hirai T, Tenpaku S. Using 5 ms segments in concatenative speech synthesis. Proc of th ISCA Speech Synthesis Workshop. 2004
  • 6 LING Zhenhua, WANG Renhua. HMM-based unit selection using frame sized speech segments. Proc of th ICSLP. 2006
  • 7 Chappell D T, Hansen J H L. Spectral smoothing for speech segment concatenation. Speech Communication. 2002

Co-cited literature (9)

  • 1 Zheng Yuling. Coarticulation at prosodic word boundaries: reflections on the naturalness of speech synthesis [J]. Journal of Tsinghua University (Science and Technology), 2008, 48(S1): 645-651. Cited by: 2
  • 2 Jiang Danning, Cai Lianhong, Tao Jianhua. A fundamental frequency modification algorithm with spectral compensation [J]. Journal of Tsinghua University (Science and Technology), 2004, 44(7): 974-977. Cited by: 1
  • 3 Zheng Yuling, Can Jianfen, Bao Huaiqiao. Co-articulation and prosodic hierarchy [C] // Second International Conference on Tonal Aspects of Languages. La Rochelle, France, 2006: 145-150.
  • 4 Matsumoto H, Hiki S, Sone T, et al. Multidimensional representation of personal quality of vowels and its acoustical correlates [J]. IEEE Trans on Audio and Electroacoustics, 1973, 21(5): 428-436.
  • 5 Furui S. Digital Speech Processing, Synthesis, and Recognition [M]. New York: Marcel Dekker Inc, 1989.
  • 6 Gutiérrez-Arriola J M, Montero J M, Vallejo J A, et al. A new multi-speaker formant synthesizer that applies voice conversion techniques [C] // Proc Eurospeech. Aalborg, Denmark: ISCA, 2001: 357-360.
  • 7 Rao K S, Yegnanarayana B. Prosodic manipulation using instants of significant excitation [C] // Int Conf Acoust Speech Signal Processing. Maryland, USA, 2003: 234-238.
  • 8 Rabiner L, Juang B-H. Fundamentals of Speech Recognition [M]. New Jersey: Prentice Hall, Inc, Upper Saddle River, 1993.
  • 9 Zhou Xunyi, Wang Bei, Yang Yufang, Li Xiaoqing. The effect of coarticulation on syllable perception in sentences [J]. Acta Psychologica Sinica, 2003, 35(3): 340-344. Cited by: 10

Citing literature (1)

Second-level citing literature (2)
