Korean text-to-speech and concatenation cost function
Abstract: The concatenation cost function is one of the key factors determining the naturalness of synthesized speech. Previous work on concatenation cost functions considered only whether the feature vectors at the concatenation point match, and did not address the first-order continuity of those vectors. This paper describes the design and implementation of a Korean text-to-speech (TTS) system that uses triphones as synthesis units, and studies how the dynamic features of the units affect concatenation. A new concatenation cost function is designed to reflect feature continuity: the cost is computed from the feature vectors of the two units at the concatenation point together with their first-order differences. Experimental results show that the cost function based on dynamic-feature continuity improves spectral continuity at the joins and effectively improves the naturalness of the synthesized Korean speech.
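The abstract does not give the cost formula itself, so the following is only a minimal sketch of the idea it describes: a join cost that combines a static term (mismatch between the feature vectors at the concatenation point) with a dynamic term (mismatch between their first-order differences). The function names, the use of Euclidean distance, the one-frame differences, and the weights `w_static` and `w_delta` are all assumptions made for illustration, not the authors' definitions.

```python
from math import sqrt
from typing import List, Sequence

Frame = Sequence[float]  # one feature vector (e.g. spectral coefficients) per frame


def euclidean(a: Frame, b: Frame) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def first_difference(prev: Frame, curr: Frame) -> List[float]:
    """First-order difference between two consecutive frames."""
    return [c - p for p, c in zip(prev, curr)]


def concatenation_cost(left_unit: Sequence[Frame],
                       right_unit: Sequence[Frame],
                       w_static: float = 1.0,
                       w_delta: float = 1.0) -> float:
    """Hypothetical join cost: weighted static mismatch plus delta mismatch.

    left_unit ends at the concatenation point; right_unit starts there.
    The weights are illustrative assumptions, not values from the paper.
    """
    if len(left_unit) < 2 or len(right_unit) < 2:
        raise ValueError("each unit needs at least two frames")
    # Static term: do the feature vectors at the join agree?
    static_cost = euclidean(left_unit[-1], right_unit[0])
    # Dynamic term: do the trajectories into and out of the join agree?
    left_delta = first_difference(left_unit[-2], left_unit[-1])
    right_delta = first_difference(right_unit[0], right_unit[1])
    delta_cost = euclidean(left_delta, right_delta)
    return w_static * static_cost + w_delta * delta_cost


# Two candidate right-hand units with identical boundary frames.
left = [[0.0, 0.0], [1.0, 1.0]]
right_smooth = [[1.0, 1.0], [2.0, 2.0]]  # continues the rising trajectory
right_kink = [[1.0, 1.0], [0.0, 0.0]]    # reverses direction at the join

smooth_cost = concatenation_cost(left, right_smooth)
kink_cost = concatenation_cost(left, right_kink)
```

In this toy case both candidates match the left unit exactly at the boundary, so a static-only cost (`w_delta=0.0`) cannot tell them apart. The delta term is what penalizes the candidate whose trajectory reverses at the join.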
Source: Journal of Tsinghua University (Science and Technology), 2006, No. 4, pp. 596-599 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University core journal list.
Funding: Supported by the National Natural Science Foundation of China (60275014)
Keywords: speech synthesis; Korean text-to-speech (TTS); concatenation cost function