
Objective Measure of Naturalness for Concatenate Speech Synthesis (合成语音自然度客观测度)
Cited by: 2
Abstract: The naturalness of synthesized speech still has room for improvement. This paper proposes an objective method for evaluating the naturalness of concatenative synthesized speech, based on the main prosodic parameters of speech. The method computes the differences in fundamental frequency (pitch), duration, and intensity between natural speech and synthesized speech from the same speaker. Because the fundamental-frequency contours of the two utterances do not line up in time, the DTW (Dynamic Time Warping) algorithm is used to time-align them before they are compared. The objective results are then compared with subjective Mean Opinion Score (MOS) ratings. Experimental data show that the proposed fundamental-frequency contour distortion measure correlates strongly with MOS, which indicates that an evaluation made from the prosodic perspective can measure the naturalness of synthesized speech.
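The abstract names the ingredients of the measure (DTW alignment of the two fundamental-frequency contours, a distortion value, and a correlation with MOS) but not the exact formulas. The following is a minimal sketch under stated assumptions, not the authors' implementation: the contours are assumed to be lists of voiced-frame F0 values in Hz, the local cost is assumed to be the absolute difference, and the distortion is assumed to be the accumulated cost of the cheapest warping path divided by that path's length. The function names `dtw_f0_distortion` and `pearson` are hypothetical.

```python
import math


def dtw_f0_distortion(natural_f0, synthetic_f0):
    """Accumulated cost of the cheapest DTW path, divided by its length (Hz).

    Assumptions (not taken from the paper): inputs are voiced-frame F0
    values in Hz, and the local cost is the absolute F0 difference.
    """
    n, m = len(natural_f0), len(synthetic_f0)
    if n == 0 or m == 0:
        raise ValueError("both F0 contours must be non-empty")

    # cost[i][j]: minimal accumulated cost of aligning the first i natural
    # frames with the first j synthetic frames.
    # steps[i][j]: number of frame pairs on that cheapest path.
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    steps = [[0] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(natural_f0[i - 1] - synthetic_f0[j - 1])
            # Allowed moves: diagonal (match), vertical, horizontal.
            # Tuples compare by cost first, then by path length on ties.
            prev_cost, prev_steps = min(
                (cost[i - 1][j - 1], steps[i - 1][j - 1]),
                (cost[i - 1][j], steps[i - 1][j]),
                (cost[i][j - 1], steps[i][j - 1]),
            )
            cost[i][j] = prev_cost + local
            steps[i][j] = prev_steps + 1

    return cost[n][m] / steps[n][m]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)
```

In use, one distortion value would be computed per utterance pair and the list of distortions passed to `pearson` together with the corresponding MOS ratings. Since a larger distortion should mean less natural speech, a useful measure of this kind would be expected to show a strongly negative coefficient.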
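Authors: Zhao Bo (赵博), Cai Lianhong (蔡莲红)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list; 2005, No. 7, pp. 32-33, 152 (3 pages)
Funding: National Natural Science Foundation of China (Grant No. 60275014)
Keywords: speech synthesis, evaluation, naturalness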
