
A High-Accuracy Approach for Predicting the Pronunciation of Out-of-Vocabulary English Words Based on Exemplar Learning
Abstract: In English TTS (text-to-speech) systems, speech is synthesized from the pronunciation of each word in the text. When processing real-world text, no dictionary, however large, can contain every word that appears, so an algorithm is needed to predict the pronunciation of words that are missing from the dictionary (out-of-vocabulary words). This paper introduces an approach based on exemplar learning and evaluates its performance on a large-scale English dictionary. Experimental results show that the method achieves a word-level pronunciation accuracy of 70.1%, clearly higher than previously published approaches.
Source: Journal of Computer Research and Development (《计算机研究与发展》; indexed in EI, CSCD, and the Peking University Core Journals list), 2004, Issue 5, pp. 796-801 (6 pages)
Keywords: machine learning; exemplar learning

References (8)

  • 1. M Dedina, H Nusbaum. PRONOUNCE: A Program for Pronunciation by Analogy. Computer Speech and Language, 1991, 5(1): 55-64
  • 2. R I Damper, Y Marchand, M J Adamson et al. A comparison of letter-to-sound conversion techniques for English text-to-speech synthesis. Proceedings of the Institute of Acoustics, 1998, 20(6): 245-254
  • 3. Y Marchand, R I Damper. A multi-strategy approach to improving pronunciation by analogy. Computational Linguistics, 2000, 26(2): 195-219
  • 4. V Pagel, K Lenzo, A Black. Letter to sound rules for accented lexicon compression. In: Robert H Mannell, Jordi Robert-Ribes eds. Proc of the 5th Int'l Conf on Spoken Language Processing, v91.5. Sydney, Australia: Australian Speech and Technology Association, Incorporated (ASSTA), 1998. 2015-2018
  • 5. H S Elovitz, R Johnson, A McHugh et al. Letter-to-sound rules for automatic translation of English text to phonetics. IEEE Trans on Acoustics, Speech and Signal Processing, 1976, 24(6): 446-459
  • 6. N McCulloch, M Bedworth, J Bridle. NETspeak: A re-implementation of NETtalk. Computer Speech and Language, 1987, (2): 284-301
  • 7. C Stanfill, D Waltz. Toward memory-based reasoning. Communications of the ACM, 1986, 29(12): 1212-1228
  • 8. T G Dietterich, G Bakiri. Error-correcting output codes: A general method for improving multiclass inductive learning programs. In: Dannenberg ed. Proc of the 9th National Conf on Artificial Intelligence, Vol 2. Anaheim, California: AAAI Press, 1991. 572-577

Note: The phoneme set used by the Elovitz rule set differs from that of the CMU dictionary. We converted its pronunciations to the phoneme set used by CMU, which may have some effect on the final accuracy results.
