期刊文献+

大规模语音语料库及其在TTS中应用的几个问题 被引量:12

Problems on Large-Scale Speech Corpus and the Applications in TTS
下载PDF
导出
摘要 首先介绍了大规模语音语料库以及基于大规模语音语料库的文语转换技术的研究现状,接着介绍了一个大规模连续汉语语音语料库的实例Slib的结构和内容;在此基础上,讨论了面向大规模语音语料库的索引技术,提出了语料库检索中的集合运算和最小包容问题,证明了最小包容问题是NP完全的,给出了求解该问题的贪婪算法以及算法的近似比;最后,讨论了基于集合运算的大规模语音语料库的检索技术在文语转换系统中的应用,特别是在基本语言单位实例的选取问题上实现了一种基于最小包容的优化方法,对提高文语转换系统的自然度有实用价值. The recent advances of large-scale speech corpus (LSSC) and text-to-speech (TTS) technologies are briefly reviewed,then the architecture and annotation information of a large-scale speech corpus Slib are introduced.Based on Slib,the LSSC-oriented indexing methods is discussed,the set operations and the minimum cover problem related to information retrieval in LSSC are presented.The minimum cover problem is a NP-complete problem,and a greedy algorithm is proposed to obtain an approximation solution.The approximation ratio of the proposed algorithm is analyzed.The application and realization of set operations in TTS are presented,and an approach for choosing proper speech instances of linguistic units based on minimum cover is developed,which can improve the naturalness of the synthesized speech of TTS system.
出处 《计算机学报》 EI CSCD 北大核心 2010年第4期687-696,共10页 Chinese Journal of Computers
基金 国家自然科学基金(60572125)资助~~
关键词 语音语料库 集合运算 文语转换 最小包容 信息检索 speech corpus set operation text to speech minimum cover information retrieval
  • 相关文献

参考文献15

  • 1孙岭 胡郁 王仁华.中文语音合成系统中的语料库设计[A]..第六届全国人机语音通讯学术会议论文集[C].,2001..
  • 2汤胜良,张士礼,张志平,吴玺宏,迟惠生.基于新闻联播语料库的语音合成系统//第八届全国人机语音通讯学术会议.北京,2005.
  • 3王天庆,李爱军.连续汉语语音识别语料库的设计//第6届全国现代语音学学术会议.天津,2003.
  • 4蔡莲红,崔丹丹,蔡锐.汉语普通话语音合成语料库TH-CoSS的建设和分析[J].中文信息学报,2007,21(2):94-99. 被引量:12
  • 5李爱军,殷治纲,王茂林,徐波,宗成庆.口语对话语音语料库CADCC和其语音研究//第5届现代语音学学术会议文集.北京,2001.
  • 6Tao Jianhua, Yu Jian, Kang Yongguo. An expressive mandarin speech eorpus//Proceedings of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques. Bali Island, Indonesia, 2005.
  • 7Wu Tian, Yang Yingchun, Wu Zhaohui, Li Dongdong. 2006 MASC: A speech corpus in mandarin for emotion analysis and affective speaker recognition//Proceedings of 2006 IEEE Odyssey--The Speaker and Language Recognition Workshop. San Juan, Puerto Rico, 2006.
  • 8Chou Fu-Chiang, Tseng Chiu-Yu, Lee Lin-Shan. A set of corpus-based text-to-speech synthesis technologies for mandarin Chinese. IEEE Transactions on Speech and Audio Processing, 2002, 10(7): 481-494.
  • 9Chou F C, Tseng C Y, Lee L S. Selection of waveform units for corpus-based mandarin speech synthesis based on decision trees and prosodic modification costs//Proceedings of the Eurospeech. Budapest, Hungary, 1999.
  • 10Wang H C, Seide F, Tseng C Y, Lee L S. MAT-2000- Design, collection, and validation of a mandarin 2000-speaker telephone speech database//Proceedings of the 6th International Conference on Spoken Language Processing. Beijing, 2000.

二级参考文献6

  • 1蔡莲红,赵世霞.汉语语音合成语料库的研究与建立[J].语言文字应用,1999(3):97-102. 被引量:6
  • 2崔丹丹,蔡莲红.基于决策树的语料库分析[J].计算机工程,2006,32(21):3-5. 被引量:2
  • 3Weibin Zhu, Wei Zhang, Corpus Building for Data-driven TTS Systems [A]. In: Proceedings of 2002IEEE Workshop on Speech Synthesis [C]. 11-13 Sept.2002. 199-202.
  • 4孙岭,胡郁,王仁华.中文语音合成系统中的语料库设计[A].第六届全国人机语音通讯学术会议[C].深圳:2001.11.
  • 5Yiqing ZU, Yingzhi CHEN. A Super Phonetic System and Multi-dialect Chinese Speech Corpus for Speech Recognition [A]. In: ISCSLP [C]. 2002.
  • 6Blouin, C.,Bagshaw, P.C., Rosec, O.. A Method of Unit Pre_selection of Speech Synthesis Based on Acoustic Clustering and Decision trees [A]. In: ICASSP[C]. 2003.

共引文献13

同被引文献131

引证文献12

二级引证文献54

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部