期刊文献+

CUCBNC:一个引入播音学知识的广播新闻语音库 被引量:3

CUCBNC:a broadcasting news corpus integrated with broadcast announcing knowledge
原文传递
导出
摘要 该文描述了广播新闻语音库CUCBNC的构建过程。建设该语音库的目的是为了能将播音学相关知识应用到言语工程中。为此,通过解读播音学相关论述,提出了新的韵律特征,包括声音表达特征、语篇重音、意合群和复合韵律短语,并融入到CUCBNC语音库的韵律和文本标注规范中,目前已标注了约14h的语音数据。最后,通过观察相关韵律特征在标注数据中的统计分布,来检验融入了新特征的韵律标注规范是否合适。实验结果表明所提出的韵律特征是科学合理的。 This paper introduces CUCBNC,a broadcasting news corpus for applying broadcast announcing knowledge into speech engineering.The labeling process tokenized and integrated some knowledge from the broadcast announcing research into the annotation scheme.Therefore,some new prosody features are identified including voice expression characters,discourse stresses,meaning expression clusters and compound prosodic phrases.The annotated data includes about 14 hours of broadcasting speech announced by 2 women.The distribution of the new prosody features in the annotated data is analyzed to show that these prosodic features are reasonable.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2011年第9期1313-1316,共4页 Journal of Tsinghua University(Science and Technology)
基金 中国传媒大学211工程项目(21103010105) 中国传媒大学科研培育项目(P201012)
关键词 语音库 韵律标注 播音学 speech corpus prosody annotation broadcasting announcing
  • 相关文献

参考文献8

  • 1李爱军,陈肖霞,孙国华,华武,殷治纲.CASS:一个具有语音学标注的汉语口语语音库[J].当代语言学,2002,4(2):81-89. 被引量:9
  • 2李爱军,殷治纲,徐波,等.口语对话语音语料库CADCC和其语音研究[C]//第五届全国现代语音学学术会议论文集.北京:清华大学出版社,2001:317-322.
  • 3TSENG Shuchuan. Spoken corpora and analysis of natural speech [J]. Taiwan Journal of Linguistics, 2008, 6(2) : 1 - 26.
  • 4WANG Hsinmin, CHEN Berlin, KUO Jenwei, et al. MATBN: a Mandarin Chinese broadcast news corpus [J]. International Journal of Computational Linguistics and Chinese Language Processing, 2005, 10(2) : 219 - 236.
  • 5中国科学院计算技术研究所.自然广播语流语料库[Z/OL].[2011-05-01].http://www.chinip.csdb.cn/resourcelist.php?begin=20&count=20.
  • 6ZOU Yu, WU Jiyuan, HE Wei. Syntactic correlations of prosodic phrase in broadcasting news speech [C]// Proc the 6th IEEE International Conference on Natural Language Processing and Knowledge Engineering, Beijing: IEEE Press, 2010: 190- 194.
  • 7Institute of Linguistics, Chinese Academy of Social Sciences. C TOBI Ver 2.0 [Z/OL]. [2011 -05- 01]. http: //ling. cass. cn/yuyin/english/ctobi/ctobi, htm.
  • 8Chan M. Pan-Mandarin ToBI System [Z/OL]. [2011 -05- 01]. http: //people. cohums.ohio-state, edu/chan9/MToBI.htm.

二级参考文献13

  • 1陈肖霞.连续话语语料库的语音切分和标记[J].语言文字应用,2000(2):78-82. 被引量:6
  • 2林焘、王理嘉《语音学教程》168页,北京大学出版社,1999年.
  • 3Chen, Xiaoxia, Li Aijun, et al. 2000.An Application of SAMPA-C for Standard Chinese. The proceedings of International Conference on Spoken Language Processing(ICSLP2000).
  • 4Garofolo, J.S., lawel, L.F., Fisher, W. M., Fiscus, et al. 1986. The DARPA TIMIT Acousitc-Phoneic Continuous Speech corpus CDROM. NIST. [ www. ldc. upenn. edu/101/docs/TIMIT. html].
  • 5Leech, G., R. Garside, T. McEnery. 1997. Corpus Annotation: Linguistic Information .from Computer Text Corpora. Addison Wesley Longman.
  • 6Li, Aijun, Li Zhiqiang and Zu Yiqing. 1999. A National Database Designed and Prosodic Labeling for Speech synthesis. Proc. of Oriental COOCOSDA '99. TW.
  • 7Li, Aijun, Chen Xiaoxia, Sun Guohua, Hua Wu, Yin Zhigang, Zu Yiqing, Zheng Fang, Song Zhanjiang.2000. The phonetic labeling on read and spontaneous discourse corpora. The proceedings of International Conference on Spoken Language Processing(ICSLP2000).
  • 8Wells, John. 2000. Computer-coding the IPA : a proposed extensions of SAMPA. Unpublished notes, Department of Phonetics and Linguistics, University College London. http: //www. phon. ucl. ac. uk/home/sampa/home. htm
  • 9Zu, Yiqing. 2000. The Text Design for Continuous Speech Database of Standard Chinese. The Journal of Acoustic. No.1.
  • 10徐世荣.1980,《普通话语音常识》.北京:文字改革出版社.

共引文献8

同被引文献11

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部