The text design for continuous speech database of standard Chinese


Abstract: Well-developed continuous speech recognition and synthesis systems demand a high-quality continuous speech database that is compact and valid, and whose scientific design benefits from incorporating linguistic and phonetic knowledge. It is argued that at the present stage the database should be limited to read speech. To describe the very complex variability in continuous speech, the following speech units are proposed: (1) 401 syllables without tone; (2) 415 inter-syllabic diphones; (3) 3035 inter-syllabic triphones; (4) 781 inter-syllabic final-initial structures. The 17 basic sentence patterns of standard Chinese are summarized to cover the most important prosodic phenomena. Using an automatic method, 2393 sentences and 388 phrases are selected according to the above phonetic rules from a large corpus, which includes recent years of People's Daily, TV play scripts and dictionary entries, as the reading text of a continuous speech recognition database for standard Chinese. This set of sentences and phrases covers 99.8% of syllables (not counting tones), 100% of inter-syllabic diphones, 99.6% of inter-syllabic triphones and 100% of sentence patterns.
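The abstract does not say which automatic method was used to select the sentences. One common way to pick a compact text set that covers a target inventory of units is greedy set cover, sketched below under that assumption. The function names `greedy_select` and `coverage`, the sentence ids and the toy syllable sets are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set


def greedy_select(candidates: Dict[str, Set[str]], targets: Set[str]) -> List[str]:
    """Greedy set cover: repeatedly take the sentence that adds the most
    still-uncovered units, and stop when no sentence adds anything new."""
    uncovered = set(targets)
    chosen: List[str] = []
    while uncovered:
        # sorted() makes tie-breaking deterministic (smallest id wins).
        best = max(
            sorted(candidates),
            key=lambda s: len(candidates[s] & uncovered),
            default=None,
        )
        if best is None or not (candidates[best] & uncovered):
            break  # remaining units do not occur in any candidate
        chosen.append(best)
        uncovered -= candidates[best]
    return chosen


def coverage(chosen: List[str], candidates: Dict[str, Set[str]], targets: Set[str]) -> float:
    """Fraction of target units that occur in the chosen sentences
    (targets is assumed to be non-empty)."""
    covered = set().union(*(candidates[s] for s in chosen)) & targets
    return len(covered) / len(targets)


# Toy data (hypothetical): each sentence id maps to the toneless syllables it contains.
candidates = {
    "s1": {"ni", "hao"},
    "s2": {"hao", "ma"},
    "s3": {"ni", "hao", "ma", "zhong"},
    "s4": {"guo"},
}
targets = {"ni", "hao", "ma", "zhong", "guo", "xue"}

selected = greedy_select(candidates, targets)
ratio = coverage(selected, candidates, targets)
```

In this toy run the unit "xue" occurs in no candidate sentence, so coverage stays below 100%, which is the same kind of shortfall as the 99.8% and 99.6% figures reported in the abstract. The same routine would apply to diphones, triphones or sentence patterns by changing what each set contains.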

Author: ZU Yiqing (Institute of Linguistics, Chinese Academy of Social Sciences, Beijing 100732)

Source: Chinese Journal of Acoustics (声学学报, English edition), 1999, No. 1, pp. 56-69 (14 pages)

Keywords: The text design for continuous speech database of standard Chinese

Classification: TN912 [Electronics and Telecommunications: Communication and Information Systems]

