期刊文献+

基于语音知识的音节切分 被引量:4

Syllable Segmentation Based on Chinese Speech Knowledge
下载PDF
导出
摘要 在充分利用普通话水平测试试卷的文本信息、同一人的声母时长在常规语速下基本稳定、同一人的声母之间以及韵母之间的相对时长基本保持比例关系等先验知识的基础上,使用经小波变换后再重构的3个语音信号分量的累计能量特征为参数,提出了利用话者语音统计信息的两级音节切分算法,使音节切分精度达98.3%以上。 Many kinds of knowledge have been applied in this paper to separate the syllables,such as the prior information from the standard text of speech in Mandarin proficiency test,from the duration of initial in Mandarin speech which is stable in the normal speed speech,from the proportions of initials' durations in related to the finals' durations in one's speech and so on.A two-level syllable segmentation algorithm is proposed by using accumulating energies of the three wavelets which are re-constructured from wavelet transform.The experimental results demonstrat that the accuracy of syllable separation reaches to 98.3% at least.
出处 《中文信息学报》 CSCD 北大核心 2010年第4期91-95,共5页 Journal of Chinese Information Processing
基金 江门市科技三项资金资助
关键词 计算机应用 中文信息处理 音节切分 语音信号处理 普通话水平测试 computer application Chinese information processing syllable segmentation speech signal processing Mandarin proficiency test
  • 相关文献

参考文献21

二级参考文献66

  • 1刘宇红,刘桥,任强.基于改进的模糊ART的语音信号端点检测与切分[J].系统工程与电子技术,2004,26(8):1151-1154. 被引量:6
  • 2张红.基于听觉感知机理的语音特征研究.博士学位论文[M].西南交通大学电气工程学院,1998..
  • 3郑方 吴文虎 等.CDCPM及其在语音识别中的应用[J].软件学报,1996,7(10):69-75.
  • 4郑方 王承发 等.一个语文转换文本编辑器的实现.第5届全国人机语音通讯学术会议(NCMMSC'98)会议论文集[M].哈尔滨:哈尔滨工业大学出版社,1998.280-285.
  • 5Carpenter, GA Grossberg, S Rosen. DB Fuzzy ART: Fast Stable Learnin and Categoriation of Analog Patterns by an Adaptive Resonance System[J]. Neural Networks, 1991, 4: 759-771.
  • 6Carpenter, GA grossberg, S Rosen. DB Fuzzy ART: an Adaptive Resonance Algorithm for Rapid, Stable Classi-Fication of Analog Patterns [A]. In Proc. Int. Joint conf. Neural Networks[C].1991. 411 - 420.
  • 7Normandin Y. High-Performance Connected Digit Recongnition Using Maximum Mutual Information Estimation [ J ]. IEEE Trans.Speech and Audio Processing, 1994, 2(2): 299-311.
  • 8Davis S B, Mermelstein P. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences[J]. IEEE Trans. on ASSP, 1980, 28(4): 357 - 366.
  • 9[1]R. Bakis et al., Transcription of broadcast news shows with the IBM large vocabulary speech recognition system, proceedings of the Speech Recognition Workshop, 1997,67-72,1997
  • 10[2]F. Kubala et al. The 1996 BBN Byblos Hub-4 transcription system, Proceedings of the Speech Recognition Workshop, 1997,90-93

共引文献134

同被引文献31

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部