摘要
由于少数民族语言有其本身的特点,不能简单地套用现有的连续语音识别的方法.本文以蒙古语为例,研讨了声学和语言模型的建立,并在日本国际电气通信基础技术研究所的连续语音识别器上实现了蒙古语的语音识别系统.本文侧重于语言模型的建立,基于蒙古语黏着性语言特点,提出用相似词聚类方法建立多类N-gram模型.实验结果显示,应用我们提出的语言模型,识别精度比用传统的词的N-gram识别法提高了5.5%.
Because the minority languages in China have their special characteristics, it is not suitable to directly adopt the traditional automatic speech recognition (ASR) methods which are used for some major languages, such as Chinese, English, Japanese, etc. In this paper, we take Mongolian (a resource-deficient language) as an example and build the acoustic and language models for applying the ATRASR system. In this paper, we specially focus on the language modeling aspect by considering the special characteristics of the Mongolian. We trained a multi-class N-gram language model based on similar word clustering. By applying the proposed language model, the system could improve the performance by 5.5 % compared with the conventional word N-gram.
出处
《自动化学报》
EI
CSCD
北大核心
2010年第4期550-557,共8页
Acta Automatica Sinica
基金
日本独立行政法人情报通信研究机构多语言高新技术语音–文本处理研究项目资助~~
关键词
蒙古语
黏着语言
相似词分类
连续语语音识别
多类语言模型
Mongolian language, agglutinative language, similar word clustering, continuous speech recognition, multiclass N-gram model