摘要
在识别系统中,建模单元能够勾画一种语言的声学和语音学特性,因此对系统性能起到至关重要的作用。该文参照一些已在大词汇量连续语音识别系统(LVCSR)中取得较好效果的建模单元集,构建了新的音素建模单元集(Ne-wPS)。另外,根据NewPS中元音及其变体对前后接音素协同发音的影响,提出了基于扩展的元音三角图设计问题集(NewQS)的方法。实验表明:NewPS和NewQS结合的识别性能超越了传统的声韵母建模单元集;并且,建模单元数目大幅度的减少给系统后续模块的处理带来了便利。
Modeling units can be used to describe the salient acoustic and phonetic information for a language in speech recognition systems.Thus,they play a very important role in the system.This paper describes a phoneme set using several modeling units,which has good performance in large vocabulary continuous speech recognition(LVCSR) systems.A question set design method is given based on the extended vowel triangle.Tests show that the combination of the new phoneme set and the new question set surpasses the initial/final in performance.Also,the number of modeling units is greatly reduced which is more convenient for processing succeeding system modules.
出处
《清华大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2011年第9期1288-1292,1297,共6页
Journal of Tsinghua University(Science and Technology)
关键词
大词汇量连续语音识别
建模单元
元音三角图
问题集
主元音准则
large vocabulary continuous speech recognition(LVCSR)
modeling units
vowel triangle
question set
main vowel principle