摘要
本文主要研究连续语音中单词音节的神经网络建模问题.采用了一种富有特色的特征提取方法,并依据高维空间点覆盖理论,对实际连续数字语音的各不同数字音节,以人工切自连续数字语音中的2640个单字音节,构建连续语音中各不同数字音节的特征空间覆盖区,并使用7308个自连续数字语音中切分出的单字音节,利用仿生模式识别原理,进行了建模正确性验证.验证结果正确率达到97%以上,对同样数量的少量建模样本,识别率优于SVM方法.
The single figure syllable modeling based on neural network for continuous SloUch recognition is discussed. A new feature extraction method is used which mainly includes compressing single figure frames according to a certain inter-frame angle, extracting representative information comparing to standard single figure of fixed length. 2640 single figure syllables made from continuous speech are used to construct each kind of high dimensional space covering area. By biomimetic pattern recognition theory 7308 single figure syllables made from continuous speech are used to confirm this model in CASSANN-Ⅱ neural computer and get a quite good resuit. Experiments show the recognition rate is higher than SVM when the training samples are small.
出处
《电子学报》
EI
CAS
CSCD
北大核心
2005年第10期1883-1885,共3页
Acta Electronica Sinica
关键词
连续语音
单词音节
高维空间点覆盖
神经网络模型
continuous speech
high-dimensional space covering
single syllable
neural network modeling