Abstract
To address the significant degradation of speech recognition performance under noise, this paper proposes a robust speech recognition method based on random projection (RP) of the feature space and applies it to a speech recognition system for the in-car driving environment. First, the original speech feature parameters are linearly mapped into a new feature space using random matrices, so that the new features preserve the distances between the original features with maximum probability while following a distribution closer to Gaussian. A hidden Markov model (HMM) is then trained for every word. At test time, a majority voting strategy is applied to the initial pattern-matching results to reach the final recognition decision. Experiments on CENSREC-2, the in-car speech recognition database of the Information Processing Society of Japan, show that the random-projection features substantially improve speech recognition performance in the driving environment.
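The projection and voting steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 39-dimensional features, the Gaussian random matrices, the use of five projections, and the helper names `random_projection_matrix`, `project`, and `majority_vote` are all assumptions made for illustration, and HMM training and decoding are omitted entirely.

```python
from collections import Counter

import numpy as np


def random_projection_matrix(in_dim, out_dim, rng):
    """Gaussian random matrix, scaled so squared norms are preserved in expectation."""
    return rng.standard_normal((in_dim, out_dim)) / np.sqrt(out_dim)


def project(features, matrix):
    """Linearly map feature frames (rows) into the new feature space."""
    return features @ matrix


def majority_vote(labels):
    """Return the label that occurs most often among the initial decisions."""
    return Counter(labels).most_common(1)[0][0]


rng = np.random.default_rng(0)

# Assumed stand-in for speech features: 200 frames of 39 coefficients each.
features = rng.standard_normal((200, 39))

# Assumption: several independent random matrices, one recognizer per projection.
matrices = [random_projection_matrix(39, 39, rng) for _ in range(5)]
projected = [project(features, m) for m in matrices]

# Average ratio of projected to original squared frame norms, per projection.
norm_ratios = [
    float(np.mean(np.sum(p ** 2, axis=1) / np.sum(features ** 2, axis=1)))
    for p in projected
]

# Hypothetical per-projection word decisions, combined by majority voting.
initial_decisions = ["left", "right", "left", "left", "stop"]
final_decision = majority_vote(initial_decisions)
```

In a full system, each projected feature stream would presumably feed its own set of word HMMs, and the per-stream recognition results would take the place of the hard-coded `initial_decisions` list.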
Source
《计算机应用》
CSCD
Peking University Core Journals (北大核心)
2012, Issue 7, pp. 2070-2073, 2081 (5 pages)
Journal of Computer Applications