
AN APPROACH BASED ON MULTIPLE VECTOR QUANTIZATION FOR ON LINE TEXT INDEPENDENT SPEAKER IDENTIFICATION (cited by: 1)
Abstract: The traditional approach trains each speaker's codebook on speech from a single session, and recognition systems built this way are often not robust. To adapt to intra-speaker variation over time, this paper trains speaker models on speech from multiple sessions, so that every speaker has multiple codebooks. These codebooks are obtained through an optimization procedure that progressively reduces the recognition error rate. To compensate for the effect of different transmission channels on recognition performance, a channel compensation method is presented. The paper also proposes using the features of a single high-energy voiced frame in place of the features of a whole voiced phoneme, which enables online extraction of voiced-sound features, and it uses two-level vector quantization with a codebook index strategy to reduce the recognition computation by 44%. Together these methods greatly increase the recognition speed and robustness of the system. Finally, the paper compares the speaker identification results obtained with perceptually based linear predictive (PLP) analysis and with LPC cepstrum analysis.
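The multiple-codebook idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how a speaker's codebooks are combined at identification time, so the rule used here (score each speaker by the lowest average distortion over that speaker's codebooks, then pick the speaker with the lowest score) is an assumption, as are the function names `distortion` and `identify`, the dictionary layout, and the toy 2-D features. Codebook training, channel compensation, and the two-level quantization are not shown.

```python
import math


def distortion(frames, codebook):
    """Average squared Euclidean distance from each frame to its nearest codeword."""
    total = 0.0
    for frame in frames:
        total += min(
            sum((x - c) ** 2 for x, c in zip(frame, codeword))
            for codeword in codebook
        )
    return total / len(frames)


def identify(frames, speaker_codebooks):
    """Return (speaker, score) for the speaker with the lowest distortion.

    speaker_codebooks maps a speaker name to a list of codebooks, one per
    training session (hypothetical layout, not taken from the paper).
    Assumption: a speaker's score is the minimum over that speaker's codebooks.
    """
    best_speaker, best_score = None, math.inf
    for speaker, codebooks in speaker_codebooks.items():
        score = min(distortion(frames, cb) for cb in codebooks)
        if score < best_score:
            best_speaker, best_score = speaker, score
    return best_speaker, best_score


# Toy data: two speakers, two session codebooks each, two codewords per codebook.
speaker_codebooks = {
    "A": [[(0.0, 0.0), (1.0, 0.0)], [(0.0, 1.0), (1.0, 1.0)]],
    "B": [[(5.0, 5.0), (6.0, 5.0)], [(5.0, 6.0), (6.0, 6.0)]],
}
test_frames = [(0.0, 1.0), (1.0, 1.0), (1.0, 1.0)]
speaker, score = identify(test_frames, speaker_codebooks)
print(speaker, score)
```

In this toy case the test frames match speaker A's second session codebook exactly, which is the situation multiple codebooks are meant to cover: a single codebook from A's first session alone would give a higher distortion for the same speech.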
Source: Journal of Computer Research and Development (《计算机研究与发展》; indexed in EI, CSCD, and the Peking University core journal list), 1999, No. 6, pp. 712-716 (5 pages)
Funding: National "863" Program; National Natural Science Foundation of China
Keywords: online text-independent speaker identification, multiple-codebook vector quantization, speech recognition, transmission compensation

