期刊文献+

基于高斯相似度分析的插值自适应算法 被引量:2

Interpolation Adaptation Algorithm Based on Gaussian Similarity Analysis
下载PDF
导出
摘要 快速说话人自适应算法在非特定人连续语音识别的应用中有重要意义 .现在流行的自适应算法多数只考虑均值的自适应 .本文提出的自适应算法可以快速的对协方差矩阵进行自适应 .该算法是用高斯相似度度量协方差矩阵间的距离 ,并由此测度建立了反映协方差矩阵结构关系的二叉决策树 .树的每个中间节点包含一个类质心 .在决策树基础上 ,训练多个与特定人模型相关的类质心 .自适应时 ,通过对这些类质心进行线性插值得到自适应的协方差矩阵 .实验结果表明 ,该方法能够在仅有一句自适应数据的情况下 ,使系统误识率由 2 9 4 9%下降到 2 7 5 5 % . Fast speaker adaptation is very important in application of speaker independent continuous speech recognition.Recently speaker adaptation methods almost just update mean vector.A new adaptation algorithm proposed in this paper can adapt covariance matrixes rapidly.Gaussian similarity is used for measuring the distance of different covariance.A binary decision tree is constructed with this measure.And each middle node includes a covariance matrix of cluster center.Lots of covariance matrixes corresponding to speaker dependent model are trained based on this tree.During adaptation,covariance matrixes are updated through linear interpolating those covariances of cluster center.It can be shown from the experiments that error rate is descended from 29 49% to 27 55% in case of just one adapted sentence.
出处 《电子学报》 EI CAS CSCD 北大核心 2001年第z1期1759-1761,共3页 Acta Electronica Sinica
基金 "98 5"重大项目 (985校 2 2 攻关 0 6)
关键词 连续语音识别 快速说话人自适应 高斯相似度分析 continuous speech recognition rapid speaker adaptation Gaussian similarity analysis
  • 相关文献

参考文献7

  • 1[1]Padmanabhan M,Bahl LR,Picheny MA.Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems [C].Proceedings of ICASSP.1996,(2):701-704.
  • 2[2]Gauvain JL,Lee CH.Maximum a posteriori estimation for multvariate gaussian observations of markov chains [J].IEEE Transaction.Audio Speech Processing,1994,2(2):291-298.
  • 3[3]Legetter CJ,Woodland PC.Maximum likelihood linear regression for speaker adaptation of continuous density HMM' s [J].Computer Speech and Language,1995,9(2):171-186.
  • 4[4]Gao Yuqing,Padmanabhan M,Picheny M.Speaker adaptation based on pre-clustering training speakers [C].Proeedings of Eurospeech.1997:2091-2094.
  • 5[5]Wang Zuoying,Liu Feng.Speaker adaptation using maximum likelihood model interpolation [C].Proceedings of ICASSP.1999.(2):1368-1372.
  • 6[6]V Digalakis,S Berkowitz,et al.Rapid speech recoognizer adaptation to new Speakers [C].Proceedings of ICASSP.1999,(2):2102-1206.
  • 7[7]Wu Ji,Wang Zouying.A decision tree-strmctured algorithm of speaker adaptation based on gaussian similarity analysis [J].Chinese Journal of Electronics.2001,10(2):166-169.

同被引文献11

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部