期刊文献+

一种改进的线性区分分析方法及其在汉语数码语音识别上的应用 被引量:2

An Improved Linear Discriminant Analysis for Mandarin Digit Speech Recognition
下载PDF
导出
摘要 尽管汉语数码语音识别只涉及十个数字 ,但由于不同数字的发音存在相同或相似的声母或韵母 ,造成汉语数码语音之间的混淆性很大 .采用通常的隐含马尔科夫模型 (HMM)作为汉语数码语音识别模型难以得到很高的识别率 .为了解决汉语数码之间的混淆问题 ,提高汉语数码语音识别性能 ,本文在隐含马尔科夫模型的状态层次上采用线性区分分析方法 ,将不同状态之间容易混淆的特征样本构成混淆模式类 ,针对混淆模式类进行线性区分分析 .通过线性区分变换 ,在变换特征空间中仅保留那些能够有效区分该混淆类别的特征参数 .这种基于状态的线性区分分析有效地提高了模型对混淆数码的区分能力 .实验表明即使采用状态数很少的粗糙识别模型 ,也能很大幅度提高模型的识别性能 ;经过线性区分变换优化后的汉语数码识别模型 ,孤立汉语数码语音识别率可以达到 99 32 % . It is found that the phonetic similarities in the Mandarin digits are the main reasons for the difficulty of Mandarin digit recognition.In this paper,an improved linear discriminant analysis (LDA) based on the states of hidden Markov models (HMM) is presented.The recognition model discriminability is greatly improved by gathering the confusion data to the given states and then using the state-specific discriminative transformation.The experiments show that it increases the recognition rate greatly even if the simple models with insufficient states are used.The recognition accuracy of isolated Mandarin digits is over 99.32% after using optimal linear discriminative transformation.
出处 《电子学报》 EI CAS CSCD 北大核心 2002年第7期959-963,共5页 Acta Electronica Sinica
基金 国家自然科学基金 (No 699750 0 7) 国家 863项目 (No 863 30 6ZD1 3 0 4 6)
关键词 线性区分分析 LDA 汉语数码语音识别 区分变换 隐含马尔科夫模型 HMM LDA Mandarin digit speech recognition discriminative transformation
  • 相关文献

参考文献14

  • 1[1]K Fukunaga.Introduction to Statistical Pattern Recognition [M].NY:Academic Press,1990.
  • 2[2]M J Junt,C Lefebvre.Speaker dependent and independent speech recognition experiments with an auditory model [A].Proc.ICASSP [C].USA:ICASSP,1988.215-218.
  • 3[3]M J Hunt,C Lefebvre.A comparison of several acoustic representation for speech recognition with degraded and undegraded speech [A].Proc.ICASSP [C].USA:ICASSP,1989.262-265.
  • 4[4]C J Leggetter.Improved acoustic modelling for HMMs using linear transformations [D].UK:Cambridge,1995
  • 5[5]R Haeb-umbach,H Ney.Linear discriminant analysis for improved large vocabulary continuous speech recognition [A].Proc.ICASSP [C].USA:ICASSP,1992:13-16.
  • 6[6]E S Parris,M J Carey.Estimating linear discriminant parameters for ccintinuous density hidden Markov models [A].Proc.ICSLP [C].USA:ICASSP,1994.215-218.
  • 7[7]P C Woodland,D R Cole.Optimising hidden Markov models using discriminative output distributions [A].Proc.ICASSP [C].USA:ICASSP,1991.
  • 8[8]Duda Richard O,Hart Peter E.Pattern Classification and Scene Analysis [M].New York:John Wiley & Sons,Inc.,1973.
  • 9[9]E L Bocchieri,G R Doddington.Frame specific statistical features for speaker independent speech recognition [J].IEEE trans.ASSP,1986,34(4):755-764.
  • 10[10]G R Doddington.Phonetically sensitive discriminants for improved speech recognition [A].Proc.ICASSP [C].USA:ICASSP,1989.556-559.

同被引文献12

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部