期刊文献+

Viterbi和DTW算法的关系分析——在非特定人手语识别中的应用 被引量:7

Mapping Analysis Between Viterbi and DTW Algorithms——Application to the Identification of Signer Independent Sign Language
下载PDF
导出
摘要 在经典的模式识别理论中,Viterbi算法代表了统计概率的模式匹配算法,而DTW算法代表了模版匹配的模式匹配算法,它们之间是否存在关系至今尚无定论.为了找到这两种算法之间的关系,在"类别隶属度"是广义概率的假设前提下,应用模糊数学的理论在Viterbi算法与DTW算法之间建立起联系.首先,提出了利用模糊数学的贴近度把DTW算法的"距离"向Viterbi算法的"概率"转化的通用贴近度表达式,并对通用贴近度表达式给出了理论上的证明.其次,应用DTW的通用贴近度表达式重估HMM参数,建立DTW算法与Viterbi算法之间的模糊贴近度关系,并为此提出了δ-ε算法,得到基于数据帧的类似于HMM的参数重估形式.然后,为了确保建立DTW算法与Viterbi算法之间的模糊贴近度关系的正确性,以定理的形式给出了相应的证明.再次,通过设定的DTW贴近度表达式对HMM参数重估的过程中,发现了DTW贴近度的重估参数与HMM重估参数之间存在着的模糊关系,以定理的形式对这种模糊关系加以证明.最后,依据上述定理提出了Dtw-ViterbiⅠ,Ⅱ,Ⅲ算法,以定理的形式对Dtw-ViterbiⅠ,Ⅱ,Ⅲ算法的正确性加以证明,并将对Dtw-ViterbiⅠ,Ⅱ,Ⅲ算法应用于非特定人手语的识别.实验表明,把DTW算法的路径搜索策略以概率的形式引进到Viterbi算法中,能够以削减候选词集的方式部分消除非特定人手语识别的误识,从而提高大词汇量情况的下非特定人手语识别的识别率和速度. In classical pattern classification theory, Viterbi algorithm represents pattern matching algorithm of statistic probability. However, DTW algorithm represents pattern matching algorithm of template matching algorithm. Whether there is any relationship between them have not been presented clearly. Aiming at this problem, the authors set up relationship between Viterbi algorithm and DTW algorithm based on application of fuzzy math theory under the premise that "the category of fuzzy math membership is the general probability". Firstly, they propose the common closeness degree expression transferring "distance" of DTW algorithm to "probability" of Viterbi algorithm making use of closeness degree in fuzzy math and prove the common closeness degree expression theoretically. Secondly, the HMM parameters are re-estimated with the common closeness degree of DTW to set up fuzzy closeness degree relationship between DTW algorithm and Viterbi algorithm, for which the δ-ε algorithm is presented to obtain parameter re-estimating form similar to HMM based on data frame. Then, in order to ensure correctness of the fuzzy closeness relationship between DTW algorithm and Viterbi algorithm, corresponding proof is given as a theorem. Thirdly, during the HMM parameter re-estimation with the decided DTW closeness degree expression, it is found that there exists fuzzy relationship between the DTW closeness degree re-estimating parameters and the HMM re-estimating parameters and it is proved as a theorem. Finally, the authors propose Dtw- Viterbi Ⅰ , Ⅱ, Ⅲ based on the above theorem, prove the correctness of them as a theorem and implement them in signer-independent sign language recognition. Experiment results show that introducing the path searching strategy of DTW algorithm in Viterbi algorithm in the form of probability can partly reduce the failures in signer-independent sign language recognition by reducing candidate vocabulary thus improving the signer-independent sign language recognition rate and speed in case of large vocabulary.
出处 《计算机研究与发展》 EI CSCD 北大核心 2010年第2期305-317,共13页 Journal of Computer Research and Development
基金 国家自然科学基金重点项目(60533030) 国家自然科学基金项目(60603023)~~
关键词 VITERBI算法 DTW算法 类别隶属度 广义概率 Dtw-ViterbiⅠ Ⅲ算法 隐MARKOV模型 模糊数学 ε-δ算法 Viterbi algorithm DTW algorithm category membership generalized probability Dtw- Viterbi Ⅰ , Ⅱ and Ⅲ algorithm HMM fuzzy math δ-ε algorithm
  • 相关文献

参考文献40

  • 1Grimes G J. Digital data entry glove interface device: US, 4414537[P], 1983.
  • 2Takahashi T, Kishino F. A hand gesture recognition method and its application [J]. Systems & Computers in Japan. 1992, 23(3): 38-48.
  • 3Takahashi T, Kishino F. Hand gesture coding based on experiments using a hand gesture interface device[J]. Sigchi Bulletin, 1991, 23(2): 67-74.
  • 4Takahashi T, Kishino F. A hand gesture recognition method and its application [J]. Trans of the Institute of Electronics, Information & Communication Engineers Dill, 1990, J73D- Ⅱ(12): 1985-1992.
  • 5Takahashi T, Kishino F. Gesture coding based in experiments with a hand gesture interface device [J]. SIGCHI Bulletin, 1991, 23(2): 67-73.
  • 6Lee C, Xu Y. Online, interactive learning of gestures for human/robot interfaces [J]. Proc of IEEE Int Conf on Robotics and Automation, 1996, 3(1): 30-42.
  • 7[美]米歇尔.机器学习[M].曾华军,等译.北京:机械工业出版社,2003.
  • 8[美]迪达,等.模式分类[M].第2版.李宏东,等译.北京:机械工业出版社,2003.
  • 9[美]海金.神经网络原理[M].第2版.叶世伟,等译.北京:机械工业出版社,2004.
  • 10Zhao M, Quek F K H, Wu Xindong. RIEVL: Recursive induction learning in hand gesture recognition[J]. IEEE Trans on Pattern Analysis & Machine Intelligence, 1998, 20 (11): 1174-85.

二级参考文献9

  • 1[1]WANG Chunli,GAO Wen.Re-sampling for Chinese sign language recognition by genetic algorithm[A].GW2005[C].[s.1.],2005.
  • 2[2]DENG J W,TSUI H T.A two-step approach based on PaHMM for the recognition of ASL[A].Proceedings of The Fifth Asian Conference on Computer Vision[C].Melbourne,Australia,2002.
  • 3[3]BAHLL R,BROWN P F,SOUZA P V,MERCER R L.Maximum mutual information estimation of hidden Markov model parameters for speech recognition[A].Proc.1986 Int.Conf.on Acoustics,Speech and Signal Processing[C].Tokyo,Japanl986.
  • 4[4]NORMANDIN Y.An improved MMIE training algorithm for speaker independent[A].Proc.ICASSP'91[C].Toronto,1991.
  • 5[5]SCHLUTER R,MACHEREY W,RULLER B,NEY H.Comparison of discriminative training criteria and optimization methods for speech recognition[J].Speech Communication,2001(34):287-310.
  • 6[6]ZHENG J,BUTZBERER J,FRANCO H.Scandinavia improved maximum mutual information estimation training of continuous density HMMs[J].Andreas Stolcke Speech Technology and Research Laboratory,2001,15(2):25-30.
  • 7[7]WOODLAND P C,POVEY D.Large scale discriminative training for speech recognition[J].In Proc.ITRW ASR[C].ISCA,2000.
  • 8[8]BAHL L R,PADMANABHAN M,NAHAMOO D,GOPALAKRISHNAN P S.An n-best candidates-based discriminative training for speech recognition Applications[J].IEEE Transactions on Speech and Audio Processing,1994,2(1):206-216.
  • 9[9]CHOW Y L.Maximum mutual information estimation of HMM parameters for continuous speech recognition using the N-Best algorithm[A].Proc.ICASSP'90[C].Albuquerque,1990.

共引文献8

同被引文献105

引证文献7

二级引证文献67

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部