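Analysis of the Relationship Between the Viterbi and DTW Algorithms: Application to Signer-Independent Sign Language Recognition (cited by: 7)

English title as published: Mapping Analysis Between Viterbi and DTW Algorithms: Application to the Identification of Signer Independent Sign Language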

Abstract: In classical pattern recognition theory, the Viterbi algorithm is the representative matching algorithm based on statistical probability, whereas the DTW algorithm is the representative matching algorithm based on template matching. Whether any relationship exists between the two has not been settled. To find such a relationship, the authors assume that "category membership" is a generalized probability and use fuzzy mathematics to connect the Viterbi and DTW algorithms. First, they propose a general closeness-degree expression that uses the closeness degree of fuzzy mathematics to convert the "distance" of DTW into the "probability" of Viterbi, and they prove this expression theoretically. Second, they re-estimate the HMM parameters with the general DTW closeness-degree expression to establish a fuzzy closeness-degree relationship between the DTW and Viterbi algorithms; for this purpose they propose the δ-ε algorithm, which yields a frame-based parameter re-estimation form similar to that of the HMM. Third, to ensure that this fuzzy closeness-degree relationship is correct, they state and prove it as a theorem. Fourth, while re-estimating the HMM parameters with the chosen DTW closeness-degree expression, they find a fuzzy relationship between the DTW closeness-degree re-estimation parameters and the HMM re-estimation parameters, which they also prove as a theorem. Finally, based on these theorems, they propose the Dtw-Viterbi Ⅰ, Ⅱ and Ⅲ algorithms, prove their correctness as theorems, and apply them to signer-independent sign language recognition. Experiments show that introducing the path search strategy of DTW into the Viterbi algorithm in probabilistic form can partly eliminate recognition errors in signer-independent sign language recognition by pruning the candidate word set, thereby improving both the recognition rate and the recognition speed for large vocabularies.
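The idea of turning a DTW "distance" into a probability-like score and using it to prune the candidate word set can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the record does not give the paper's closeness-degree expression, so `closeness` uses a simple exponential mapping as a stand-in, and the function names, the 1-D sequences, and the `keep` parameter are hypothetical. It is not the Dtw-Viterbi Ⅰ, Ⅱ or Ⅲ algorithm.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two 1-D sequences, local cost |x - y|."""
    n, m = len(a), len(b)
    # d[i][j] = cost of the best warping path aligning a[:i] with b[:j]
    d = [[math.inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def closeness(distance):
    """Hypothetical distance-to-score mapping into (0, 1].

    Assumption for illustration only; NOT the paper's closeness-degree expression.
    """
    return math.exp(-distance)


def prune_candidates(query, templates, keep):
    """Return the `keep` words whose templates are closest to the query."""
    scored = sorted(
        ((closeness(dtw_distance(query, t)), word) for word, t in templates.items()),
        reverse=True,
    )
    return [word for _, word in scored[:keep]]


# Hypothetical toy vocabulary: one template sequence per word.
templates = {"a": [1, 2, 3], "b": [5, 6, 7], "c": [1, 2, 4]}
shortlist = prune_candidates([1, 2, 2, 3], templates, keep=2)
```

In a pipeline of this shape, only the words in `shortlist` would be passed on to the more expensive HMM/Viterbi scoring, which is one way a smaller candidate set could improve speed for large vocabularies.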

Authors: Ni Xunbo (倪训博), Zhao Debin (赵德斌), Jiang Feng (姜峰), Cheng Dansong (程丹松)

Affiliation: School of Computer Science, Harbin Institute of Technology

Source: Journal of Computer Research and Development (《计算机研究与发展》), 2010, No. 2, pp. 305-317 (13 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Funding: Key Program of the National Natural Science Foundation of China (60533030); National Natural Science Foundation of China (60603023)

Keywords: Viterbi algorithm; DTW algorithm; category membership; generalized probability; Dtw-Viterbi Ⅰ, Ⅱ and Ⅲ algorithms; hidden Markov model (HMM); fuzzy mathematics; δ-ε algorithm

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]
