Gesture recognition method with separable long short-term attention networks (cited by: 2)

Abstract: In human-computer interaction, most gesture recognition algorithms cannot effectively eliminate the influence of the capture background on the gesture region to be extracted, and accurately modeling gesture motion information is also difficult. To address these problems, a method based on a depthwise separable residual convolutional Long Short-Term Memory (LSTM) network was proposed to model and recognize the feature information of dynamic gestures. First, conventional 3D convolution was applied to the input video frames for preliminary feature extraction, using a relatively large convolution kernel to enlarge the receptive field. Then, separable convolutional residual operations re-extracted features from the shallow features in order to extract and model high-dimensional features. Finally, the features from the first two stages were passed through 3D pooling and fed into an LSTM network, which modeled the temporal information of the video data, with an attention mechanism introduced at the input. Experimental results on a large-scale isolated gesture dataset show that the accuracy of the proposed method is 21.02 percentage points higher than that of the original MFSK (Mixed Features around Sparse Keypoints) + BoVW (Bag of Visual Words) + SVM (Support Vector Machine) network.
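To illustrate why a separable convolution stage is cheaper than a conventional 3D convolution, the following minimal sketch compares weight counts for the two layer types. It assumes the common depthwise-plus-pointwise factorization with a cubic kernel; the abstract does not specify the exact factorization or layer sizes used in the paper, so the function names and the channel and kernel values below are hypothetical.

```python
def conv3d_params(c_in, c_out, k, bias=False):
    """Weight count of a standard 3D convolution with a k x k x k kernel."""
    return c_in * c_out * k ** 3 + (c_out if bias else 0)


def separable_conv3d_params(c_in, c_out, k, bias=False):
    """Weight count of a depthwise (k x k x k per input channel) convolution
    followed by a pointwise (1 x 1 x 1) convolution."""
    depthwise = c_in * k ** 3 + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration (not from the paper).
standard = conv3d_params(c_in=64, c_out=128, k=3)
separable = separable_conv3d_params(c_in=64, c_out=128, k=3)
print(standard, separable, round(standard / separable, 1))
# prints: 221184 9920 22.3
```

Under these assumed sizes the separable layer needs roughly 22 times fewer weights, which is the usual motivation for stacking such layers in a residual feature extractor.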

Authors: GU Ming; LI Yiqun; ZHANG Erchao; ZHANG Xunlei; QI Lin; TIE Yun

Affiliations: Henan Communications Investment Group Company Limited, Zhengzhou Henan 450016, China; Zhengzhou Branch, Zhongxun Post & Telecommunication Consulting & Design Institute Company Limited, Zhengzhou Henan 450000, China; School of Information Engineering, Zhengzhou University, Zhengzhou Henan 450001, China

Source: Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core Journal, 2022, Issue S01, pp. 59-63 (5 pages)

Keywords: deep residual network; separable convolution; Long Short-Term Memory (LSTM) network; dynamic gesture recognition; attention mechanism

Classification code: TP37 [Automation and Computer Technology: Computer System Architecture]
