期刊文献+

基于长短时记忆和深度神经网络的视觉手势识别技术 被引量:4

Visual gesture recognition technology based on long short term memory and deep neural network
下载PDF
导出
摘要 针对基于视觉的动态手势识别易受光照、背景和手势形状变化影响等问题,在分析人体手势空间上下文特征的基础上,首先建立一种基于人体骨架和部件轮廓特征的动态手势模型,并采用卷积姿势机和单发多框检测器技术构造深度神经网络进行人体手势骨架和部件轮廓特征提取。其次,引入长短时记忆网络提取动态人体手势中骨架、左右手和头部轮廓的时序特征,进而分类识别手势。在此基础上,设计了一种空间上下文与时序特征融合的动态手势识别机(GRSCTFF),并通过交警指挥手势视频样本库对其进行网络训练和实验分析。实验证明,该系统可以快速准确识别动态交警指挥手势,准确率达到94.12%,并对光线、背景和手势形状变化具有较强的抗干扰能力。 Aiming at the problem that visual gesture recognition is susceptible to light conditions, background information and changes in gesture shape, this paper analyzed the spatial context features of human gestures. First, this paper established a dynamic gesture model based on the contour features of human skeleton and body parts. The convolutional pose machine(CPM) and the single shot multibox detector(SSD) technology were utilized to build deep neural network, so as to extract the contour features of human gesture skeleton and body parts. Next, the long short term memory(LSTM) network was introduced to extract the temporal features of skeleton, left and right hand, and head contour in dynamic human gestures, so as to further classify and recognize gestures. On this basis, this paper designed a dynamic gesture recognizer based on spatial context and temporal feature fusion(GRSCTFF), and conducted network training and experimental analysis on GRSCTFF through the video sample database of traffic police command gestures. The experimental results show that GRSCTFF can quickly and accurately recognize the dynamic traffic police command gestures with an accuracy of 94.12%, and it has strong anti-interference ability to light, background and gesture shape changes.
作者 何坚 廖俊杰 张丞 魏鑫 白佳豪 王伟东 HE Jian;LIAO Jun-jie;ZHANG Cheng;WEI Xin;BAI Jia-hao;WANG Wei-dong(Software and System Engineering Technology Center,Beijing 100124,China;Faculty of Information,Beijing University of Technology,Beijing 100124,China)
出处 《图学学报》 CSCD 北大核心 2020年第3期372-381,共10页 Journal of Graphics
基金 国家自然科学基金项目(61602016) 北京市科技计划项目(D171100004017003)。
关键词 手势识别 空间上下文 长短时记忆 特征提取 gesture recognition spatial context long short term memory feature extraction
  • 相关文献

参考文献5

二级参考文献22

  • 1朱继玉,王西颖,王威信,戴国忠.基于结构分析的手势识别[J].计算机学报,2006,29(12):2130-2137. 被引量:26
  • 2克初 田斌.语音信号处理[M].北京:国防工业出版社,2000..
  • 3Jacob R J K. Eye-movement-based human computer interaction techniques:Toward non command interfaces//Proceedings of the Advances in Human-Computer Interaction, Ablex Publishing Corporation. Norwood, New Jersey, 1993: 151- 190.
  • 4Kato H, Billinghurst M, Poupyrev I. Virtual object manipulation on a table top AR environment//Proceedings of the ISAR2000. Munich, 2000: 111-119.
  • 5Kjeldsen R, Levas A, Pinhanez C. Dynamically reconfigutable vision based user interfaces. Machine Vision and Applications, 2004, 16(1): 6-12.
  • 6Wu Y, Huang T S. Vision-based gesture recognition: A review//Proceedings of the Gesture Workshop. Gifsur Yvette, France, 1999:103-115.
  • 7Kolsch M. Vision based hand gesture interfaces for wearable computing and virtual environments[Ph. D. dissertation]. University of California, Santa Barbara, 2004.
  • 8Wichens C D, Hollands J. Engineering Psychology and Human Performance. New Jersey: Prentice Hall, Inc. , 2003: 82-133.
  • 9Buxton W. A three-state model of graphical input//Proceedings of the Human Computer Interaction-INTERACT ' 90. Amsterdam, North-Holland, 1990:449- 456.
  • 10Ashbrook A P, Thacker N A, Rockett P I. Pairwise geometric histograms: A scaleable solution for the recognition of 2D rigid shape//Proceedings of the 9th Scandinavian Conference on Image Analysis. Uppsala, Sweden, 1995, (1):271-278.

共引文献95

同被引文献27

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部