期刊文献+

多尺度卷积特征融合的SSD手势识别算法 被引量:7

SSD Gesture Recognition Algorithm with Multi-scale Convolution Feature Fusion
下载PDF
导出
摘要 为了提高对中小占比手势识别的准确性与稳定性,提出了一种多尺度卷积特征融合的SSD(single shot multibox detector)手势识别方法。该方法突出表现在两大方面,其一,在原始的SSD算法的多尺度卷积检测方法基础上,引入了不同卷积层的特征融合思想,经过空洞卷积下采样操作与反卷积上采样操作,实现网络结构中的浅层视觉卷积层与深层语义卷积层的融合,代替原有的卷积层用于手势识别,以提高模型对中小目标手势的识别精度;其二,为了解决正负样本不均衡导致分类性能差的问题,提出一种改进的损失函数,以提升模型对目标手势的分类能力。在手势识别公开的数据集上的实验结果表明,与SSD和Faster R-CNN等识别方法相比,能够在保持较高的手势检测精度的同时,又具有较好的鲁棒性与检测速度。 To improve the accuracy and stability of small-medium proportion gesture recognition,SSD(single shot multibox detector)gesture recognition algorithm with multi-scale convolution feature fusion is proposed.Two aspects are highlighted in this method.On the one hand,based on the multi-scale convolution detection method of the original SSD algorithm,the feature fusion mechanism of different classification layers is introduced.Through the dilated convolution down sampling operation and the deconvolution up sampling operation,the shallow visual feature layer and the deep semantic feature layer in the network structure are organically combined to replace the original convolution layer for gesture recognition to improve the semantic representation ability of the model.On the other hand,to solve the problem of poor classification performance caused by imbalance of positive and negative samples,an improved loss function is proposed.Experiments on the open data set of gesture recognition show that compared with SSD,Faster R-CNN and other recognition methods,the proposed method has better robustness and detection speed while maintaining higher gesture detection accuracy.
作者 谢淋东 仲志丹 乔栋豪 高辛洪 XIE Lin-dong;ZHONG Zhi-dan;QIAO Dong-hao;GAO Xin-hong(School of Mechanical and Electrical Engineering,Henan University of Science&Technology,Luoyang 471003,China)
出处 《计算机技术与发展》 2021年第3期100-105,共6页 Computer Technology and Development
基金 国家重点研发计划(2018YFB1701205) 国家级大学生创新创业训练项目(201910464002)。
关键词 多尺度卷积特征 中小占比手势 空洞卷积 反卷积 特征融合 改进的损失函数 multi-scale convolution features small-medium proportion gesture dilated convolution deconvolution feature fusion improved loss function
  • 相关文献

参考文献9

二级参考文献91

  • 1李连仲,王小虎,蔡述江.捷联惯性导航、制导系统中方向余弦矩阵的递推算法[J].宇航学报,2006,27(3):349-353. 被引量:17
  • 2陈文.基于加速度传感器的智能终端手势识别关键技术研究[D].国防科学技术大学2011
  • 3Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60 (2) 91 110.
  • 4Dalai N, Triggs B. Histograms of oriented gradients for human detection[C]//Computer Vision and Pattern Recognition (CVPR), IEEE Computer Society Conference on. San Diego, USA: IEEE, 2005, 1 886-893.
  • 5Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks[J]. Science, 2006, 313(5786) : 504-507.
  • 6Hubel D H, Wiesel T N. Receptive fields, binocular interaction and functional architecture in the catrs visual cortex[J]. The Journal of Physiology, 1962, 160(1): 106-154.
  • 7Fukushima K, Miyake S. Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in posi- tion[J]. Pattern Recognition, 1982, 15(6): 455-469.
  • 8Ruck D W, Rogers S K, Kabrisky M. Feature selection using a multilayer perceptron[J]. Journal of Neural Network Com- puting, 1990, 2(2): 40-48.
  • 9Rumelhart D E, Hinton G E, Williams R J. Learning representations by back-propagating errors[J]. Nature, 1986,3231 533 538.
  • 10LeCun Y, Denker J S, Henderson D, et al. Handwritten digit recognition with a back-propagation network[C]//Advances in Neural Information Processing Systems. Colorado, USA Is. n. ], 1990: 396-404.

共引文献816

同被引文献73

引证文献7

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部