期刊文献+

基于手语视觉单词特征的手语字母识别研究 被引量:1

Study of Sign Language Alphabet Recognition Based on Sign Language Visual Word Features
下载PDF
导出
摘要 为有效识别手语字母,提出一种手语视觉单词(SLVW)的识别方法。采用Kinect获取手语字母视频及其深度信息,在深度图像中,通过计算获得手语手势的主轴方向角和质心位置以调整搜索窗口,利用基于深度图像信息的DI_CamShift方法对手势进行跟踪,进而使用基于深度积分图像的Ostu方法分割手势,并提取其尺度不变特征变换数据。将局部特征描述子表示的图像小区域量化生成SLVW,统计一幅手语图像中的视觉单词频率,用词包模型表示手语字母,并用支持向量机进行识别。实验结果表明,该方法不受颜色、光照和阴影的干扰,具有较高的识别准确性和鲁棒性,对复杂背景手语视频中的30个手语字母的平均识别率达到96.21%。 In order to effectively recognize the sign language alphabet, this paper presents an algorithm based on Sign Language Visual Word(SLVW). It uses Kinect to obtain the video and depth image information of sign language gestures, calculates spindle direction angle and mass center position of the depth image to adjust the search window and for gesture tracking which depends on depth image information DI_CamShift. An Ostu method based on depth integral image is used to gesture segmentation, and the Scale Invariant Feature Transform(SIFT) data are extracted. It generates SLVW from small regions represented by local feature descriptors. After counting the frequency of visual words in a sign language alphabet image, it builds Bag of Words(BoW) to describe manual alphabets and uses Support Vector Machine(SVM) for recognition. Experimental results show that this method has high recognition accuracy and good robustness. Meanwhile, all of color, light and shadow have no effect on it. The average recognition rate of 30 sign language alphabets in the sign language video under complex background is 96.21%.
作者 杨全 彭进业
出处 《计算机工程》 CAS CSCD 2014年第4期192-197,202,共7页 Computer Engineering
基金 国家自然科学基金资助项目(61075014) 高等学校博士学科点专项科研基金资助项目(20116102110027)
关键词 手势跟踪 手语视觉单词 Ostu方法 深度图像 词包 手语字母 gesture tracking Sign Language Visual Word(SLVW) Ostu method depth image Bag of Words(BoW) sign languagealphabet
  • 相关文献

参考文献15

  • 1Wachs J P, Kolsch M, Stem H, et al. Vision-based Hand- gesture Applications[J]. Communications of the ACM, 2011, 54(2): 60-72.
  • 2Ren Zhou, Yuan Junsong, Zhang Zhengyou. Robust Hand Gesture Recognition Based on Finger-earth Mover's Distance with a Commodity Depth Camera[C]//Proc. of the 19th ACM Intemational Conference on Multimedia. New York, USA: ACM Press, 2011 : 1093-1096.
  • 3Doliotis P, Stefan A, Murrough C, et al. Comparing Gesture Recognition Accuracy Using Color and Depth Infor- mation[C]//Proc, of the 4th International Conference on Pervasive Technologies Related to Assistive Environments. New York, USA: ACM Press, 2011 : 123-133.
  • 4杨筱林,姚鸿勋.基于多尺度形状描述子的手势识别[J].计算机工程与应用,2004,40(32):76-78. 被引量:3
  • 5张良国,高文,陈熙霖,陈益强,王春立.面向中等词汇量的中国手语视觉识别系统[J].计算机研究与发展,2006,43(3):476-482. 被引量:11
  • 6姜峰,高文,姚鸿勋,赵德斌,陈熙霖.非特定人手语识别问题中的合成数据驱动方法[J].计算机研究与发展,2007,44(5):873-881. 被引量:5
  • 7Deng J W. A Two-step Approach Based on HMM for the Recognition of ASL[C]//Proc. of Asian Conference on Computer Vision. Melbourne, Australia: [s. n.], 2002: 126- 131.
  • 8Chen Qing, Georganas N D, Petriu E M. Real-time Vision- based Hand Gesture Recognition Using Haar-like Features[C]//Proc. of Instrumentation and Measurement Technology Conference. Warsaw, Poland: [s. n.], 2007: 1-6.
  • 9Silanon K, Suvonvorn N. Hand Motion Analysis for Thai Alphabet Recognition Using HMM[J]. International Journal of Information and Electronics Engineering, 2011, 1 (1): 65-71.
  • 10Jon O E, Helen C, Nicolas P, et al. Sign Language Recognition Using Sequential Pattern Trees[C]//Proc. of IEEE Conference on Computer Vision and Pattern Recognition. Rhode Island, USA: [s. n.], 2012: 2200-2207.

二级参考文献98

  • 1刘健庄,栗文青.灰度图象的二维Otsu自动阈值分割法[J].自动化学报,1993,19(1):101-105. 被引量:355
  • 2郝颖明,朱枫.2维Otsu自适应阈值的快速算法[J].中国图象图形学报(A辑),2005,10(4):484-488. 被引量:120
  • 3SEZGIN M, SANKUR B. Survey over image thresholding techniques and quantitative performance evaluation [ J ]. Journal of Electronic Imaging, 2004,13 ( 1 ) : 146-165.
  • 4OTSU N. A threshold selection method from gray-level histograms [J]. IEEE Transaction on System Man and Cybernetic, 1979,9( 1 ) :62-66.
  • 5VIOLA P , JONES M . Rapid object detection using a boosted cascade of simple features [ C ]. Proc. CVPR. Volume I. (2001) :511-518.
  • 6MICHAEL G, HELMUT G, HORST B. Fast approximated SIFT[C]. Proc. ACCV. (2006):918-927.
  • 7Vailaya A, Figueiredo M, Jain A, et al. Image classification for content-based indexing [J].IEEE Transactions on hnage Processing, 2001, 10( 1 ) : 117-130.
  • 8Oliva A, Torralba A. Modeling the shape of the scene: A holistic representation of the spatial envelope [ J]. International Journal of Computer Vision, 2001, 42 ( 3 ) : 145-175.
  • 9Naphade M, Huang T. A probabilistic framework for semantic video indexing, filtering and retrieval [ J]. IEEE Transaction on Multimedia, 2001, 3 ( 1 ) : 141 - 151.
  • 10Serrano N, Savakis A, Luo J. A computationally efficient approach to indoor/outdoor scene classification [ C ] // Proceedings of the 16th International Conference on Pattern Recognition. Quebec, Canada,IEEE Press, 2002: 146-149.

共引文献74

同被引文献1

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部