基于编码视频的动态手势数据优化与识别

Dynamic Gesture Data Optimization and Recognition Based on Encoded Video

导出

摘要编码视频数据流中的运动矢量和残差等语法元素可用于替代光流进行运动表示,但其固有的像素噪声和特征稀疏性会影响精细动作的识别精度。对此,在对编码视频语法元素进行数据优化的基础上,设计了一个高精度、低复杂度的动态手势识别框架。首先,提出了关键P帧选择方法,通过选择信息量更高的编码帧解决了特征稀疏性问题;其次,提出了联合残差特征表示方法,利用残差得到精细的手势轮廓图,去除了运动矢量中手部以外的像素噪声;最后,设计了一种轻量而高效的动态手势识别模型,利用优化后的运动矢量和残差获得了类似于光流的计算效果。在viva,sheffield klnect gesture,NvGesture和EgoGesture等数据集上对所提方法进行了验证,实验结果显示,所提方法中仅使用RGB数据模式可达到的识别精度分别为82.94%、99.72%、81.12%和90.48%,降低了89%的存储开销,并且以4.7倍的运行速度获得了与先进方法相近的结果。 The syntax elements such as motion vectors(MVs)and residuals in encoding video data streams can substitute for optical flow in motion representation.But its inherent pixel noise and feature sparsity may also lead to some errors when fine movements are recognized.Hence,a dynamic gesture recognition framework is designed to get higher-precision and lower-complexity by using the data optimization of syntax elements in coding video.First,a key P-frame selection strategy is introduced to cope with the feature sparsity by selecting encoding frames which cover higher information content.Second,a joint residual feature representation method is proposed to remove the noisy MV not associated with the hand by using finer gesture contour maps obtained from residuals.Finally,a lightweight and efficient dynamic gesture recognition model is designed,leveraging optimized MVs and residuals to achieve a computation effect similar to optical flow.The proposed method is validated on datasets such as Viva dataset,sheffield klnect gesture(SKIG)dataset,NvGesture dataset,and EgoGesture dataset.The results of the experiments show that while using only RGB data,the recognition accuracy of the method mentioned was 82.94%,99.72%,81.12%and 90.48%respectively,reducing storage overhead by 89%and achieving results comparable to the advanced methods with a running speed 4.7 times faster.

作者谢晓燕曹盘宇夏浩陈雨馨 XIE Xiaoyan;CAO Panyu;XIA Hao;CHEN Yuxin(School of Computer,Xi'an University of Posts and Telecommunications,Xi'an 710121,China)

机构地区西安邮电大学计算机学院

出处《北京邮电大学学报》 EI CAS CSCD 北大核心 2024年第2期90-96,共7页 Journal of Beijing University of Posts and Telecommunications

基金科技创新2030-“新一代人工智能”重大项目(2022ZD0119001) 国家自然科学基金项目(61834005,61772417)。

关键词动态手势识别编码视频运动矢量残差数据优化 dynamic gesture recognition encoded video Motion Vector residual data optimization

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1谢晓燕,赵欢,蒋林.基于视频数据特性的动态手势识别[J].北京邮电大学学报,2020(5):91-97. 被引量：3

二级参考文献1

1杨真真,匡楠,范露,康彬.基于卷积神经网络的图像分类算法综述[J].信号处理,2018,34(12):1474-1489. 被引量：105

共引文献2

1周思昀,施水才.面向网页交互场景下的手势识别改进算法研究[J].通信技术,2021,54(4):1028-1034.
2梁瑞.基于手势识别的设备音量控制系统的研究与实现[J].家电维修,2023(11):52-55.

1曲海成,李竹媛,刘万军.基于脉冲神经网络的时空交互图像分类[J].计算机系统应用,2024,33(5):162-169.
2王淑华.现代农业设备工程中大数据技术的应用分析[J].河北农机,2024(9):34-36.
3黎贝卡.今年的流行色来啦!普通人能穿吗[J].生活潮,2023(1):54-57.
4观点[J].服务外包,2024(6):8-9.
5Xiao Han,Fang Liu.Lipschitz Regularity of Viscosity Solutions to the Infinity Laplace Equation[J].Journal of Applied Mathematics and Physics,2023,11(10):2982-2996.
6胥明凯,朱坤双,李元良,杨啸帅,秦挺鑫,王皖.电力作业多源要素风险的自适应识别模型[J].清华大学学报（自然科学版）,2024,64(6):1047-1059. 被引量：1
7暮千寻.为美丽献身的胭脂虫,红颜薄命身价却堪比黄金[J].知音（海外版）,2023(11):26-29.
8Yuchang Si.Improved Long Short-term Memory Network for Gesture Recognition[J].IJLAI Transactions on Science and Engineering,2024,2(2):5-12.
9无,许诺(译).错买错卖摩根大通与希腊初创公司对簿公堂[J].商业周刊（中文版）,2024(10):22-23.
10Sharmin Akter Milu,Azmath Fathima,Tanmay Talukder,Inzamamul Islam,Md. Ismail Siddiqi Emon.Design and Implementation of Hand Gesture Detection System Using HM Model for Sign Language Recognition Development[J].Journal of Data Analysis and Information Processing,2024,12(2):139-150.

北京邮电大学学报

2024年第2期

浏览历史

内容加载中请稍等...

基于编码视频的动态手势数据优化与识别

参考文献1

二级参考文献1

共引文献2

相关作者

相关机构

相关主题

浏览历史