期刊文献+

基于时空信息融合的人体行为识别研究 被引量:1

Research on Human Behavior Recognition Based on Temporal and Spatial Information Fusion
下载PDF
导出
摘要 在视频理解任务中,人体行为识别是一个重要的研究内容,但视频序列中存在时空信息融合困难、准确率低等问题。针对这些问题,提出一种基于时空信息融合的双流时空残差卷积网络模型。将视频分段采样提取RGB图像和光流图像,并将其输入到双流时空残差网络,通过设计的时空残差模块提取视频的深度时空特征,将每个视频片段的类别结果加权融合得到行为类别。提出的双流时空残差模块引入了少量的三维卷积和混合注意力机制,能够同时获取不同尺度的时空信息并且抑制无效信息,可以有效平衡时空信息的捕捉和计算量问题,并且提升了精度。实验基于TSN网络模型,在UCF101数据集上进行验证,实验结果表明提出的模型比原TSN网络模型的精准度提高了0.9个百分点,有效地提高了网络的时空信息捕获效率。 In video comprehension task, human behavior recognition is an important research content, but the temporal and spatial information fusion in video sequence is difficult and the accuracy is low. To solve these problems, this paper proposes a two-stream spatio-temporal residual convolution network model based on spatio-temporal information fusion.Firstly, RGB images and optical flow images are extracted from segmented video samples, and then are input into the twostream spatio-temporal residual network. The depth spatio-temporal features of the video are extracted by the designed spatio-temporal residual module. Finally, the category results of each video segment are weighted and fused to obtain the behavior category. The two-stream space-time residual module proposed in this paper introduces a small amount of threedimensional convolution and mixed attention mechanism, which can simultaneously obtain spatio-temporal information of different scales and suppress invalid information. It can effectively balance the problem of capturing and calculating spatio-temporal information, and improve the accuracy. The experiment is based on TSN network model and verified on UCF101 data set. Experimental results show that the accuracy of the proposed model is improved by 0.9 percentage points compared with the original TSN network model, and the efficiency of spatio-temporal information capture is effectively improved.
作者 于海港 何宁 刘圣杰 韩文静 YU Haigang;HE Ning;LIU Shengjie;HAN Wenjing(Beijing Key Laboratory of Information Service Engineering,College of Smart City,Beijing Union University,Beijing 100101,China)
出处 《计算机工程与应用》 CSCD 北大核心 2023年第3期202-208,共7页 Computer Engineering and Applications
基金 国家自然科学基金(61872042,61572077) 北京市教委科技计划重点项目(KZ201911417048) 北京市教委科技计划面上项目(KM202111417009) 北京联合大学人才强校优选计划(BPHR2020AZ01,BPHR2020EZ01) 北京联合大学科研项目(ZK50202001) 北京联合大学研究生科研创新资助项目(YZ2020K001)。
关键词 行为识别 双流网络 残差结构 注意力机制 时序信息 behavior recognition two stream network residual structure attentional mechanism temporal information
  • 相关文献

参考文献2

二级参考文献3

共引文献18

同被引文献8

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部