基于压缩奖惩机制的视频行为识别方法研究被引量：1

Research on Video Behavior Recognition Method Based on Squeeze and Excitation Mechanism

下载PDF

导出

摘要大多数视频行为识别任务中都是同等处理网络提取到的时空信息,为了忽略无关信息而关注重点信息,本文设计了含有压缩奖惩机制的卷积神经网络结构,用于视频行为识别.该网络结构基于时间分段网络构建,首先将视频分为多个等长片段,从每个片段随机提取堆叠光流图像和RGB视频帧,将其分别输入到含有压缩奖惩机制的时间与空间双流卷积神经网络,通过压缩与奖惩操作,对网络提取到的特征进行加权,根据加权后的时间与空间特征分别在时间与空间两个通道上对行为作出初步预测;然后对每个片段的时间与空间初步预测结果分别融合,得到视频级预测结果;最后将视频级时间与空间预测结果融合,得到最终视频行为识别结果.在数据集UCF101与HMDB51上进行了实验,结果表明,与其他不含压缩奖惩机制的多种网络模型相比,该模型具有较高的准确率. In most video behavior recognition tasks,the temporal-spatio information extracted by the network is treated equally.In order to ignore the irrelevant information and focus on the key information,a convolutional neural network with squeeze and excitation mechanism was designed for video behavior recognition.The network was constructed based on a temporal segment network.Firstly,the video was divided into multiple equal-length segments,accordingly,stacked optical flow images and RGB video frames were extracted from each segment randomly.For each segment,the stacked optical flow image and RGB video frame were respectively input into the temporal and spatial two-stream convolutional neural network with squeeze and excitation mechanism.Furthermore,weights were added to the features extracted from the temporal and spatial convolution network by squeeze and excitation operation.Then,according to the weighted temporal and spatial features,the preliminary predictions of behavior were made on the temporal and spatial channels.The video-level predictions of temporal and spatial were obtained by merging the preliminary predictions of temporal and spatial for each segment.Finally,the video-level predictions of temporal and spatial were combined to obtain the final video behavior recognition result.Experiments were carried out on the datasets UCF101 and HMDB51.The results showed that the accuracy of the network was higher than many other networks without squeeze and excitation mechanism.

作者张丽红郭磊 ZHANG Lihong;GUO Lei(College of Physics And Electronic Engineering, Shanxi University, Taiyuan 030006, China)

机构地区山西大学物理电子工程学院

出处《测试技术学报》 2020年第5期418-424,共7页 Journal of Test and Measurement Technology

基金山西省科技攻关计划(工业)资助项目(2015031003-1)。

关键词视频行为识别压缩奖惩机制时间分段网络双流卷积网络特征融合 video action recognition squeeze and excitation mechanism temporal segment network two-stream convolution network feature fusion

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献5

1朱红蕾,朱昶胜,徐志刚.人体行为识别数据集研究进展[J].自动化学报,2018,44(6):978-1004. 被引量：35
2龚捷,罗聪,罗琴.基于注意力机制和残差网络的动作识别模型[J].电子测量技术,2021,44(14):111-116. 被引量：7
3梁绪,李文新,张航宁.人体行为识别方法研究综述[J].计算机应用研究,2022,39(3):651-660. 被引量：18
4黄志强,李军.基于空间通道注意力机制与多尺度融合的交通标志识别研究[J].南京邮电大学学报（自然科学版）,2022,42(2):93-102. 被引量：7
5王志强.一种基于三维残差网络分组膨胀卷积的人体行为识别方法[J].现代计算机,2022,28(5):65-70. 被引量：1

引证文献1

1李建平,赖永倩.基于注意力机制和残差网络的视频行为识别[J].计算机技术与发展,2023,33(4):69-74.

1涂璟,李明.基于“雨课堂”的教学模式设计与实践——以森林生态学课程为例[J].现代农业科技,2020(18):250-252. 被引量：2
2何鑫,许娟,金莹莹.行为关联网络:完整的变化行为建模[J].计算机科学,2020,47(9):123-128.
3金正康,秦工,李朝阳,李申.光流定位自主无人机的研究与应用[J].电子测试,2020,31(19):52-55. 被引量：4
4付春梅.互助保障好贴心关爱行动落地行[J].中国工会财会,2020(9):41-41.
5刘星红.无人机倾斜摄影技术在地理空间信息数据生产中的应用[J].经纬天地,2020(4):39-43. 被引量：3
6李寒露,解庆,唐伶俐,刘永坚.融合时空信息和兴趣点重要性的POI推荐算法[J].计算机应用,2020,40(9):2600-2605. 被引量：4
7王瑜铬,任磊.贸易摩擦对美国来华游客数量的影响:实证与预测[J].旅游论坛,2020,13(4):62-72. 被引量：1
8邵轩,张良,时绿艳.基于时空大数据的江淮生态经济区经济发展质量综合评价[J].现代测绘,2020,43(2):27-30. 被引量：1
9刘望华,刘光帅,陈晓文,李旭瑞.结合微分特征和Haar小波分解的鲁棒纹理表达[J].计算机应用,2020,40(9):2728-2736. 被引量：18
10田世吉,李忠义,鲁敏,李辉.双馈风电机组双通道扭振主动控制策略研究[J].现代电子技术,2020,43(19):84-87. 被引量：1

测试技术学报

2020年第5期

浏览历史

内容加载中请稍等...

基于压缩奖惩机制的视频行为识别方法研究被引量：1

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于压缩奖惩机制的视频行为识别方法研究 被引量：1

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于压缩奖惩机制的视频行为识别方法研究被引量：1