基于深度残差网络的人体行为识别算法研究被引量：1

Research on Human Action Recognition Algorithm Based on Deep Residual Network

下载PDF

导出

摘要针对原始C3D卷积神经网络的层数较少、参数量较大和难以关注关键帧而导致的人体行为识别准确率较低的问题,提出一种基于改进型C3D的注意力残差网络模型;首先,增加原始网络卷积层并采用卷积核合并与拆分操作实现(3×1×7)和(3×7×1)的非对称式卷积核,之后采用全预激活式残差网络结构来增加构建的非对称卷积层,并且在残差块中增加时空通道注意力模块;最后,为展示该算法的先进性和应用性,则将该算法与原始C3D网络以及其他流行算法分别在基准数据集HMDB51和自建的43类别体育运动数据集上相比较;实验结果表明,该算法与原始C3D网络相比,在HMDB51和43类体育运动数据集上分别提高了9.88%和21.61%,参数量比原来降低了38.68%,并且结果也优于其他流行算法。 Aiming at the problem that the original C3 D convolutional neural network has a small number of layers,a large amount of parameters,and the difficulty of focusing on key frames lead to the low accuracy of human behavior recognition,an improved C3 D-based attention residual network model is proposed.First,adds the original network convolution layer and uses the convolution kernel merge and splits operation to realize the asymmetric convolution kernel of(3×1×7) and(3×7×1),and then the fully pre-activated residual network structure is used to increase the constructed asymmetric convolutional layer,and the spatiotemporal channel attention module is added to the residual block.Finally,in order to demonstrate the advancement and applicability of the algorithm,the algorithm is compared with the original C3 D network and other popular algorithms on the benchmark data set HMDB51 and the self-built 43 categories sports data set.Experimental results show that compared with the original C3 D network,the algorithm has increased by 9.88% and 21.61% on the HMDB51 and 43 types of sports data sets,respectively,and the quantity of parameters has been reduced by 38.68%,and the results of the algorithm are better than that of other popular algorithms.

作者冯宇席志红 FENG Yu;XI Zhihong(College of Information and Communication Engineering,Harbin Engineering University,Harbin 150001,China)

机构地区哈尔滨工程大学信息与通信工程学院

出处《计算机测量与控制》 2022年第3期251-258,共8页 Computer Measurement &Control

基金国家自然科学基金资助项目(60875025)。

关键词深度学习三维卷积非对称式卷积核残差网络注意力模块人体行为识别 deep learning three-dimensional convolution asymmetric convolution kernel residual network attention module human behavior recognition

分类号 TP391 [自动化与计算机技术—计算机应用技术]