摘要
时空行为检测是计算机视觉领域重要的研究方向,为了减小模型体量,提高检测速度,提出一种基于协同卷积(collective convolution, CoConv)的轻量级行为检测方法。将视频的时序信息转换为时空图像(spatio-temporal image, STI),利用协同卷积获取相同位置不同时间的时空特征信息。在YOLOv5的基础上将骨干网络和检测头部替换为协同卷积模块构建时空行为检测网络结构,通过后处理对时空图像的检测结果进行连接,快速形成视频结果,提高网络的行为检测性能。实验结果表明,提出的方法可以在保证准确率和不增加参数量的情况下,减少网络计算量,提高网络检测速度,且优于现有的行为检测方法。
Spatio-temporal behavior detection is an important research direction in the field of computer vision.To reduce the size of models and improve the detection speed,we propose a lightweight action detection method based on collective convolution(CoConv).The temporal information of the video is converted into spatio-temporal image(STI),and the spatio-temporal feature information of the same position at different times is obtained by Collective Convolution.Based on YOLOv5,the backbone network and detection head are replaced by collective convolution modules to construct the spatio-temporal action detection network structure.Detection results of spatio-temporal images are connected through post-processing to quickly form video results and improve the performance of network action detection.Experimental results demonstrate that our method can reduce network computational complexity,enhance detection speed,and outperform existing behavior detection methods while maintaining accuracy and not increasing the number of parameters.
作者
陈欣悦
高陈强
陈旭
黄思翔
CHEN Xinyue;GAO Chenqiang;CHEN Xu;HUANG Sixiang(School of Communications and Information Engineering,Chongqing University of Posts and Telecommunications,Chongqing 400065,P.R.China;Chongqing Key Laboratory of Signal and Information Processing,Chongqing 400065,P.R.China)
出处
《重庆邮电大学学报(自然科学版)》
CSCD
北大核心
2024年第1期136-144,共9页
Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition)
基金
国家自然科学基金(62176035)
重庆市教委科学技术研究项目(KJZD-K202100606)~~。
关键词
深度学习
时空行为检测
轻量级
协同卷积
时空图像
deep learning
spatiotemporal action detection
lightweight
collective convolution
spatio-temporal image