Abstract
Behavior recognition is an important and active research topic in computer vision. It has evolved from hand-crafted feature representations to deep learning feature representations. This paper categorizes and reviews the mainstream algorithms produced during the development of behavior recognition from two perspectives: traditional behavior recognition models and deep learning models. Traditional models mainly comprise feature description methods based on silhouettes, space-time interest points, human joint points, and motion trajectories; among these, the improved dense trajectory method offers good robustness and reliability. Deep learning architectures mainly include two-stream networks, 3D convolutional networks, and hybrid networks. First, the paper describes the main research ideas and innovations of each behavior recognition algorithm and introduces the model architecture, characteristics, and applicable scenarios of each class of algorithm. Then, widely used public behavior datasets are described by category, with the HMDB51 and UCF101 datasets introduced in detail, and the recognition performance of traditional methods and deep learning algorithms on each dataset is compared and analyzed. The comparison shows that traditional methods are not suited to recognizing fine-grained behaviors and do not generalize easily across datasets or scenes; among deep architectures, two-stream networks and 3D convolutional networks achieve relatively good recognition performance and are widely used. Finally, the paper discusses the future development of behavior recognition and points out several feasible research directions.
Authors
裴利沈
刘少博
赵雪专
PEI Lishen; LIU Shaobo; ZHAO Xuezhuan (School of Computer and Information Engineering, Henan University of Economics and Law, Zhengzhou 450046, China; School of Intelligent Engineering, Zhengzhou University of Aeronautics, Zhengzhou 450046, China)
Source
《计算机科学与探索》
CSCD
Peking University Core Journal (北大核心)
2022, Issue 2, pp. 305-322 (18 pages)
Journal of Frontiers of Computer Science and Technology
Funding
National Natural Science Foundation of China (61806073)
Henan Province Key Research, Development and Promotion Special Project (Science and Technology Research) (192102210097, 192102210126, 212102210160).
Keywords
human behavior recognition
deep learning
neural network
behavior dataset