期刊文献+

基于改进NeXtVLAD的视频分类 被引量:1

Video classification based on improved NeXtVLAD
下载PDF
导出
摘要 为提高长视频分类精度并减少训练时占用显存,提出基于NeXtVLAD改进的长视频分类模型。将人脸识别领域的ghost聚类中心思想迁移到视频分类,通过加入ghost聚类中心降低无关采样帧的干扰,提高模型识别准确率,针对长视频分类提出多尺度的视频帧采样方法。采用预训练模型ResNet50提取采样帧的深度特征,在训练时冻结特征提取网络参数,减少训练时的计算量。在VideoNet数据集的前100个类别上进行实验,实验结果表明,该模型与现有相关模型相比取得了更好的分类效果。 To increase the accuracy of long video classification and reduce the occupation of video memory during training,an improved long video classification model based on NeXtVLAD was proposed.The idea of ghost clustering center in the field of face recognition was transferred to video classification.By adding ghost clustering center to reduce the interference of irrelevant sampling frames,the accuracy of model recognition was improved.A multi-scale video frame sampling method was proposed for long video classification.The pre-training model ResNet50 was used to extract the depth feature of the sampling frame,and the feature was freezed to extract the network parameters during the training to reduce the calculation amount during the training.Experimental results show that the proposed model has better classification effects than the existing correlation model.
作者 陈意 黄山 CHEN Yi;HUANG Shan(College of Electrical Engineering,Sichuan University,Chengdu 610065,China)
出处 《计算机工程与设计》 北大核心 2021年第3期749-754,共6页 Computer Engineering and Design
关键词 深度学习 视频分类 局部聚合描述子向量 特征融合 卷积神经网络 deep learning video classification vector of locally aggregated descriptors feature fusion convolutional neural network
  • 相关文献

同被引文献6

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部