一种新的基于三维卷积共生梯度直方图和多示例学习的特殊视频检测算法被引量：7

A New Special Video Detection Algorithm Based on 3D Convolution CoHOG and MIL

下载PDF

导出

摘要已有的基于梯度方向直方图信息的视频内容检测算法侧重在二维的视频帧上提取特征,忽略了视频内容在时间维度上的相关性.提取局部梯度间潜在的共生关系特征可一定程度上提高算法的检测准确率;同时,对相邻特征池化可有效减少特征降维过程中的信息丢失.基于此,利用视频帧间结构信息通过卷积运算构建共生梯度直方图的三维结构,然后对相邻特征池化实现描述特征的有效降维,解决了忽略帧间信息影响识别准确率以及高维度特征难以训练的问题;将视频特征映射到多示例学习中的示例和包,非常容易地实现了对不同长度视频的检测.在公开测试数据集Hockey、Movie上进行测试,实验结果显示,Hockey数据集上算法的检测准确率高于现有最优算法3%,Movie数据集上的检测准确率高于现有最优算法0.5%,验证了新特征与算法的有效性. Existing video content detection algorithms based on gradient direction histogram information are focused on the features extracted from the single two-dimensional video frames,ignored the correlation of the video frames on the time dimension.The frames in the video are inseparable whole.All consecutive frames could express true and complete semantics.The extracted information contained in video is inaccurate if only consider key frames.The correlation contain semantic information of video,is import for video content detection.And the potential symbiotic relationship between local gradient direction features is beneficial to the improvement of the algorithm accuracy.Just as important,pooling used in the adjacent features can reduce high-dimensional feature dimension,avoid losing hidden action information.Constructed3D Conv-CoHOG feature by using the hidden structure information in video frames on the time dimension,and extending two-dimensional CoHOG features to three-dimensional features.Pooling operation on neighboring features reduced feature dimension effectively.This algorithm solved the problems of recognition accuracy reduction because of the inter-frame information neglect and the high computing complexity caused by high-dimensional features.Mapping video features to instances and bags corresponding to multiple-instance learning,dealing with video content detection problems for different lengths of videos simply.In this article,we introduced field of research and the importance of video violence content detection firstly.Then summarized the achievements of previous research,classified the findings of the research.All algorithms are divided into3categories,based on multi-modal features of audio and video and fused color feature,based on fusion of different action features,and the content detection algorithm based on neural network and unsupervised feature extraction.The most important part of this article is the introduction of algorithmic structure.We introduced the concept of HOG features and the extraction process,compared the extraction difference between HOG,CoHOG and Conv-CoHOG,also compared the extraction difference between HOG and HOG3D,and proposed the new special video content detection algorithm3D convolution CoHOG extended from Conv-CoHOG.We compared the difference between the proposed new feature and the old features,such as computational dimension,feature dimension,and the relationship between adjacent features.In part 3.2,we introduced the framework of the new algorithm.In part 3.3 to part 3.7,we introduced the construction of feature extraction unit,the quantization of three dimensional gradients,extraction of Co-HOG3D,extraction of Conv-CoHOG3D,and the training of multiple-instance learning algorithm model.In part 4.1,described the two databases used in this experiment.In part 4.2,showed parameter setting and evaluation criteria.Then we analyzed the experimental results.In stage of training data,we used three classifiers,each classifier has a variety of implementations.When testing,compared the results of different features,analyzed the reasons for the different results,and analyzed the effectiveness of the new feature.In the end,we put forward effective solution on special video content detection.The highest detection accuracy on hockey and movie sets illustrated the availability of the proposed new algorithm on the special video detection.3% higher than the existing optimal algorithm on Hockey data set,0.5% higher than the existing optimal algorithm on Movie data set.

作者宋伟任栋于京齐振国 SONG Wei;REN Dong;YU Jing;QI Zhen-Guo(School of Information Engineering, Minzu University of China, Beijing 100081;School of Electronic Information Engineering, Beijing Jiaotong University, Beijing 100044)

机构地区中央民族大学信息工程学院北京交通大学电子信息工程学院

出处《计算机学报》 EI CSCD 北大核心 2019年第1期149-163,共15页 Chinese Journal of Computers

基金国家自然科学基金(61503424 61331013) 国家留学基金中央民族大学一流大学一流学科("图像工程") 中央民族大学青年教师科研能力提升计划项目资助~~

关键词视频内容检测梯度方向直方图多示例学习卷积池化极限学习机 special videos detection histogram of oriented gradient multiple instance learning convolution pooling extreme learning machine

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1任栋,宋伟,于京,姜薇.特殊视频内容检测算法研究综述[J].信息网络安全,2016(9):184-191. 被引量：8

二级参考文献2

1丁昕苗,李兵,胡卫明,郭文,王振翀.基于多视角融合稀疏表示的恐怖视频识别[J].电子学报,2014,42(2):301-305. 被引量：7
2黄杰,倪鹏宇.基于人体关键部位的不良图像过滤[J].应用科学学报,2014,32(4):416-422. 被引量：4

共引文献7

1王庆丰,郝斌,成瑾,雷建青,肖晟.一种可变码率视频内容检测技术[J].通信技术,2017,50(11):2577-2583. 被引量：1
2康凯,王重道,王生进,范英.面向人口信息人像比对应用的人像比对算法研究[J].信息网络安全,2017(12):80-84. 被引量：2
3孙必慎,石武祯,姜峰.计算视觉核心问题:自然图像先验建模研究综述[J].智能系统学报,2019,14(1):71-81. 被引量：2
4谷学汇.基于信息融合算法的暴力视频内容识别[J].济南大学学报（自然科学版）,2019,33(3):224-228. 被引量：4
5袁志民,朱春磊,吕成都.基于SM4的选择性视频加密算法[J].通信技术,2019,52(8):1962-1966. 被引量：2
6吴晨思,蔡茂滨,杨耀淳,赵晓莺,范科峰.视频内容安全评价发展探讨[J].中国图象图形学报,2022,27(1):163-175. 被引量：1
7徐源,张玉杰.基于改进ShuffleNetV2的敏感内容识别与应用[J].传感器与微系统,2023,42(3):164-168. 被引量：1

同被引文献74

1文伟,肖志云,彭思龙.边缘指导的BDCT压缩图像的后处理算法[J].计算机辅助设计与图形学学报,2005,17(9):2022-2028. 被引量：2
2高新波,田春娜,张娜.一种基于SVM主动学习的卡通视频检测方法[J].电子与信息学报,2007,29(6):1338-1342. 被引量：2
3王传旭,张祥光,原春锋,刘云.基于邻域相关性和帧间连续性的前景目标分割[J].数据采集与处理,2007,22(3):288-291. 被引量：5
4武伟,詹玲超.利用颜色滤波阵列特性和模糊估计检测篡改[J].计算机工程与设计,2007,28(21):5179-5180. 被引量：1
5骆伟祺,黄继武,丘国平.鲁棒的区域复制图像篡改检测技术[J].计算机学报,2007,30(11):1998-2007. 被引量：65
6王鑫,鲁志波.基于JPEG块效应差异的图像篡改区域自动定位[J].计算机科学,2010,37(2):269-273. 被引量：12
7李荣杰,蒋兴浩,孙锬锋.一种基于音频词袋的暴力视频分类方法[J].上海交通大学学报,2011,45(2):214-218. 被引量：4
8陈威兵,杨高波,陈日超,朱宁波.数字视频真实性和来源的被动取证[J].通信学报,2011,32(6):177-183. 被引量：20
9张静,陈静,苏育挺.基于滤波检测的视频区域篡改检测算法[J].电子测量技术,2011,34(11):66-69. 被引量：2
10吴双,张文生,徐海瑞.基于词间关系分析的文本特征选择算法[J].计算机工程与科学,2012,34(6):140-145. 被引量：3

引证文献7

1谷学汇.基于信息融合算法的暴力视频内容识别[J].济南大学学报（自然科学版）,2019,33(3):224-228. 被引量：4
2徐建国,肖海峰,赵华.基于多示例学习框架的文本分类算法[J].计算机工程与设计,2020,41(4):1017-1023.
3陈临强,杨全鑫,袁理锋,姚晔,张祯,吴国华.视频对象移除篡改的时空域定位被动取证[J].通信学报,2020,41(7):110-120. 被引量：3
4谢旻旻,钟小莉.基于三维直方图的立体图像层次化分割算法[J].计算机仿真,2021,38(2):133-136.
5路皓翔,刘振丙,张静,王子民.结合多尺度循环卷积和多聚类空间的红外图像增强[J].电子学报,2022,50(2):415-425. 被引量：4
6赵璐,袁立明,郝琨.多示例学习算法综述[J].计算机科学,2022,49(S01):93-99. 被引量：2
7周江,蔡臻.基于CNN与LSTM混合算法下的学生学习表情识别研究[J].广东交通职业技术学院学报,2023,22(1):48-52. 被引量：2

二级引证文献15

1葛思坤.短视频生产与传播的负面风险管理策略分析[J].视听,2020(2):160-162. 被引量：1
2熊礼治,曹梦琦,付章杰.基于三维双流网络的视频目标移除篡改取证[J].通信学报,2021,42(12):202-211. 被引量：1
3蔡兴泉,封丁惟,王通,孙辰,孙海燕.基于时间注意力机制和EfficientNet的视频暴力行为检测[J].计算机应用,2022,42(11):3564-3572. 被引量：1
4谢静,刘玉文.基于LDA模型和卡方检验的网络暴力话题挖掘方法[J].西昌学院学报（自然科学版）,2022,36(4):97-103. 被引量：1
5杨志勇,江峰,于旭,杜军威.采用离群点检测技术的混合型数据聚类初始化方法[J].智能系统学报,2023,18(1):56-65. 被引量：3
6贺琨,李智,王国美,张健.基于双流CNN的帧内取证深度学习算法研究[J].计算机仿真,2023,40(1):259-266.
7史超,王兴桦,吴新垒,黄玉,李智星,毛叶丰.基于机器学习快速预报模型的城市洪涝预报预警系统研究及应用[J].西北水电,2023(2):12-19. 被引量：1
8潘鑫鑫,侯精明,陈光照,周聂,吕佳豪,梁鑫,唐君言,张松.基于 K 近邻和水动力模型的城市内涝快速预报[J].水资源保护,2023,39(3):91-100. 被引量：4
9苗勃.基于红外图像增强算法的石油储罐内油品温度过高风险自动识别方法[J].化工自动化及仪表,2023,50(6):900-904.
10冯光璐,欧阳静,李然,倪凡,曾路.电网OA系统非结构化文档内容自动化识别技术[J].信息技术,2024,48(1):104-109.

1梁建胜,黄隆胜,徐淑琼.基于视频内容检测的协同过滤视频推荐系统[J].控制工程,2018,25(2):305-312. 被引量：12
2陈守辉,沙晶晶,李振.一种基于改进 SIFT 算法的图像匹配检索方法[J].信息周刊,2018,0(13):0085-0086.
3杨先伟,战学秋,康红娟,罗影.一种利用二次筛法分析画面异常的方法[J].吉林化工学院学报,2018,35(9):70-75.
4杜勇.“数字信号处理”课程线性卷积教学研究[J].四川工商学院学术新视野,2018,3(4):22-25.
5李琳,张涛.结合改进聚合通道特征和灰度共生矩阵的俯视行人检测算法[J].计算机应用,2018,38(12):3367-3371. 被引量：5
6蔡鹏飞,段朝伟.基于最优导向法则与距离约束的图像修复算法[J].电子测量与仪器学报,2018,32(10):119-125. 被引量：3
7景星烁,邹卫军,夏婷,李超.基于差异颜色特性的自适应互补学习目标跟踪[J].计算机辅助设计与图形学学报,2018,30(12):2253-2261. 被引量：5
8关键,赵天然,于西西,曹聪.军队院校网络微课程建设研究[J].求知导刊,2018(30):60-60.
9张洋洋,荆晓远,吴飞.基于迁移学习的跨项目软件缺陷预测[J].计算机技术与发展,2018,28(12):83-85.
10刘永梅,杜松怀,盛万兴.基于AdaBoost算法的剩余电流分类方法[J].电子技术应用,2018,44(12):147-150. 被引量：3

计算机学报

2019年第1期

浏览历史

内容加载中请稍等...

一种新的基于三维卷积共生梯度直方图和多示例学习的特殊视频检测算法被引量：7

参考文献1

二级参考文献2

共引文献7

同被引文献74

引证文献7

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

一种新的基于三维卷积共生梯度直方图和多示例学习的特殊视频检测算法 被引量：7

参考文献1

二级参考文献2

共引文献7

同被引文献74

引证文献7

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

一种新的基于三维卷积共生梯度直方图和多示例学习的特殊视频检测算法被引量：7