Three-dimensional attention-enhanced algorithm for violence scene detection
Abstract: To strengthen content-safety screening of online multimedia and filter objectionable material effectively, this paper proposes a video violence detection algorithm based on three-dimensional attention enhancement. The algorithm uses 3D-DenseNet as its backbone network. First, P3D extracts low-level spatio-temporal features. Second, a SimAM attention module computes channel-spatial attention to enhance the key regions of each frame. Third, a transition layer strengthened with temporal attention highlights key temporal information. Together these form channel-spatial-temporal three-dimensional attention, which improves violent scene detection. In experiments, the algorithm reaches accuracies of 98.75% and 100% on Hockey and Movies, two small-scale violence detection datasets with homogeneous content, and 89.25% on RWF-2000, a large-scale dataset with diverse content; its overall performance exceeds that of comparable algorithms, which validates its effectiveness. In a violent-content localization experiment on long videos, the algorithm also outperforms comparable algorithms on the VSD2014 dataset, which demonstrates its generalization ability in violent content detection.
Authors: DING Xinmiao, WANG Jiaxing, GUO Wen (School of Information and Electronic Engineering, Shandong Technology and Business University, Yantai 264005, China)
Source: Journal of Xidian University (indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2024, No. 1, pp. 114-124 (11 pages)
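Funding: National Natural Science Foundation of China (61876100, 62072286, 61572296); Youth Innovation Science and Technology Support Program of Shandong Provincial Higher Education Institutions (2019KJN041, 2020KJN005); Shandong Province Graduate Education Innovation Program (SDYAL21211).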
Keywords: violence detection; deep learning; attention mechanism; pattern recognition; P3D; 3D-DenseNet
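
The channel-spatial attention step named in the abstract can be illustrated with a minimal NumPy sketch of a SimAM-style, parameter-free attention weight. This is only an illustration under stated assumptions, not the authors' implementation: the feature layout `(C, T, H, W)`, the choice to compute statistics over each frame's spatial plane, the function name `simam`, and the value of the regularisation constant `lam` are all assumptions that do not come from the abstract.

```python
import numpy as np


def simam(x, lam=1e-4):
    """SimAM-style parameter-free attention (illustrative sketch).

    x   : feature array of shape (C, T, H, W). The layout is an assumption.
    lam : small regularisation constant. The value is an assumption.

    Statistics are computed over the spatial plane (H, W) separately for
    every channel and every frame, so each activation gets its own weight.
    """
    # Number of "other" neurons in each spatial plane.
    n = x.shape[-2] * x.shape[-1] - 1
    # Squared deviation of every activation from its plane mean.
    d = (x - x.mean(axis=(-2, -1), keepdims=True)) ** 2
    # Variance estimate of each plane.
    var = d.sum(axis=(-2, -1), keepdims=True) / n
    # Inverse energy: activations far from the plane mean score higher.
    inv_energy = d / (4.0 * (var + lam)) + 0.5
    # A sigmoid turns the score into a gate that rescales the input.
    return x * (1.0 / (1.0 + np.exp(-inv_energy)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.standard_normal((8, 4, 16, 16))  # hypothetical feature map
    out = simam(feats)
    print(out.shape)  # (8, 4, 16, 16)
```

Because the gate is computed from the activations themselves, the module adds no learnable parameters. The temporal attention in the transition layer is a separate component that the abstract does not specify in enough detail to sketch here.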