基于两阶段算法的多媒体有害信息识别方法

Multimedia Harmful Information Recognition Method Based on Two-stage Algorithm

下载PDF

导出

摘要在互联网安全监管和网络违法犯罪打击整治的应用场景中,现有多媒体有害信息识别方法普遍存在运算效率不高、无法准确识别局部敏感信息,以及识别检测局限于单一的网络违法犯罪类型等问题。针对以上问题,文中提出了一种基于两阶段算法的多媒体有害信息识别模型。该模型将信息过滤与内容检测分阶段处理,将场景识别和元素目标检测分任务并行处理,第一阶段采用EfficientNet-B2构建高吞吐的前置过滤模块快速筛选掉80%正常内容的数据;第二阶段基于Meal-V2,Faster RCNN,NetVLAD网络构建3种不同网络结构的模块,适应多维度场景、多特征元素的识别要求。结果表明,模型运算效率在T4卡上达到57 FPS,多媒体有害信息的识别准确率、召回率均超过97%;与传统模型相比,在NPDI和自建测试集上识别准确率分别最高提升3.09%和19.26%。 In the application scenarios of Internet content security supervision and combating and rectifying Internet crimes,existing multimedia harmful information identification methods generally have problems such as low computational efficiency,inability to accurately identify local sensitive information,and identification capabilities are limited to a single type of cyber crimes.In order to solve the above problems,the paper proposes a multimedia harmful information recognition model based on a two-stage algorithm.This method processes information filtering and content detection in stages,and splits the tasks of scene recognition and element target detection.The first stage uses EfficientNet-B2 to build a high-throughput pre-filter model to quickly filter out 80%of images and short videos with normal content.In the second stage,three modules with different network structures are built based on Meal-V2,Faster RCNN,and NetVLAD networks to adapt to the recognition requirements of multi-dimensional scenes and multi-feature elements.The results show that the model’s computing efficiency reaches 57FPS(frames per second)on the T4 card,and the recognition accuracy and recall rate of multimedia harmful information exceed 97%.Compared with traditional mo-dels,the recognition accuracy rate on the NPDI dataset and the self-built test dataset increases by 3.09%and 19.26%respectively.

作者史晓苏李欣简玲倪华健 SHI Xiaosu;LI Xin;JIAN Ling;NI Huajian(Information Network Security Academy,People’s Public Security University of China,Beijing 100091,China;Network Security Corps,Shanghai Public Security Bureau,Shanghai 200025,China;Shanghai SUPREMIND Technology Co.,Ltd,Zhejiang 310000,China)

机构地区中国人民公安大学信息网络安全学院上海市公安局网络安全保卫总队上海闪马智能科技有限公司

出处《计算机科学》 CSCD 北大核心 2024年第S01期1011-1016,共6页 Computer Science

基金公安部应用创新计划(2020YYCXSHSJ019)。

关键词两阶段算法多媒体有害信息识别 Two-stage algorithm Multimedia Harmful information recognition

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1顾杨,李剑,景博,王励成.一种互联网内容安全检测过滤系统[J].计算机应用研究,2008,25(9):2834-2835. 被引量：3

二级参考文献10

1CNNIC.中国互联网络发展状况统计报告[EB／OL].http://www.cnnic.net.en/index/OE/00/11/index.htm.,2003—12—31.
2CARNEY D, CETINTERNEL U, CHERNIACK M, et al. Monitoring streams: a new class of DBMS applications, CS- 02- 01 [ R]. [ S. l. ] : Department of Computer Science, Brown University,2002.
3KNUTH D E, MORRIS J H, PRATT V R. Fast pattern matching in strings[ J]. SIAM Journal on Computing,1977,6( 1 ) :323-350.
4NAVARRO G, RAFFINOT M. Flexible pattern matching in strings: practical on-line search algorithms for texts and biological sequences [ M]. Cambridge: Cambridge University Press, 2002.
5AHO A V,CORASICK M J. Efficient string matching:an aid to bibliographic search[J]. Communication of the ACM,1975,18(6):333-340.
6WU S, MANBER U. Fast text searching allowing errors[J]. Communications of the ACM ,1992,35(10) :83-91.
7BABCOCK B, BABU S, DATAR M,et al. Models and issues in data stream systems[ C ]//Proc of ACM Symp on Principles of Database Systems ( PODS 2002). 2002.
8BABCOCK B, DATAR M, MOTWANI R. Sampling from a moving window over streaming data [ C ]//Proc of Annual ACM-SIAM Syrup on Discrete Algorithms. 2002:633-634.
9BABU S, WIDOM J. Continuous queries over data streams[ M]. [ S. l. ] :Sigmod Record, 2001.
10BOYER R S, MOORE J S. A fast string searching algorithm [ J ]. Communications of the ACM, 1977,20 (10) :762- 772.

共引文献2

1刘华鹏,刘胜全,刘艳,张华楠,李鹏.基于语义的Web内容安全检测模型[J].计算机工程与应用,2014,50(14):116-120.
2徐源,张玉杰.基于改进ShuffleNetV2的敏感内容识别与应用[J].传感器与微系统,2023,42(3):164-168. 被引量：2

1今年5月全国受理网络违法和不良信息举报1659.6万件[J].现代经济信息,2023(21):14-14.
2任娜,张悦,何沛涵.养老服务领域非法集资的特点、成因和对策[J].中国民政,2024(7):34-35.
3内蒙古警方公布5起打击整治网络谣言典型案例[J].中国防伪报道,2024(2):50-50.
4梁少玲.避坑抢红包快乐过新年[J].秋光（长寿生活）,2023(1):4-6.
5公安部:警惕农资“忽悠团”犯罪特征,一“引”二“吹”三“骗”[J].农药市场信息,2024(9):15-15.
6浙江省公安厅发布十大网络犯罪案例[J].中国防伪报道,2024(2):62-63.
7胡绍洋.电商平台中刷单行为的刑法适用研究[J].电子商务评论,2024,13(2):2510-2516.
8徐斌.论如何加强网络警察执法能力建设[J].中国科技期刊数据库科研,2017(1):218-219.
9新疆警方公布一批打击网络违法犯罪典型案例[J].中国防伪报道,2024(2):70-71.
10陈俊,谢永恒,万月亮.公安业务视角下基于XDeepFM模型的网络水军识别[J].网络安全技术与应用,2024(5):50-54.

计算机科学

2024年第S01期

浏览历史

内容加载中请稍等...

基于两阶段算法的多媒体有害信息识别方法

参考文献1

二级参考文献10

共引文献2

相关作者

相关机构

相关主题

浏览历史