Reinforcement learning-based detection method for malware behavior in industrial control systems

Cited by: 14

Abstract: Owing to the popularity of intelligent mobile devices, malware on the internet seriously threatens the security of industrial control systems, and the increasing number of malware attacks has become a major concern in the information security community. As malware variants multiply across a wide range of application fields, detecting malware and protecting industrial control systems pose substantial technical challenges. Although many traditional solutions detect malware effectively, existing approaches remain limited in how intelligently and adaptively they detect and recognize increasingly complex malware. Given the success of machine learning methods in data analysis applications, advanced algorithms can also be applied to the detection and analysis of complex malware.
To address this problem, we developed a detection framework for malware that threatens the network security of industrial control systems, built on reinforcement learning, an advanced machine learning algorithm. During implementation, according to the actual needs of malware behavior detection, the key modules, namely the feature extraction, policy, and classification networks, were designed around the sequential decision-making and dynamic feedback learning that characterize reinforcement learning. The training algorithms for these modules are presented together with a detailed functional analysis and the implementation framework. In application experiments, after preprocessing a real malware dataset, the method was tested and achieved satisfactory classification performance, which verifies the efficiency and effectiveness of the reinforcement learning-based method. The method can provide an intelligent decision aid for general malware behavior detection.

Authors: GAO Yang (高洋); WANG Li-wei (王礼伟); REN Wang (任望); XIE Feng (谢丰); MO Xiao-feng (莫晓锋); LUO Xiong (罗熊); WANG Wei-ping (王卫苹); YANG Xi (杨玺) (China Information Technology Security Evaluation Center, Beijing 100085, China; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China; Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing 100083, China; Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, China; Beijing Intelligent Logistics System Collaborative Innovation Center, Beijing 101149, China)

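Institutions: China Information Technology Security Evaluation Center; School of Computer and Communication Engineering, University of Science and Technology Beijing; Institute of Artificial Intelligence, University of Science and Technology Beijing; Beijing Key Laboratory of Knowledge Engineering for Materials Science; Beijing Intelligent Logistics System Collaborative Innovation Center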

Source: Chinese Journal of Engineering (工程科学学报), 2020, No. 4, pp. 455-462 (8 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

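Funding: National Natural Science Foundation of China (U1736117, U1836106); Beijing Natural Science Foundation (19L2029, 9204028); Open Project of the Beijing Intelligent Logistics System Collaborative Innovation Center (BILSCIC-2019KF-08); Special Fund for Scientific and Technological Innovation of the Shunde Graduate School, University of Science and Technology Beijing (BK19BF006); Fundamental Research Funds of the Beijing Key Laboratory of Knowledge Engineering for Materials Science (FRF-BD-19-012A)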

Keywords: malware; detection method; reinforcement learning; feature extraction; policy network

Classification number: TP273 [Automation and Computer Technology: Detection Technology and Automation Devices]
