针对恶意逃避行为的PDF文档检测

PDF Document Detection for Malicious Evasion Behavior

下载PDF

导出

摘要便捷式文档格式(PDF)是全球数据交换中广泛使用的格式之一,人们对其有很高的信任度。然而,近年来不法分子利用PDF文档进行恶意网络攻击的情况越来越严重。随着黑客技术的进步,他们也逐渐采用一些逃避检测的方法,使得常见的学习算法难以检测到这种恶意文件。针对这些“更聪明”的恶意PDF攻击样本,对PDF文档的特性进行了分析,提取了25维特征,并应用调参后的Adaboost算法训练模型,准确率达到99.63%,优于同领域的其他研究成果。 The Portable Document Format(PDF)is one of the widely used formats in global data exchange,and people have a high level of trust in it.However,in recent years,the situation of criminals using PDF documents for malicious network attacks has become increasingly serious.With the advancement of hacker technology,they are gradually adopting methods to evade detection,making it more difficult for common learning algorithms to detect such malicious files.In response to these“smarter”malicious PDF attack samples,an analysis of the characteristics of PDF documents is conducted,and 25-dimensional features are extracted.By applying a finely-tuned Adaboost algorithm for model training,an accuracy rate of 99.63%is achieved,surpassing other research achievements in the same field.

作者李东帅尚培文 LI Dongshuai;SHANG Peiwen(School of Electronics&Information Engineering,Liaoning University of Technology,Jinzhou 121001,China)

机构地区辽宁工业大学电子与信息工程学院

出处《现代信息科技》 2024年第10期7-12,共6页 Modern Information Technology

关键词 PDF 逃避检测 ADABOOST算法网络攻击 PDF evading detection Adaboost algorithm network attack

分类号 TP309 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献10

1喻民,姜建国,李罡,刘超,黄伟庆,宋楠.恶意文档检测研究综述[J].信息安全学报,2021,6(3):54-76. 被引量：8
2林杨东,杜学绘,孙奕.恶意PDF文档检测技术研究进展[J].计算机应用研究,2018,35(8):2251-2255. 被引量：6
3张福勇,齐德昱,胡镜林.基于C4.5决策树的嵌入型恶意代码检测方法[J].华南理工大学学报（自然科学版）,2011,39(5):68-72. 被引量：8
4胡江,周安民.针对JavaScript攻击的恶意PDF文档检测技术研究[J].现代计算机,2016,22(1):36-40. 被引量：4
5徐建平.基于SVM模型的恶意PDF文档检测方法[J].电脑知识与技术,2016,12(8X):90-92. 被引量：1
6李睿,杨淑群,张新宇.一种双向采样的恶意PDF文档检测方法[J].软件导刊,2022,21(5):67-72. 被引量：2
7俞远哲,王金双,邹霞.基于特征集聚和卷积神经网络的恶意PDF文档检测方法[J].信息技术与网络安全,2021,40(8):35-41. 被引量：3
8俞远哲,王金双,邹霞.基于文档图结构的恶意PDF文档检测方法[J].信息技术与网络安全,2021,40(11):16-23. 被引量：1
9李坤明,顾益军,张培晶.对抗环境下基于集成决策树的恶意PDF文件检测[J].计算机应用与软件,2020,37(10):318-322. 被引量：4
10李坤明,顾益军,王安.逃避攻击下恶意PDF文件检测技术[J].中国人民公安大学学报（自然科学版）,2019,25(3):60-64. 被引量：4

二级参考文献49

1闵华清,卢炎生,蒋晓宇.基于共同进化计算的分类规则算法[J].华南理工大学学报（自然科学版）,2006,34(6):69-73. 被引量：1
2Stolfo S J,Wang K,Li W J.Towards stealthy malware detection[M] // Malware detection.Heidelberg:SpringerVerlag,2007:231-249.
3Li W J,Stoffo S J,Stavrou A,et al.A study of malcodebearing documents[C] //Proceedings of the 4th International Conference on Detection of Intrusions and Malware,and Vulnerability Assessment.Heidelberg:Springer-Verlng,2007:231-250.
4Shafiq M Z,Khayam S A,Farooq M.Embedded malware detection using Markov n-grams[C] //Proceedings of the 5th International Conference on Detection of Intrusions and Malware,and Vulnerability Assessment.Heidelberg:Springer-Verlag,2008:88-107.
5John Leyden.Trojan exploits unpatched Word vulnerability[EB/OL].(2006-05-22)[2010-05-28].http://www.theregister.co.uk/2006/05/22/trojan_ exploit_word_vuln/.
6Joris Evers.Zero-day attacks continue to hit Microsoft[EB/OL].(2006-09-28)[2010-05-28].http://news.cnet.com/ Zero-day-attacks-continue-to-hit-Microsoft/2100-7349_3-6120481.html.
7David Kierznowski.Backdooring PDF files[EB/OL].(2006-09-13)[2010-05-28].http:// michaeldaw.org/md-hacks/backdooring-pdf-files.
8Damashek M.Gauging similarity with n-grams:language-independent categorization of text[J].Science,1995,267(5199):843-848.
9Grossman D A,Frieder O.Information retrieval:algorithms and heuristics[M].2nd ed.Heidelberg:Springer-Verlag,2004.
10Dumais S,Platt J,Heckerman D,et al.Inductive learning algorithms and representations for text categorization[C] // Proceedings of the 7th International Conference on Information and Knowledge Management.New York:ACM Press,1998:148-155.

共引文献27

1陈亮.改进的贝叶斯网络模型在保险欺诈挖掘中的应用[J].河南城建学院学报,2012,21(1):50-53. 被引量：2
2唐慧强,杭丽娜,范海娟.基于C4.5决策树算法的天气预警系统的手机终端设计[J].计算机应用,2013,33(5):1467-1469. 被引量：9
3边根庆,龚培娇,邵必林.基于K-L散度的恶意代码模型聚类检测方法[J].计算机工程,2014,40(12):104-107. 被引量：1
4赵丽,齐兴斌,李雪梅,田涛.基于PTM潜在Dirichlet分配的少量标记样本文本分类[J].计算机应用研究,2015,32(5):1428-1432. 被引量：2
5张福勇,赵铁柱.采用路径IRP的Windows恶意进程检测方法[J].沈阳工业大学学报,2015,37(4):434-439. 被引量：5
6蒋传勇,姚立红,潘理.基于VMM的程序行为异常检测[J].信息安全与通信保密,2016,14(3):118-122. 被引量：1
7李涛.基于SVM的恶意PDF检测研究[J].现代计算机（中旬刊）,2018(3):117-120. 被引量：2
8林杨东,杜学绘,孙奕.恶意PDF文档检测技术研究进展[J].计算机应用研究,2018,35(8):2251-2255. 被引量：6
9文伟平,吴勃志,焦英楠,何永强,通信作者.基于机器学习的恶意文档识别工具设计与实现[J].信息网络安全,2018,0(8):1-7. 被引量：3
10李国,黄永健,王静,徐俊洁,王鹏.一种基于复合特征的恶意PDF检测方法[J].现代电子技术,2020,43(2):45-48. 被引量：2

1田昊,王超.基于国密SM3和SM4算法的SNMPv3安全机制设计与实现[J].计算机科学,2024,51(S01):919-925.

现代信息科技

2024年第10期

浏览历史

内容加载中请稍等...

针对恶意逃避行为的PDF文档检测

参考文献10

二级参考文献49

共引文献27

相关作者

相关机构

相关主题

浏览历史