Binary Program Vulnerability Mining Based on Neural Network

Abstract: Software security analysts typically have access only to the executable program and cannot inspect its source code, which poses significant challenges to security analysis. Identifying vulnerabilities in such source-unavailable programs is crucial, yet only a limited set of general-purpose tools exists because current vulnerability mining methods have low versatility. These tools also suffer from shortcomings. In targeted fuzzing, the path search toward target points is not streamlined enough, and completely random testing leads to an excessively large search space. In code similarity analysis, code feature extraction is incomplete, which may result in information loss. In this paper, we propose a cross-platform, cross-architecture approach to vulnerability mining that combines neural networks with de-obfuscation techniques. A de-obfuscation technique built on the Angr framework is introduced, together with a VEX-IR-based intermediate language conversion method. This combination allows binary programs to be handled uniformly across architectures, compilers, and compilation options. The binary programs are then processed to extract multi-level spatial features using a skip-gram model with a self-attention mechanism combined with a bidirectional Long Short-Term Memory (LSTM) network. Finally, a graph embedding network is used to evaluate the similarity of program functions. Based on these similarity scores, a target function is determined, and symbolic execution is applied to solve the target function. The solved content serves as the initial seed for targeted fuzzing. In summary, the binary program is de-obfuscated and converted to an intermediate language, function similarity is scored with a graph embedding network, and symbolic execution is guided by those scores. This approach facilitates cross-architecture analysis of executable programs without their source code and at the same time reduces the risk of path explosion in symbolic execution.
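The step in which similarity scores determine a target function can be illustrated with a minimal sketch. It is not the paper's network: the embedding below is an untrained, toy neighbor-aggregation scheme with no learned weights, and the function names (`embed_graph`, `rank_targets`), the feature vectors, and the candidate names `parse_header` and `log_init` are all hypothetical. It only shows the general idea of embedding each function's control-flow graph as a vector, scoring candidates by cosine similarity against a known vulnerable function, and taking the top-ranked candidate as the target for symbolic execution.

```python
import math

def embed_graph(features, edges, rounds=2):
    """Toy graph embedding (assumption: no learned weights).

    features: one numeric feature vector per basic block.
    edges:    (src, dst) pairs of the control-flow graph.
    Each round, a block adds its neighbors' current states to its own
    features and applies tanh; the block states are summed at the end.
    """
    n = len(features)
    neighbors = {i: [] for i in range(n)}
    for src, dst in edges:
        neighbors[src].append(dst)
        neighbors[dst].append(src)
    state = [list(f) for f in features]
    for _ in range(rounds):
        new_state = []
        for i in range(n):
            agg = list(features[i])
            for j in neighbors[i]:
                agg = [a + b for a, b in zip(agg, state[j])]
            new_state.append([math.tanh(x) for x in agg])
        state = new_state
    dim = len(features[0])
    return [sum(s[k] for s in state) for k in range(dim)]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def rank_targets(reference, candidates):
    """Rank candidate functions by similarity to a reference function.

    reference:  (features, edges) of a known vulnerable function.
    candidates: dict mapping function name to (features, edges).
    Returns (name, score) pairs, highest score first.
    """
    ref_vec = embed_graph(*reference)
    scored = [(name, cosine(ref_vec, embed_graph(*graph)))
              for name, graph in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)

# Hypothetical data: a 3-block reference function and two candidates.
reference = ([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [(0, 1), (1, 2)])
candidates = {
    "parse_header": ([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [(0, 1), (1, 2)]),
    "log_init": ([[0.0, 3.0], [0.0, 2.0]], [(0, 1)]),
}
ranking = rank_targets(reference, candidates)
target_name = ranking[0][0]  # function to hand to symbolic execution
```

In a real pipeline the feature vectors would come from the learned instruction and basic-block representations, and the top-ranked function would be passed to a symbolic execution engine to produce an initial fuzzing seed; both stages are outside the scope of this sketch.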

Authors: Zhenhui Li, Shuangping Xing, Lin Yu, Huiping Li, Fan Zhou, Guangqiang Yin, Xikai Tang, Zhiguo Wang

Affiliations: School of Information and Software Engineering; School of Electrical and Computer Engineering

Source: Computers, Materials & Continua (SCIE, EI), 2024, Issue 2, pp. 1861-1879 (19 pages)

Keywords: Vulnerability mining; de-obfuscation; neural network; graph embedding network; symbolic execution

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]


