Hybrid Testing Based on Symbolic Execution and Fuzzing (cited by: 18)

Abstract: Software testing is a common way to assure software quality, and achieving high coverage is an important and challenging research problem in testing. Fuzz testing and symbolic execution, two mainstream testing techniques, have been widely studied and applied in academia and industry, and each has its own strengths and limitations. Fuzz testing randomly mutates test cases and executes the program dynamically, so it can reach and cover deep branches, but random mutation rarely produces test cases that cover branches guarded by complex conditions. Symbolic execution relies on a constraint (SMT) solver and can generate test cases that cover such branches, but it often suffers from state explosion during symbolic execution and therefore has difficulty covering deep branches. Existing work has shown that combining symbolic execution with fuzzing can achieve better results than using either technique alone. Based on an analysis of the strengths and weaknesses of both techniques, this study proposes Afleer, a branch coverage-based hybrid testing approach that combines the two to generate test cases with higher branch coverage.
Specifically, fuzz testing (e.g., AFL) quickly generates a large number of test cases that cover deep branches, and symbolic execution (e.g., KLEE) searches based on the coverage information from fuzzing and generates test cases only for the branches that remain uncovered. To evaluate the effectiveness of Afleer, the study uses the standard benchmark LAVA-M and the real-world project oSIP as evaluation subjects, with bug detection and coverage as the evaluation measures. The experimental results show that: (1) for bug detection, Afleer found 755 bugs in total, while AFL found only 1; (2) for coverage, Afleer achieved varying degrees of improvement on both the benchmark and the real-world project. On oSIP, Afleer increases branch coverage by 2.4 times and path coverage by 6.1 times compared with AFL. In addition, Afleer detected a new bug in oSIP.
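The division of labor described in the abstract can be illustrated with a small, self-contained Python sketch. It is not Afleer's implementation and does not call AFL or KLEE. The toy target `run_target`, the constant `MAGIC`, and the helpers `fuzz`, `solve`, and `hybrid` are all hypothetical names introduced here. The "solver" is a lookup table standing in for what a constraint solver would derive from a path condition.

```python
import random

MAGIC = 0x5EED1234  # hypothetical constant guarding a hard-to-reach branch


def run_target(x):
    """Toy program under test: return the set of branch IDs taken for input x."""
    taken = {"even" if x % 2 == 0 else "odd"}
    taken.add("magic" if x == MAGIC else "not_magic")
    return taken


ALL_BRANCHES = {"even", "odd", "magic", "not_magic"}


def fuzz(seed, rounds, rng):
    """Fuzzing stand-in: flip one random bit of a corpus entry and keep
    the mutated input only if it covers a branch not seen before."""
    corpus = [seed]
    covered = set(run_target(seed))
    for _ in range(rounds):
        candidate = rng.choice(corpus) ^ (1 << rng.randrange(32))
        new_branches = run_target(candidate) - covered
        if new_branches:
            corpus.append(candidate)
            covered |= new_branches
    return corpus, covered


def solve(branch):
    """Symbolic-execution stand-in: return an input that takes `branch`.
    A real engine would obtain this from a solver; here it is a fixed table."""
    witnesses = {"even": 0, "odd": 1, "magic": MAGIC, "not_magic": MAGIC + 1}
    return witnesses[branch]


def hybrid(seed=0, rounds=200, rng_seed=1):
    """Fuzz first, then generate inputs only for branches fuzzing missed."""
    rng = random.Random(rng_seed)
    corpus, covered = fuzz(seed, rounds, rng)
    for branch in sorted(ALL_BRANCHES - covered):
        x = solve(branch)
        corpus.append(x)
        covered |= run_target(x)
    return corpus, covered


if __name__ == "__main__":
    corpus, covered = hybrid()
    print(sorted(covered))
```

In this toy setting, single-bit mutation starting from the seed `0` cannot reach the equality check against `MAGIC`, so that branch is left to the solver stand-in, while the cheap parity branches are left to fuzzing where possible.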

Authors: XIE Xiao-Fei; LI Xiao-Hong; CHEN Xiang; MENG Guo-Zhu; LIU Yang (Tianjin Key Laboratory of Advanced Networking (Tianjin University), Tianjin 300050, China; School of Computer Science and Technology, Nantong University, Nantong 226019, China; State Key Laboratory of Information Security (Institute of Information Engineering, Chinese Academy of Sciences), Beijing 100093, China)

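Affiliations: Tianjin Key Laboratory of Advanced Networking (Tianjin University); School of Computer Science and Technology, Nantong University; State Key Laboratory of Information Security (Institute of Information Engineering, Chinese Academy of Sciences); School of Computer Science and Engineering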

Source: Journal of Software (软件学报; indexed in EI, CSCD, PKU Core), 2019, No. 10, pp. 3071-3089 (19 pages)

Funding: National Natural Science Foundation of China (61572349, 61272106)

Keywords: software quality assurance; fuzz testing; symbolic execution; test case generation

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]

