Predicting Effectiveness of Generate-and-Validate Patch Generation Systems Using Random Forest 被引量：2

Predicting Effectiveness of Generate-and-Validate Patch Generation Systems Using Random Forest

导出

摘要 One way to improve practicability of automatic program repair（APR） techniques is to build prediction models which can predict whether an application of a APR technique on a bug is effective or not. Existing prediction models have some limitations. First, the prediction models are built with hand crafted features which usually fail to capture the semantic characteristics of program repair task. Second, the performance of the prediction models is only evaluated on Genprog, a genetic-programming based APR technique. This paper develops prediction models, i.e., random forest prediction models for SPR, another kind of generate-and-validate APR technique, which can distinguish ineffective repair instances from effective repair instances. Rather than handcrafted features, we use features automatically learned by deep belief network（DBN） to train the prediction models. The empirical results show that compared to the baseline models, that is, all effective models, our proposed models can at least improve the F1 by 9% and AUC（area under the receiver operating characteristics curve） by 19%. At the same time, the prediction model using learned features at least outperforms the one using hand-crafted features in terms of F1 by 11%. One way to improve practicability of automatic program repair（APR） techniques is to build prediction models which can predict whether an application of a APR technique on a bug is effective or not. Existing prediction models have some limitations. First, the prediction models are built with hand crafted features which usually fail to capture the semantic characteristics of program repair task. Second, the performance of the prediction models is only evaluated on Genprog, a genetic-programming based APR technique. This paper develops prediction models, i.e., random forest prediction models for SPR, another kind of generate-and-validate APR technique, which can distinguish ineffective repair instances from effective repair instances. Rather than handcrafted features, we use features automatically learned by deep belief network（DBN） to train the prediction models. The empirical results show that compared to the baseline models, that is, all effective models, our proposed models can at least improve the F1 by 9% and AUC（area under the receiver operating characteristics curve） by 19%. At the same time, the prediction model using learned features at least outperforms the one using hand-crafted features in terms of F1 by 11%.

作者 XU Yong HUANG Bo ZOU Xiaoning KONG Liying

机构地区 School of Mathematics and Statistics School of Electronic and Electrical Engineering

出处《Wuhan University Journal of Natural Sciences》 CAS CSCD 2018年第6期525-534,共10页 武汉大学学报（自然科学英文版）

基金 Supported by the National Natural Science Foundation of China(61603242) Opening Project of Collaborative Innovation Center for Economics Crime Investigation and Prevention Technology(JXJZXTCX-030) the Scientific Research Fund of Zhaoqing Univeristy(201734) Innovative Guidance Fund of Zhaoqing City(201704030409)

关键词 automatic program repair deep belief network effec-tiveness prediction repair instance patch generation random forest automatic program repair deep belief network effec-tiveness prediction repair instance patch generation random forest

分类号 TP311.5 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献1

1徐勇,毋国庆,袁梦霆,黄勃.基于变型空间代数的自动程序修复方法[J].电子学报,2017,45(10):2498-2505. 被引量：2

二级参考文献1

1DONG YunMei.Linear algorithm for lexicographic enumeration of CFG parse trees[J].Science in China(Series F),2009,52(7):1177-1202. 被引量：2

共引文献1

1马立新,李春,郝成亮,陈明,田健.基于Robot框架的非功能性自动化回归测试研究[J].科技通报,2019,35(11):90-93.

引证文献2

1NI Xiaomei,WANG Huawei,LV Shaolan,XIONG Minglan.An Ensemble Classification Model Based on Imbalanced Data for Aviation Safety[J].Wuhan University Journal of Natural Sciences,2021,26(5):437-443.
2XU Yong,CHENG Ming.Multi-View Feature Fusion Model for Software Bug Repair Pattern Prediction[J].Wuhan University Journal of Natural Sciences,2023,28(6):493-507.

1郭钰,李明洋,刘祥春,王鸣飞,李雪妍,张惠茅.CT影像组学对结直肠癌肝转移的诊断价值[J].中国临床医学影像杂志,2018,29(11):798-802. 被引量：14
2马海荣,程新文.一种处理非平衡数据集的优化随机森林分类方法[J].微电子学与计算机,2018,35(11):28-32. 被引量：10
3Bo-na Deng,Guang-hui Li,Jun Luo,Jing-hua Zeng,Ming-jun Rao,Zhi-wei Peng,Tao Jiang.Alkaline digestion behavior and alumina extraction from sodium aluminosilicate generated in pyrometallurgical process[J].International Journal of Minerals,Metallurgy and Materials,2018,25(12):1380-1388.
4吴婷婷,余克强,张海辉,冯毅,张晓,汪辉辉.小麦黑胚病识别模型优选和多分类识别分析[J].光谱学与光谱分析,2018,38(12):3912-3916. 被引量：2
5王飞,杨胜天,丁建丽,魏阳,葛翔宇,梁静.环境敏感变量优选及机器学习算法预测绿洲土壤盐分[J].农业工程学报,2018,34(22):102-110. 被引量：39
6Shirin Hasani-Ranjbar,Mahsa M Amoli,Maasumeh Noorani,Mohsen Ghadami.Malignant pheochromocytoma in neurofibromatosis; mutation screening of RET proto-oncogene, VHL and SDH gene[J].World Journal of Medical Genetics,2013,3(1):1-4.
7Yang Liu,Nicole L Jennings,Anthony M Dart,Xiao-Jun Du.Standardizing a simpler, more sensitive and accurate tail bleeding assay in mice[J].World Journal of Experimental Medicine,2012,2(2):30-36. 被引量：6

Wuhan University Journal of Natural Sciences

2018年第6期

浏览历史

内容加载中请稍等...

Predicting Effectiveness of Generate-and-Validate Patch Generation Systems Using Random Forest 被引量：2

参考文献1

二级参考文献1

共引文献1

引证文献2

相关作者

相关机构

相关主题

浏览历史