期刊文献+

软件缺陷集成预测模型研究 被引量:6

Software defect prediction based on classifiers ensemble
下载PDF
导出
摘要 利用单一分类器构造的缺陷预测模型已经遇到了性能瓶颈,而集成分类器相比单一分类器往往具有显著的性能优势。以构造高效的集成缺陷预测模型为出发点,比较了七种不同类型集成分类器的算法和特点。在14个基准数据集上的实验显示,部分集成预测模型的性能优于基于朴素贝叶斯的单一预测模型。其中,基于投票的集成分类框架具有最优的预测性能以及统计学意义上的性能优势显著性,随机森林算法次之。Stacking集成框架也具有较强的泛化能力。 Software defect prediction using classification algorithms was advocated by many researchers.However,several new literatures show the performance bottleneck by applying a single classifier recent years.On the other hand,classifiers ensemble can effectively improve classification performance than a single classifier.This paper conducted a comparative study of various ensemble methods with perspective of taxonomy.A series of benchmarking experiments on public-domain datasets MDP show that applying classifiers ensemble methods to predict defect could achieve better performance than using a single classifier.Specially,in all seven ensemble methods evolved by this experiments,voting and random forest have obvious performance superiority than others,and Stacking also has better generalization ability.
出处 《计算机应用研究》 CSCD 北大核心 2013年第6期1734-1738,共5页 Application Research of Computers
基金 国家自然科学基金面上项目(61173190) 陕西省自然科学基础研究计划项目(2009JM8002) 中央高校基本科研业务费专项资金资助项目(GK201302055)
关键词 软件缺陷预测 集成分类 投票 随机森林 software defect prediction classifiers ensemble vote random forest
  • 相关文献

参考文献17

  • 1MENZIES T, MILTON Z, TURHAN B, et al. Defect prediction from static code features : current results, limitations, new approaches [ J ]. Automated Software Engineering, 2010,17 ( 5 ) : 375-407.
  • 2JIANG Y, CUKIC B, MENZIES T, et al. Comparing design and code metrics for software quality prediction [ C ]//Proc of the 4th International Workshop on Predictor Models in Software Engineering. New York : ACM Press ,2008 : 11-18.
  • 3MENZIES T, TURHAN B, BENER A, et al. Implications of ceiling effects in defect predictors [ C ]// Proc of ACM International Conference on Predictive Models in Software Engineering. 2008: 47-54.
  • 4ZHANG H, NELSON A, MENZIES T. On the value of learning from defect dense components for software defect prediction [ C ]// Proc of ACM International Conference on Predictive Models in Software Engineering. 2010 : 1-9.
  • 5STEFANO C D, FONTANELLA F, FOLINO G, et al. A Bayesian approach for combining ensembles of GP classifiers[ C]//Proc of the 10th International Workshop on Multiple Classifier Systems. 2011:26-35.
  • 6TOSUN A, TURHAN B, BERNER A B. Ensemble of software defect predictors : a case study[ C ]//Proc of the 2nd International Symposium on Empirical Software Engineering and Measurement. 2008:318- 320.
  • 7ZHENG J. Cost-sensitive boosting neural networks for software defect prediction[ J]. Expert Systems with Applications, 2010,37 ( 6 ) : 4537-4543.
  • 8BREIMAN L. Bagging predictors [ J]. Machine Learning, 1996,24 (2) :123-140.
  • 9FREUND Y, SCHAPIRE R. Experiments with a new boosting algorithm[C]// Proc of the 13th International Conference on Machine Learning. 1996 : 148-156.
  • 10DIETTERICH T. An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization[J]. Machine Learning,2000,40(2) :139-157.

同被引文献72

  • 1聂林波,刘孟仁.软件缺陷分类的研究[J].计算机应用研究,2004,21(6):84-86. 被引量:39
  • 2刘亦书.CMM/CMMI中同行评审子过程的定量控制[J].计算机工程与设计,2004,25(6):978-981. 被引量:5
  • 3Menzies T,Greenwald J,Frank A.Data mining static code attributes to learn defect predictors[J].IEEE Transactions on Software Engineering,2007,33(1):2-13.
  • 4Turhan B,Bener A.Analysis of Naive Bayes assumptions on software fault data:An empirical study[J].Data&Knowledge Engineering,2009,68(2):278-290.
  • 5Boetticher G D.Improving credibility of machine learner models in software engineering[M]∥Advanced Machine Learner Applications in Software Engineering(Series on Software Engineering and Knowledge Engineering),USA:Langston University,2006:52-72.
  • 6Catal C,Diri B.Investigating the effect of dataset size,metrics sets and feature selection techniques on software fault prediction problem[J].Information Sciences,2009,179(8):1040-1058.
  • 7Riquelme J C,Ruiz R,Rodriguez D,et al.Finding defective modules from highly unbalanced datasets[J].Actas de los Talleres de las Jornadas de Ingeniería del Software y Bases de Datos,2008,2(1):67-74.
  • 8Menzies T,Turhan B,Bener A,et al.Implications of ceiling effects in defect predictors[C]∥Proc of the 4th International Workshop on Predictor Models in Software Engineering,2008:47-54.
  • 9Seiffert C,Khoshgoftaar T M,Van Hulse J.Improving software-quality predictions with data sampling and boosting[J].IEEE Transactions on Systems,Man and Cybernetics,Part A:Systems and Humans,2009,39(6):1283-1294.
  • 10Pelayo L,Dick S.Evaluating stratification alternatives to improve software defect prediction[J].IEEE Transactions on Reliability,2012,61(2):516-525.

引证文献6

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部