期刊文献+

基于二次学习的半监督字典学习软件缺陷预测 被引量:3

Twice Learning Based Semi-supervised Dictionary Learning for Software Defect Prediction
下载PDF
导出
摘要 当软件历史仓库中有标记训练样本较少时,有效的预测模型难以构建.针对此问题,文中提出基于二次学习的半监督字典学习软件缺陷预测方法.在第一阶段的学习中,利用稀疏表示分类器将大量无标记样本通过概率软标记标注扩充至有标记训练样本集中.再在扩充后的训练集上进行第二阶段的鉴别字典学习,最后在学得的字典上预测缺陷倾向性.在NASA MDP和PROMISE AR数据集上的实验验证文中方法的优越性. When the previous defect labels of modules in software history warehouse are limited, building an effective prediction model becomes a challenging problem. Aiming at this problem, a twice learning based semi-supervised learning algorithm for software defect prediction is proposed. In the first stage of learning, a large number of unlabeled samples are labeled with probability soft labels and extended to the labeled training dataset by using sparse representation classifier. Then, on this dataset discriminative dictionary learning is used for the second stage of learning. Finally, defect proneness prediction is conducted on the obtained dictionary. Experiments on the widely used NASA MDP and PROMISE AR datasets indicate the superiority of the proposed algorithm.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2017年第3期242-250,共9页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.61272273 61073113) 江苏省普通高校研究生科研创新计划项目(No.CXZZ12_0478)资助~~
关键词 软件缺陷预测 二次学习 半监督学习 字典学习 Software Defect Prediction, Twice Learning, Semi-supervised Learning, Dictionary Learning
  • 相关文献

参考文献2

二级参考文献32

  • 1Nikora A, Munson J. Developing Fault Predictors for Evolving Soft- ware Systems//Proc of the 9th International Software Metrics Sym- posium. Sydney, Australia, 2003:338-350.
  • 2Nagappan N, Ball T. Static Analysis Tools as Early Indicators of Prerelease Defect Density// Proc of the 27th International Confer- ence on Software Engineering. St. Louis, USA, 2005:580-586.
  • 3Menzies T, Greenwald J, Frank A. Data Mining Static Code Attrib- utes to Learn Defect Predictors. IEEE Trans on Software Engineer- ing, 2007, 33(1): 2-13.
  • 4Lessmann S, Baesens B, Mues C, et al. Benchmarking Classifica- tion Models for Software Defect Prediction: A Proposed Framework and Novel Findings. IEEE Trans on Software Engineering, 2008, 34 (4) : 485-496.
  • 5Khoshgoftaar T M, Seliya N. Analogy-Based Practical Classification Rules for Software Quality Estimation. Empirical Software Engineer-ing, 2003, 8(4) : 325-350.
  • 6Emam K E, Benlarbi S, Goel N, et al. Comparing Case-Based Rea- soning Classifiers for Predicting High Risk Software Components. Journal of Systems and Software, 2001,55 (3) : 301-320.
  • 7Turhan B, Bener A. Analysis of Naive Bayes' Assumptions on Soft- ware Fault Data: An Empirical Study. Data and Knowledge Engi- neering, 2009, 68(2) : 278-290.
  • 8Khoshgoftaar T M, Allen E B, Hudepohl J P, et al. Application of Neural Networks to Software Quality Modeling of a Very Large Tele- communications System. IEEE Trans on Neural Networks, 1997, $ (4) : 902-909.
  • 9Zheng Jun. Cost-Sensitive Boosting Neural Networks for Software Defect Prediction. Expert Systems with Applications, 2010, 37 (6) : 4537-4543.
  • 10Selby R W, Porter A A. Learning from Examples: Generation and Evaluation of Decision Trees for Software Resource Analysis. IEEE Trans on Software Engineering, 1988, 14(12) : 1743-1757.

共引文献7

同被引文献14

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部