期刊文献+

面向软件缺陷预测的互信息属性选择方法 被引量:12

Mutual information-based feature selection approach for software defect prediction
下载PDF
导出
摘要 软件开发过程中准确有效地预测具有缺陷倾向的软件模块是提高软件质量的重要方法。属性选择能够显著地提高软件缺陷预测模型的精确度和效率。提出了一种基于互信息的属性选择方法,将选择出的最优属性子集用于软件缺陷预测模型。方法采用了前向搜索策略,并在评价函数中引入非线性平衡系数。实验结果表明,基于互信息的属性选择方法提供的属性子集能提高各类软件缺陷预测模型的预测精度和效率。 Predicting defect-prone software modules accurately and effectively is an important way to control the quality of a software system during software development.Feature selection can highly improve the accuracy and efficiency of the software defect prediction model.A mutual information-based feature selection method for software defect prediction was proposed.The optimal feature subsets generated by the proposed approach were applied to train and test various prediction models.Forward search was introduced into the proposed approach,and nonlinear equilibrium coefficient was also added to the evaluation function.The experimental results show that all the classifiers achieve higher accuracy and performance by using the feature subset provided by the proposed approach.
出处 《计算机应用》 CSCD 北大核心 2012年第6期1738-1740,共3页 journal of Computer Applications
基金 教育部人文社会科学研究规划基金资助项目(11YJAZH040)
关键词 软件质量 互信息 属性选择 最优属性子集 缺陷预测 software quality mutual information feature selection optimal feature subset defect prediction
  • 相关文献

参考文献11

  • 1王青,伍书剑,李明树.软件缺陷预测技术[J].软件学报,2008,19(7):1565-1580. 被引量:148
  • 2刘海,郝克刚.软件缺陷数据的定义[J].计算机应用,2008,28(1):226-228. 被引量:5
  • 3洪智勇,刘灿涛,邓宝林.基于二次Renyi熵的正则化互信息特征选择方法[J].计算机应用,2010,30(5):1273-1276. 被引量:7
  • 4COVER T M, THOMAS J A. Elements of information theory [ M]. New York: John Wiley & Sons Inc, 1991.
  • 5KWAK N J, CHOI C H. Input feature selection by mutual informa- tion based on Parzen window[ J]. IEEE Transactions on Pattern A- nalysis and Machine Intelligence, 2002, 24(12) : 1667 - 1671.
  • 6AMIRI F, YOUSEFI M M R, LUCAS C, et al. Mutual information- based feature selection for intrusion detection systems[ J]. Journal of Network and Computer Applications, 2011, 34(4) : 1184 - 1199.
  • 7BAE C, YEH W C, CHUNG Y Y, et al. Feature selection with in- telligent dynamic swarm and rough set[ J]. Expert Systems with Ap- plications, 2010, 37(10) : 7026 -7032.
  • 8BAT'I'ITI R. Using mutual information for selecting features in su- pervised neural net learning[ J]. IEEE Transactions on Knowledge and Data Engineering, 2005, 17(9) : 1199 - 1207.
  • 9MENZIES T, GREENWALD J, FRANK A. Data mining static code attributes to learn defect predictors[ J]. IEEE Transactions on Software Engineering, 2007, 32( 11 ) : 2 - 13.
  • 10ZHENG JUN. Cost-sensitive boosting neural networks for software defect prediction[ J]. Expert Systems with Applications, 2010, 37 (6) : 4537 - 4543.

二级参考文献31

共引文献156

同被引文献86

  • 1涂亚明,毛军鹏,余静,尹磊.系统测试阶段的软件缺陷预测模型分析[J].计算机研究与发展,2010,47(S1):108-112. 被引量:2
  • 2冯少荣.决策树算法的研究与改进[J].厦门大学学报(自然科学版),2007,46(4):496-500. 被引量:67
  • 3HANJ KAMBERM.数据挖掘:概念与技术[M].北京:机械工业出版社,2001..
  • 4蒋庆,葛宏征,谢鹏.船舶类型及吨位因素对船舶系缆力的影响[J].水运工程,2007(9):51-59. 被引量:7
  • 5Witten I H,Frank E.数据挖掘实用机器学习技术[M].北京:机械工业出版社,2006
  • 6Cover T M,Thomas J A,阮吉寿,等.信息论基础[M].北京:机械工业出版社,2005.348-354.
  • 7TAN P N, STEINBACH M, KUMAR V.数据挖掘导论(完整版)[M].范明,范宏建,译.北京:人民邮电出版社,2011:79-84.
  • 8Quinlan J R. Induction of Decision Trees [J ]. MachineLearning, 1986,1(1): 81-106.
  • 9Quinlan J R. C4. 5 : Programs for Machine Learning[M]. Boston: Morgan Kaufmann Publishers, 1993.
  • 10Yu Lei, Liu Huan. Efficient feature selection via analysisof relevance and redundancy [J ]. Journal of Machine Le-arning Research, 2004(5) : 1205-1224.

引证文献12

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部