期刊文献+

基于特征选择的Bagging分类算法研究 被引量:8

Research on Bagging Classification Algorithm Based on Feature Selection
下载PDF
导出
摘要 为了提高数据的分类性能,提出了一种基于特征选择的Bagging分类算法。通过Fisher准则和互信息的方法给定一种能够直接评价特征区分度和与类别相关性的评价方法,重新构造了计算特征区分度和与类别相关性的计算公式。并将该方法应用到Bagging分类算法当中。实现了算法迭代过程中的特征选择,使得每个基分类器都是由不同的特征子集训练所得,保证了基分类器的独立性,降低了训练误差。通过理论分析和大量的实验,对文中的方法与经典特征选择方法进行了比较,实验结果显示文中的方法能够得到更高的预测精准度。 In order to improve the classification performance of data,a Bagging classification algorithm based on feature selection is pro- posed in this paper. An evaluation method is proposed for full account of the discrimination and class information of each feature by the Fisher criterion and mutual information ,built on the formula about discrimination and class information. The feature selection algorithm is applied to the Bagging classification algorithm. The feature selection is implemented in the iterative process of algorithm, so that each base classifier is trained by different feature subsets, which ensures the independence of each base classifier, reducing the training error. Com- pared the method with several classical feature selection methods by theoretical analysis and extensive experiments, the results show that the method can achieve higher predictive accuracy.
出处 《计算机技术与发展》 2014年第4期103-106,共4页 Computer Technology and Development
基金 吉林省科技发展计划项目青年科研基金(201201070) 辽宁省社科联项目(2010lslktjyx-03)
关键词 数据挖掘 特征选择 集成学习 互信息 BAGGING 分类器 data mining feature selection ensemble learning mutual information Bagging classifier
  • 相关文献

参考文献4

二级参考文献72

  • 1顾志伟,吴秀清,荆浩,尹东,王艺元.一种基于特征选择的医学图像检索方法[J].中国生物医学工程学报,2007,26(1):30-34. 被引量:9
  • 2龚静,周经野.一种基于多重因子加权的文本特征项权值计算方法[J].计算技术与自动化,2007,26(1):81-83. 被引量:10
  • 3Langley P.Seleetion of relevant features in machine learning[J].In:Proe.AAAI Fall Symposium on Relevanee,1994:140-144.
  • 4Langley P,Iba W.Average-case analysis of a nearest neighbour algorithm[C] //Proceedings of the Thirteenth International Joint Con-Ferenee on Artifieial Intelligence,1993:889-894.
  • 5Jain A,Zongker D.Feature seleetion:evaluation,application,and Sniall sample pedortnanee[J].IEEE transactions on pattern analysis and machine intelligence,1997,19(2):153-158.
  • 6Xing E,Jordan M,Karp R.Feature seleetion for high-dimensional genomic microarray data[C] //Intl.conf.on Machine Learning,2001:601-608.
  • 7Davies S,Russl S.Np-completeness of searehes for smallest Pos Sible feature sets[C] // In:Proc.Of the AAAI Fall 94Symposium on Relevanee,1994:37-39.
  • 8Narendra PM,Fukunaga K.A branch and bound algorithm for feature subset selection[J].IEEE Transactions on Computers,1997(26):917-922.
  • 9Kittler J,Feature set search algorithms,in:C.H.Chen,Pattern Recognition and Signal Processing,Sijthoff and Noordhoff,1978:41-60.
  • 10Pudil P,Novovicova N,Kittler J.Floating search method[J].Pattern Recognition Letters,1994(15):1119-1125.

共引文献100

同被引文献96

  • 1谷雨,郑锦辉,戴明伟,何磊.基于Bagging支持向量机集成的入侵检测研究[J].微电子学与计算机,2005,22(5):17-19. 被引量:6
  • 2华德宏,刘刚,兰家隆.一种改进的一元线性回归算法[J].现代电子技术,2006,29(7):63-65. 被引量:8
  • 3Lyu M R. Software reliability engineering: a roadmap [ C ]// Proc of future of software engineering. Minneapolis: IEEE, 2007 : 153-170.
  • 4Nam J,Pan S J,Kim S. Transfer defect learning[ C]//Pro- eeedings of the 2013 international conference on software en- gineering. [ s. 1. ] : IEEE ,2013:382-391.
  • 5Seliya N, Khoshgoftaar T M, van Hulse J. Predicting faults in high assurance software[ C]//Proc of 2010 IEEE 12th inter- national symposium on high-assurance systems engineering. San Jose, CA : IEEE ,2010:26-34.
  • 6Gray D, Bowes D, Davey N, et al. Using the support vector ma- chine as a classification method for software defect prediction with static code metrics [ M ]//Engineering applications of neural networks. Berlin :Springer ,2009:223-234.
  • 7Yan Z, Chen X, Guo P. Software defect prediction using fuzzy support vector regression[ M]//Advances in Neural Networks -ISNN 2010. Berlin : Springer,2010 : 17-24.
  • 8Wang J, Shen B, Chen Y. Compressed C4.5 models for soft- ware defect prediction[ C]//Proc of 12th international confer- ence on quality software. Washington: IEEE ,2012 : 13-16.
  • 9Paikari E, Richter M M, Ruhe G. Defect prediction using case -based reasoning:an attribute weighting technique based upon sensitivity analysis in neural networks [ J ]. International Jour- nal of Software Engineering and Knowledge Engineering, 2012,22 (6) :747-768.
  • 10Wang T, Li W. Naive Bayes software defect prediction model [ C]//Proc of international conference on computational intel- ligence and software ensineex4ns. [ s. 1. ] : [ s. n. ],2010 : 1 - 4.

引证文献8

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部