期刊文献+

面向属性值遗漏数据决策树分类算法研究 被引量:1

Research on the Missing Attribute Value Data-oriented Decision Tree
下载PDF
导出
摘要 在已有的多种决策树测试属性选择方法中,未见将属性值遗漏数据处理集成在测试属性选择过程中的报道,而现有的属性值遗漏数据处理方法都会不同程度地带入偏置。基于此,提出了一种将基于联合熵的信息增益率作为决策树测试属性选择标准的方法,用以在生成决策树的过程中消除值遗漏数据对测试属性选择的影响。在WEKA机器平台上进行了对比实验,结果表明,改进算法能够从总体上提高算法的执行效率和分类精度。 In the existing multiple choice methods of decision tree'test attributes,can't see such report as "Let missing data processing integrated in the selection process of test attributes",however,the existing process methods of missing attribute value data could draw into bias in different degrees,based on this,proposed an information gain rate based on combination entropy as the decision tree's testing attributes selection criteria,which can eliminate missing value arrtibutes'infulence on testing attributes selection,and carry out contrast experiments on WEKA.Experiment results indicate that the improvement can significantly increase whole efficiency and classification accuracy of the algorithm operation.
出处 《计算机科学》 CSCD 北大核心 2011年第10期174-176,共3页 Computer Science
基金 国家自然科学基金(70971059) 辽宁省创新团队项目(2009T045) 辽宁省科技攻关项目(2007308003)资助
关键词 属性值遗漏数据 联合熵 决策树 Missing attribute value data Combination entropy Decision tree
  • 相关文献

参考文献9

  • 1Gustavo E A, Batista P A, Monard M C. An Analysis of Four Missing Data Treatment Methods for Supervised Learning [J]. Applied Artificial Intelligence, 2003,17 ( 5 / 6) :519-533.
  • 2Kryszkiewicz M. Rules in incomplete information systems [J]. Information Sciences, 1999,113 : 271- 292.
  • 3Mingers J. An empirical comparison of selection measures for decision-tree induction [J]. Machine Learning, 1989,3(4) : 319- 342.
  • 4Safavian S R, Landgrebe D. A Survey of Decision Tree Classifier Methodology[R]. 47907. School of Electrical Engineering, Pur due University, 1991 : 1-58.
  • 5冯少荣.决策树算法的研究与改进[J].厦门大学学报(自然科学版),2007,46(4):496-500. 被引量:67
  • 6Quinlan J R.C4. 5 : Programs for Machine Learning [S]. Morgan Kaufman, 1993.
  • 7Qian Yunhua,Liang Jiye. A new method for measuring the uncertainty in incomplete information systems[J]. International Journal of Uncertainty. Fuzziness and Knowledge-Based Systems,2008(9).
  • 8Leung Y, Li D Y. Maximal consistent block technique for rule acquisition in incomplete information systems [J].Information Science, 2003,153 : 85-106.
  • 9旷海兰,罗可,刘新华,徐雨明,王樱.一种基于粗糙集理论的组合分类器构造方法[J].计算机工程与应用,2006,42(16):167-170. 被引量:2

二级参考文献23

  • 1王正群,叶晖,孙兴华,杨静宇.模糊多分类器组合[J].小型微型计算机系统,2003,24(1):86-89. 被引量:4
  • 2Z Pawlak.Rough sets[J].International Journal of Information and Computer Science, 1982; 11 (5):341-356
  • 3Hu X H.Using Rough Sets Theory and Database Operations to Construct a Good Ensemble of Classifiers for Data Mining Applications[C], In:proceedings of the 2001 IEEE International conference on Data Ming, 2001: 233-240
  • 4Z Pawlak et al,Rough sets [J].Communications of the ACM,1995;38(11):88-95
  • 5J Jelonek et al.Rough set reduction of attributes and their domains for neural networks[J].Computational Intelligence, 1995; 11 (2):339-347
  • 6R Yasdi.Combining rough sets learning and neural learning method.to deal with uncertain and imprecise information [J].Neuro computation, 1995; (7):61-84
  • 7Y Hassan et al.Decision making using hybrid rough sets and neural networks[J].Intemational Journal of neural systems, 2002; 12(6):435-446
  • 8R Swiniarski.Rough sets methods in feature reduction and classification[J].Int J Appl Math Comput Sci,2001; 11 (3):565-582
  • 9A Verikas et al.Soft combination of neural classifiers:a comparative study[J].Pattern recognition letters,1999;20:429-444
  • 10K Woods,W P Kegelmeyer,K Bowyer.Combination of multiple classifiers using local accuracy estimates[J].IEEE Trans on pattern analysis and machine intelligence, 1997;19(4):405-410

共引文献67

同被引文献10

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部