期刊文献+

决策树分类准确率极限的研究 被引量:2

Research on Classification Accuracy Limit of Decision Tree
下载PDF
导出
摘要 采用最大分类树作为分析经验风险与结构风险的工具,对决策树分类准确率极限进行了研究。针对决策树模型的分类效果难以客观评价的问题,讨论了决策树分类准确率极限的存在条件,给出了求出该极限的方法。以最大分类树作为分析工具,提出了在经验风险和结构风险4种分布条件下分类准确率极限是否存在的4个定理,并从机器学习理论和工程建模实践2个角度进行了讨论。实验验证了该理论的正确性。 Taking maximum classification tree as a tool to analyze empirical risk and structural risk, this paper addresses the problem of classification accuracy limit of decision tree. Aiming at the difficulty to estimate the classification effectiveness of decision tree externally, it discusses the existence condition of classification accuracy limit and presents the method to get it. It points out four theorems which demonstrate the existence of classification accuracy limit under four distribution conditions of empirical risk and structural risk with analysis from machine learning theory and practical modeling. The theorems are validated from experiments on ten public datasets.
出处 《计算机工程》 CAS CSCD 北大核心 2007年第10期222-224,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60432010)
关键词 决策树 分类准确率 极限 经验风险 结构风险 Decision tree Classification accuracy Limit Empirical risk Structural risk
  • 相关文献

参考文献4

  • 1Hall M A,Holms G.Benchmarking Attribute Selection Techniques for Discrete Class Data Mining[J].IEEE Transactions on Knowledge and Data Engineering,2003,15(6):1437-1447.
  • 2Peng Y,Flach P.Soft Discretization to Enhance the Continuous Decision Tree Induction[C]//Proc.of ECML/PKDD'01.2001:109-118.
  • 3Dong M,Kothari R.Classifiability Based Pruning of Decision Trees[C]//Proceedings of International Joint Conference on Neural Networks,Washington,D.C..2001:1739-1743.
  • 4Zhao Huimin,Sinha A P.An Efficient Algorithm for Generating Generalized Decision Forests[J].IEEE Transactions on Systems,Man and Cybernetics(Part A),2005,35(5):754-762.

同被引文献20

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部