期刊文献+

两种提高决策树性能的算法研究 被引量:2

Research on algorithms of two methods to improve performance of decision trees
下载PDF
导出
摘要 为了克服用重复剪辑近邻法筛选训练样本集产生样本被误剔除进而增大决策树的判决风险和误判概率的问题,提出一种新的方法-引入拒绝阀值的重复剪辑近邻法,并从理论上分析了它降低判决风险和误判概率的原理。通过实验比较这两种方法发现,引入拒绝阀值的重复剪辑近邻法在降低判决风险和误判概率上要优于重复剪辑近邻法;而在决策树的规模和分类错误率上,重复剪辑近邻法的精度要优于引入拒绝阀值的重复剪辑近邻法。 Mistakenly removing sample thereby increasing the risk in the judgement and the probability misjudgment of the decision tree is a problem with multi-edit-nearest-neighbor algorithm in selecting training samples. For solving this problem, a new algorithm, refusing threshold multi-edit-nearest-neighbor algorithm, is proposed. The mechanism how it avoided this default is analysed. By experiments comparing to multi-edit-nearest-neighbor algorithm show that the refusing threshold multi-edit-nearest-neighbor algorithm in reducing the risk the judgement and probability misjudgment is superior to multi-edit-nearest-neighbor algorithm. However, in term of the accuracy rate between decision trees and the classification, the multi-edit-nearest-neighbor algorithm is better than the refusing threshold multiedit-nearest-neighbor algorithm.
出处 《计算机工程与设计》 CSCD 北大核心 2008年第15期3989-3990,4057,共3页 Computer Engineering and Design
关键词 数据挖掘 决策树 引入拒绝阀值的重复剪辑近邻法 重复剪辑近邻法 样本筛选 data mining decision tree refusing threshold multi-edit-nearest-neighbor algorithm multi-edit-nearest-neighboralgorithm sample selection
  • 相关文献

参考文献9

  • 1Quinlan J R. Induction of decision trees[J].Machine Leaming,1986(1):81-106.
  • 2Quinlan J R.C4.5:Programs for machine leaming[M].San Mateo, CA:Morgan Kaufmann, 1993:23-30.
  • 3叶晨洲,杨杰,姚莉秀,陈念贻.采用重复剪辑近邻法提高决策树算法的性能[J].控制与决策,2003,18(1):96-98. 被引量:4
  • 4孙即祥.现代模式识别[M].长沙:国防科技大学出版社,2001.
  • 5Carlotta Domeniconi, Jing Peng, Dimitrios Gunopulos. Locally adaptive metric nearest-neighbor classification[C].IEEE Trasactions on Pattem Analysis and Machine Intelliengce,2002,24(9): 1281-1285.
  • 6Olaru C, Wehenkel L. A complete fuzzy decision tree technique [J]. Fuzzy Sets and Systems,2003,138(2):221-254.
  • 7Chidanand Apte,Sholom Weiss.Data mining with decision trees and decision rules [J]. Future Gener Comp Syst, 1997,13 (2): 197-210.
  • 8Quinlan J R. Improved use of continuous attributes in C4.5[J]. Journal of Artificial Intelligence Research, 1996,4:77-90.
  • 9Tu P L,Chung J Y.A new decision-tree classification algorithm for machine learning[C].Proceedings of the IEEE International Conference on Tools for Artificial Intelligence.Arlington Virginia, USA: IEEE Computer Society, 1992: 370- 377.

二级参考文献1

  • 1边肇祺 张学工.模式识别(第2版)[M].北京:清华大学出版社,1999..

共引文献27

同被引文献12

引证文献2

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部