
A Decision Tree Optimization Model Based on a Genetic Algorithm (cited by: 1)

Published English title: The Model of Decision Tree Based on Genetic Algorithm
Abstract: Building on an analysis of the C4.5 algorithm, this paper discusses the algorithm's shortcomings in controlling decision tree size, selecting attributes, filtering noise, and removing irrelevant attributes, and argues that attribute reduction on the training data is necessary in decision tree mining. From a practical standpoint, it proposes a decision tree construction model based on attribute reduction that uses a genetic algorithm to carry out the optimization search, and it designs a fitness function for this model. The model is adaptive: adjusting the parameters of the fitness function constrains the search direction of the genetic algorithm and thereby optimizes the decision tree. Experiments show that after optimization the number of training-set attributes used decreases, classification accuracy improves to some degree, and the set of classification rules becomes smaller, so the model has some practical value.
Source: Computer Technology and Development (《计算机技术与发展》), 2007, No. 3, pp. 116-118 (3 pages)
Keywords: decision tree; attribute reduction; genetic algorithm; fitness function
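
The abstract describes a genetic algorithm that searches over attribute subsets, guided by a fitness function whose parameters steer the search, but this record does not give the function itself. The sketch below is one plausible reading and not the authors' formulation: it assumes the fitness is a weighted combination of classification accuracy, the fraction of attributes kept, and tree size. The names `fitness`, `evolve`, `toy_evaluate`, the weights `w_acc`, `w_attr`, `w_size`, and the selection, crossover, and mutation operators are all illustrative assumptions. In practice `evaluate` would train and validate a C4.5 tree on the selected attributes; here a toy stand-in is used so the example runs on its own.

```python
import random


def fitness(mask, evaluate, w_acc=1.0, w_attr=0.2, w_size=0.2, max_size=100):
    """Score an attribute subset; higher is better (assumed form, not from the paper).

    mask     -- tuple of 0/1 flags, one per attribute (1 = attribute kept)
    evaluate -- hypothetical callback: mask -> (accuracy in [0, 1], tree size)
    The weights are the tunable parameters that steer the search.
    """
    if not any(mask):
        return float("-inf")  # no attributes selected, so no tree can be built
    accuracy, tree_size = evaluate(mask)
    attr_ratio = sum(mask) / len(mask)           # fraction of attributes kept
    size_ratio = min(tree_size / max_size, 1.0)  # tree size, capped at 1
    return w_acc * accuracy - w_attr * attr_ratio - w_size * size_ratio


def evolve(n_attrs, evaluate, pop_size=20, generations=30,
           p_cross=0.8, p_mut=0.05, seed=0, **weights):
    """Simple generational GA over attribute bitmasks, with elitism."""
    rng = random.Random(seed)

    def score(mask):
        return fitness(mask, evaluate, **weights)

    pop = [tuple(rng.randint(0, 1) for _ in range(n_attrs))
           for _ in range(pop_size)]
    best = max(pop, key=score)
    for _ in range(generations):
        nxt = [best]  # elitism: carry the best individual forward unchanged
        while len(nxt) < pop_size:
            # Binary tournament selection for each parent.
            a = max(rng.sample(pop, 2), key=score)
            b = max(rng.sample(pop, 2), key=score)
            # One-point crossover.
            if n_attrs > 1 and rng.random() < p_cross:
                cut = rng.randrange(1, n_attrs)
                a = a[:cut] + b[cut:]
            # Bit-flip mutation.
            nxt.append(tuple(bit ^ (rng.random() < p_mut) for bit in a))
        pop = nxt
        best = max(pop, key=score)
    return best


def toy_evaluate(mask):
    """Toy stand-in for training a tree: attributes 0 and 1 help, the rest are noise."""
    accuracy = 0.5 + 0.2 * mask[0] + 0.2 * mask[1] - 0.01 * sum(mask[2:])
    tree_size = 5 * sum(mask)
    return accuracy, tree_size


if __name__ == "__main__":
    best = evolve(6, toy_evaluate)
    print(best, fitness(best, toy_evaluate))
```

Raising `w_attr` or `w_size` relative to `w_acc` pushes the search toward smaller attribute sets and smaller trees at some cost in accuracy, which is one way to read the abstract's statement that the fitness parameters constrain the direction of the search.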

