期刊文献+

竞争选择分裂属性的决策树分类模型

A Decision-Tree Classifier Model of Competition in Choosing Split Attribute
下载PDF
导出
摘要 构建决策树分类器关键是选择分裂属性。通过分析信息增益和增益比率、Gini索引、基于Goodman-Kruskal关联索引这三种选择分裂属性的标准,提出了一种改进经典决策树分类器C4.5算法的方法(竞争选择分裂属性的决策树分类模型),它综合三种选择分裂属性的标准,通过竞争机制选择最佳分裂属性。实验结果表明它在大多数情况下,使得不牺牲分类精确度而获得更小的决策树成为了可能。 The construction of decision- tree is centered on the selection algorithm of an attribute that generates a partition of the subsets of the training database that is located in the node about to be split. On the basis of analyzing three techniques for choosing the splitting attributes including the entropy gain and the gain ratio, the gini index and Goodman - Knaskal association index, propose a strategy to improve on classical decision - tree classifier C4.5 arithmetic(a decision-tree classifier model of competition in choosing split attribute). Experimental results show it is possible, in most cases, to obtain smaller decision trees without sacrificing accuracy.
作者 房立 黄泽宇
出处 《计算机技术与发展》 2006年第8期106-109,共4页 Computer Technology and Development
关键词 决策树 信息增益 增益比率 Gini索引 Goodman-Kruskal关联索引 decision-tree entropy gain gain ratio gini index Goodman-Kruskal association index
  • 相关文献

参考文献10

  • 1Han Jiawei,Kamber M.Data mining concepts and techniques[M].San Francisco:Morgan Kaufmann Publishers,2001.185-219.
  • 2Mitchell T M.Machine learning[M].New York,America:McGraw-Hill Companies,Inc,1997.112-140.
  • 3Simovici D A,Szymon J.A metric approach to building decision trees based on Goddman-Kruskal association index[A].The Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining[C].Sydney,Australia:[s.n.],2004.181-190.
  • 4Witten I H,Frank E.Data mining:practical machine learning tools and techniques with java implementations[M].Seattle:Morgan Kaufmann,2000.
  • 5Quinlan R.C4.5:Pragrams for machine learning[M].California:Morgan Kaufmann Publishers,Inc,1993.
  • 6Quinlan J R.Induction of decision trees[J].Machine Learning,1986,1(1):81-106.
  • 7Kohavi R.Scaling up the accuracy of Naive-Bayes classifiers:a decision-tree Hybrid[A].In:Simoudis E,Han J,Fayyad U M,eds.Proc of the 2nd Int'l Conf on Knowledge Discovery and Data mining[C].Menlo Park:AAAI Press,1996.202-207.
  • 8Blake C L,Merz C J.UCI Repository of machine learning databases[Z].Irvine,CA:Department of Information and Computer Science,University of California,1998.
  • 9石洪波,王志海,黄厚宽,励晓健.一种限定性的双层贝叶斯分类模型[J].软件学报,2004,15(2):193-199. 被引量:44
  • 10Friedman J H,Kohavi R,Yun Y.Lazy decision trees[A].Thirteenth National Conference on Artificial Intelligence[C].Menlo Park:AAAI Press,1996.717-724.

二级参考文献15

  • 1Friedman N,Geiger D,Goldszmidt M.Bayesian network classifiers.Machine Learning,1997,29(2-3):131-163.
  • 2Langley P,Iba W,Thompson K.An analysis of Bayesian classifiers.In:Rosenbloom P,Szolovits P,eds.Proc.of the 10th National Conf.on Artificial Intelligence.Menlo Park:AAAI Press,1992.223-228.
  • 3Kononenko I.Seminaive Bayesian classifier.In:Kodratoff Y,ed.Proc.of the 6th European Working Session on Learning.New York:Springer-Verlag,1991.206-219.
  • 4Pazzani MJ.Searching for dependencies in Bayesian classifiers.In:Fisher D,Lenz HJ,eds.Learning from Data:Artificial Intelligence and Statistics V.New York:Springer-Verlag.1996.239-248.
  • 5Langley P,Sage S.Induction of selective Bayesian classifiers.In:Mantaras RL,Poole DL,eds.Proc.of the 10th Conf.on Uncertainty in Artificial Intelligence.San Francisco:Morgan Kaufmann Publishers,1994.399-406.
  • 6Webb GI,Pazzani MJ.Adjusted probability naive Bayesian induction.In:Antoniou G,Slaney JK,eds.Proc.of the 11th Australian Joint Conf.on Artificial Intelligence.Berlin:Springer-Verlag,1998.285-295.
  • 7Kohavi R.Scaling up the accuracy of Naive-Bayes classifiers:A decision-tree hybrid.In:Simoudis E,Han J,Fayyad UM,eds.Proc.of the 2nd Int'l Conf.on Knowledge Discovery and Data Mining.Menlo Park:AAAI Press,1996.202~207.
  • 8Keogh EJ,Pazzani MJ.Learning augmented Bayesian classifiers:A comparison of distribution-based and classification-based approaches.In:Heckerman DE,Whittaker J,eds.Proc.of the Uncertainty'99:The 7th Int'l Workshop on Artificial Intelligence and Statistics.
  • 9Cheng J,Greiner R.Comparing Bayesian network classifiers.In:Laskey KB,Prade H,eds.Proc.of the 15th Conf.on Uncertainty in Artificial Intelligence.San Francisco:Morgan Kaufmann Publishers,1999.101-108.
  • 10Chickering DM,Geiger D,Heckerman D.Learning Bayesian networks is NP-complete.In:Fisher DH,Lenz HJ,eds.Learning from Data:Artificial Intelligence and Statistics V.New York:Springer-Verlag,1996.121-130.

共引文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部