期刊文献+

多变量决策树的构造算法研究

Study of Building Multivariate Decision Tree
下载PDF
导出
摘要 应用粗糙集理论,提出了一种新的多变量决策树构造算法.该算法以核相对于决策类的泛化来划分样本集,如果所划分子集的样本存在不一致决策类并且未用于划分的属性为空时,试探着分别把该子集和一致性子集合并,计算各合并子集的条件类对决策类的确定性程度,选择确定性程度大的作为同一子集,并用一致性子集的类标号进行标示.和苗夺谦提出的多变量决策树算法比较,本算法充分考虑了训练集中的噪声数据,允许在构造决策树的过程中划入正域的实例类别存在一定的不一致性,可简化生成的决策树,提高决策树的泛化能力. In this paper, a new method to build multivariate decision trees based on Rough Set is proposed. The relative generalization of core with respect to decision attributes is used for devising to the decision table, if a certain inconsistency exists in the subset of examples and there are not attributes, then the inconsistency subset merged with the consistency subsets, respectively. And the measures of certainty of condition equivalence relation with respect to decision equivalence relation are accounted respectively, the great result of the combined subset is got, and the label of the consistency subset to label the node. Compared with the algorithm of Literature E51, noisy data of training sets are considered. A certain inconsistency is allowed to exist in examples of the positive regions, so the method can simplify the decision trees and improve its extensive ability.
作者 常志玲
出处 《洛阳师范学院学报》 2011年第5期74-77,共4页 Journal of Luoyang Normal University
基金 河南省教育厅自然科学研究计划项目(200913520021) 河南省科技发展计划项目(102102210472)
关键词 多变量决策树 粗糙集 相对泛化 确定性程度 multivariate decision tree rough set generalization measure of certainty
  • 相关文献

参考文献8

二级参考文献23

  • 1倪春鹏,王正欧.一种新型决策树属性选择标准[J].武汉科技大学学报,2004,27(4):437-440. 被引量:10
  • 2沈晨鸣.决策树分类算法研究[J].盐城工学院学报(自然科学版),2005,18(4):22-24. 被引量:8
  • 3关晓蔷,刘煜伟.一种基于粗糙集的决策树构造方法[J].科技情报开发与经济,2006,16(13):136-138. 被引量:4
  • 4Moore AW, Zuev D. Internet traffic classification using Bayesian analysis techniques. In: Proc. of the 2005 ACM SIGMETRICS Int'l Conf. on Measurement and Modeling of Computer Systems, Banff, 2005. 50-60. http://www.cl.cam.ac.uk/-awm22 /publications/moore2005internet.pdf.
  • 5Madhukar A, Williamson C. A longitudinal study of P2P traffic classification. In: Proc. of the 14th IEEE Int'l Syrup. on Modeling, Analysis, and Simulation. Monterey, 2006. http://ieeexplore.ieee.org/xpl/ffeeabs_all.jsp?arnumber=1698549.
  • 6Moore AW, Papagiannaki K. Toward the accurate identification of network applications. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005.41-54.
  • 7Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark. In: Proc. of the ACM SIGCOMM. Philadelphia, 2005. 229-240. http://conferences.sigcomm.org/sigcomm/2005/paper-KarPap.pdf.
  • 8Roughan M, Sen S, Spatscheck O, Dutfield N. Class-of-Service mapping for QoS: A statistical signature-based approach to IP traffic classification. In: Proc. of the ACM SIGCOMM Internet Measurement Conf. Taormina, 2004. 135-148. http://www.imconf.net/imc-2004/papers/p 135-roughan.pdf.
  • 9Zuev D, Moore AW. Traffic classification using a statistical approach. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005. 321-324.
  • 10Nguyen T, Armitage G. Training on multiple sub-flows to optimise the use of Machine Learning classifiers in real-world IP networks. In: Proc. of the 31 st IEEE LCN 2006. Tampa, 2006. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4116573.

共引文献374

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部