期刊文献+

连续属性离散化算法比较研究 被引量:20

Study on comparison of discretization algorithms of continuous attributes
下载PDF
导出
摘要 探讨了贪心及其改进算法、基于属性重要性、基于信息熵和基于聚类四类连续属性离散化算法,并通过实验验证这四类算法的离散化效果。实验结果表明,数据集离散化的效果不仅取决于使用算法,而且与数据集连续属性的分布和决策数据值的分类也有密切关系。 This paper disscussed four kinds of discretization methods which include greedy and some improved algorithms, significance of attributes, entropy of information and clustering-based algorithms. And compard the quality of the four categories of algorithms. The last experiments indicate that the quality of discretization of dataset not only lies on the algorithm, but also is closely related to distributing of continuous attributes and data of decision.
出处 《计算机应用研究》 CSCD 北大核心 2007年第9期28-30,33,共4页 Application Research of Computers
基金 国家自然科学基金(70471046) 教育部博士点基金(20040359004)
关键词 离散化 贪心算法 属性重要性 信息熵 聚类 discretization greedy algorithm significance of attributes entropy of information clustering
  • 相关文献

参考文献9

二级参考文献41

  • 1曾黄麟.粗集理论及其应用-关于数据推理的新方法 (修订版)[M].重庆:重庆大学出版社,1998.83-87.
  • 2苗夺谦.Rough Set理论及其在机器学习中的应用研究(博士学位论文)[M].北京:中国科学院自动化研究所,1997..
  • 3黄黄麟.粗集理论及其应用--关于数据推理的新方法(修订版)[M].重庆:重庆大学出版社,1998..
  • 4Nguyen S H, Skowron A. Quantization of Real Value Attributes---Rough Set and Boolean Reasoning Approach [A]. Proc of the Second Joint Conference on Information Sciences [C]. 1995.
  • 5Holte R C. Very Simple Classification Rules Performs Well on Most Commonly Used Datasets[J].Machine Learing,1993,1
  • 6Kerber R. Discretization of Numeric Attributes[A]. The 9^th International Conference on Artificial Intelligence[C]. 1992.
  • 7Fayyad U M, Irani K B. Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning [A]. 13^th IJCAI[C]. 1993.
  • 8Nguyen S H. Discretization of Real Value Attributes: A boolean reasoning approach [D]. Warsaw: Warsaw University, 1997.
  • 9Michal R Chmielewski, Jerzy W Grzymala-Bussse. Global Discretization of Continuous Attributes as Preprocessing for Machine Learning [J].International Journal of Approximate Reasoning, 1996, 15.
  • 10Quinlan, J R.C4.5 Programs for Machine learning[D]. America:Morgan-Kaufmann 1993.

共引文献364

同被引文献121

引证文献20

二级引证文献46

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部