期刊文献+

基于遗传算法的数量属性离散化算法

A GA-based algorithm for discretizing values of quantitative attributes
下载PDF
导出
摘要 提出了在没有任何领域知识可供借鉴的情况下,基于聚类思想,利用遗传算法对数量型属性进行离散化的新算法———遗传C均值算法.该算法利用遗传算法具有全局寻优的特性,对训练样本根据其每一属性值进行聚类,将样本划分为不同的类,从而为每一属性找到其值的最佳分割点.然后,对不同类赋以不同的编码.该算法的优点是能得到最优的离散化结果.在VC++6.0环境下实现了该算法.仿真实验证明该方法有效解决了利用粗糙集理论进行分类规则挖掘时,数量型属性的离散化问题. A cluster-based algorithm for discretizing values of quantitative attributes is presented, called genetic-C means algorithm, which works without any experienced knowledge about special field. The algorithm takes the advantage of genetic algorithms for global optimization, and can get the best cuts for any one attribute via clustering examples. And it is implemented in VC(++)6.0. Computer simulations prove its validity for finding the best cuts for the values of quantitative attributes.
作者 谢娟英 刘芳
出处 《陕西师范大学学报(自然科学版)》 CAS CSCD 北大核心 2004年第2期28-30,共3页 Journal of Shaanxi Normal University:Natural Science Edition
关键词 遗传算法 数量属性 离散化算法 分类规则挖掘 粗糙集理论 quantitative attribute discretization genetic algorithm rough sets theory mining classification rules
  • 相关文献

参考文献11

  • 1Pawlak Z. Rough set approach to knowledge-based decision support[J]. European Journal of Operational Research,1997, 99(1) :48-57.
  • 2Pawlak Z. Rough sets and intelligent data analysis [J].Information Sciences, 2002, 147(1-4): 1-12.
  • 3Berka P, Bruha I. Discretization and grouping:preprocessing steps for data mining[Z]. Principles of Data Mining and Knowledge Discovery, 1998. 239-245.
  • 4Nguyen H S, Dougherty J , Kohavi R, et al. Supervised and unsupervised discretization of continuous features[A].In: Proceedings of the Twelfth International Conference on Machine Learning [C]. Tahoe City CA: Morgan Kaufmann, 1995. 194~202.
  • 5Liu H, Setiono R. Feature selection and discretization of numeric attributes [R]. In: Proceedings of 7th IEEE Int.1 Conference on Tools with Artificial Intelligence, 1995.191-199.
  • 6Nguyen H S, Skowron A. Quantization of real values attributes, rough set and Boolean reasoning approaches [A]. Proc. of the 2nd Joint Annual Conf. on Information Sci[C]. NC: USA Wrightsville Beach, 1995.34-37.
  • 7Nguyen H S. Discretization problem for rough sets methods[A]. Proc of the 1st Int. Conf. On Rough Sets and Current Trends in Computing (RSCTC' 98 ) [C]. RSAW:Poland, 1998. 545-552.
  • 8Nguyen H S. Some efficient algorithms for Rough set methods[A]. Proc of the Conf. of Information Processing and Management of Uncertainty in Knowledge Based Systems[C]. Kluwer: Granada Spain, 1996.1 451- 1456.
  • 9侯利娟,王国胤,聂能,吴渝.粗糙集理论中的离散化问题[J].计算机科学,2000,27(12):89-94. 被引量:104
  • 10周明 孙树栋.遗传算法原理及应用[M].国防工业出版社,2001..

二级参考文献7

共引文献166

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部