Abstract
Current discretization algorithms for rough sets are classified and discussed, with a focus on the theoretical basis and implementation steps of the discretization algorithm based on information entropy. The application of this algorithm to the same attribute across different sample data sets is then analyzed. Experimental results show that the algorithm is data-sensitive for some attributes, and that choosing these attributes as the basis for decisions can affect the decision-making ability of the system.
Source
《上海工程技术大学学报》
CAS
2010, Issue 3, pp. 240-244 (5 pages)
Journal of Shanghai University of Engineering Science
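Funding
Scientific Research Innovation Project of the Shanghai Municipal Education Commission (09YZ370)
Research Fund Project of Shanghai University of Engineering Science (校启07-22)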
Keywords
rough set
entropy
discretization
information gain