摘要
连续属性的离散化是粗糙集理论亟待解决的关键问题之一。基于灰色系统和粗糙集的有关理论,提出了一种新的基于属性重要性的离散化算法。该算法以条件属性对决策属性的灰色关联度来度量条件属性的重要性,在保证决策表原始分类能力不变的前提下,按照属性重要性由小到大的顺序对每个条件属性的侯选断点进行考察,将冗余的断点去掉,从而将条件属性离散化。同时给出了该算法的时间复杂度分析,并通过实例分析验证了算法的有效性和实用性。
Discretization of continuous attributes is always one of the key problems that need urgent solutions in rough sets theory. Based on theory of grey system and rough sets, a new discretization algorithm of continuous attributes in decision table is offered. In this algorithm, the grey correlation degree of condition attributes for decision attribute is used to measure the importance of condition attributes ; on the premise of keeping the stability of original decision table, all break points of every condition attributes are examined and the redundant ones are eliminated according to which the condition attributes are sorted in a descending order, and then the decision table is discretized. The time complexity of the algorithm is proposed and an example is investigated to verify its validity and practicability.
出处
《重庆邮电大学学报(自然科学版)》
2007年第4期409-412,共4页
Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition)
基金
安徽省教育厅自然科学基金项目(2005KJ094)
皖南医学院中青年科研基金项目(WK200730)
关键词
灰色关联度
粗糙集
连续属性
数据离散化
grey correlation degree
rough sets
continuous attributes
data discretization