摘要
根据对象属性的差异性与相似性 ,以及对DSM(difference similitudematrix)矩阵元素mdij,msij的特性分析 ,定义了属性的重要度和合并度 ,给出了最佳属性约简集的修正子集的求解方法 ,从而提出了基于DSM的知识约简方法 ,该方法能在保证规则相容的情况下生成少量规则 ,同时只使用部分条件属性 .通过约简UCI机器学习数据库 ,并与粗集理论约简的结果比较 ,表明了该方法的合理性和有效性 ,并在约简效率和规则的正确率上都要好于粗集理论 .
By defining the significance and the uniformity of the attributes, and analyzing the elements md ij &s ij in DSM, the important principle of the optimization knowledge reduction and a new data reduction method are put forward.The method can reduce the superfluous data while preserving the consistency of classifications. This data reduction method based on DSM is employed to analyze databases from UCI reposity. Through comparing the reducing result of DSM method and Rough set theory method, it show that DSM method can obtain higher reduction rate of instances. The DSM method is effective in reducing information systems with its higher validity by using leave-one-out' to examine.
出处
《武汉大学学报(理学版)》
CAS
CSCD
北大核心
2003年第3期378-382,共5页
Journal of Wuhan University:Natural Science Edition
基金
国家自然科学基金资助项目 ( 90 2 0 40 0 8)