基于DSM的知识约简方法研究被引量：1

Data Reduction Based on DSM

下载PDF

导出

摘要根据对象属性的差异性与相似性 ,以及对DSM(difference similitudematrix)矩阵元素mdij,msij的特性分析 ,定义了属性的重要度和合并度 ,给出了最佳属性约简集的修正子集的求解方法 ,从而提出了基于DSM的知识约简方法 ,该方法能在保证规则相容的情况下生成少量规则 ,同时只使用部分条件属性 .通过约简UCI机器学习数据库 ,并与粗集理论约简的结果比较 ,表明了该方法的合理性和有效性 ,并在约简效率和规则的正确率上都要好于粗集理论 . By defining the significance and the uniformity of the attributes, and analyzing the elements md ij &s ij in DSM, the important principle of the optimization knowledge reduction and a new data reduction method are put forward.The method can reduce the superfluous data while preserving the consistency of classifications. This data reduction method based on DSM is employed to analyze databases from UCI reposity. Through comparing the reducing result of DSM method and Rough set theory method, it show that DSM method can obtain higher reduction rate of instances. The DSM method is effective in reducing information systems with its higher validity by using leave-one-out' to examine.

作者江昊晏蒲柳

机构地区武汉大学电子信息学院

出处《武汉大学学报（理学版）》 CAS CSCD 北大核心 2003年第3期378-382,共5页 Journal of Wuhan University:Natural Science Edition

基金国家自然科学基金资助项目 ( 90 2 0 40 0 8)

关键词 DSM 知识约简差异-相似性矩阵数据约简粗集理论 UCI机器学习数据库属性约简集 data reduction DSM (difference similitude matrix) Rough set theory UCI database

分类号 TP182 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献1

1王珏,王任,苗夺谦,郭萌,阮永韶,袁小红,赵凯.基于Rough Set理论的“数据浓缩”[J].计算机学报,1998,21(5):393-400. 被引量：239

二级参考文献4

1王珏,苗夺谦,周育健.关于Rough Set理论与应用的综述[J].模式识别与人工智能,1996,9(4):337-344. 被引量：264
2Wang J，J Comput Sci Technol，1998年，13卷，2期，189页
3周育健，硕士学位论文，1996年
4Hu X H，Comput Intell，1995年，11卷，2期，323页

共引文献238

1杨善林,刘业政,马溪骏.基于β-δ0粗糙集模型的属性约简算法[J].中国管理科学,2003,11(z1):41-45.
2孙芳,王淑礼.一种面向用户的属性约简算法[J].光盘技术,2008(1):51-52.
3刘叶玲,杜力博.基于RS-SVM在电力短期负荷预测中的应用[J].科技信息,2009(1):181-182. 被引量：2
4Dai Jian\|hua 1,2 , Li Yuan\|xiang 1,2 ,Liu Qun 3 1. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, Hubei,China 2. School of Computer, Wuhan University, Wuhan 430072, Hubei, China 3. School of Computer Science,.A Hybrid Genetic Algorithm for Reduct of Attributes in Decision System Based on Rough Set Theory[J].Wuhan University Journal of Natural Sciences,2002,7(3):285-289. 被引量：6
5苏永华,刘科伟,罗正东.基于粗糙集补齐算法与神经网络的隧道塌方预测系统[J].工业建筑,2013,43(S1):407-411. 被引量：3
6唐彬,李龙澍.关于基于分明矩阵的属性约简算法的探讨[J].计算机工程与应用,2004,40(14):184-186. 被引量：5
7叶东毅,陈昭炯.一个新的二进制可辨识矩阵及其核的计算[J].小型微型计算机系统,2004,25(6):965-967. 被引量：49
8叶东毅.信息表属性约简之间的若干关系[J].福州大学学报（自然科学版）,2004,32(4):448-450.
9徐章艳.一个基于差别矩阵思想的高效求核算法[J].计算机工程与应用,2004,40(17):74-75. 被引量：3
10刘刚,秦勇,贾利民.关系数据库中的属性约简[J].海军工程大学学报,2004,16(5):25-29. 被引量：2

同被引文献7

1YANG Y, PEDERSEN JP. A Comparative Study on Feature Selection in Text Categorization[ A]. Proceedings of the Fourteenth International Conference on Machine Learning[ C]. Tennessee, USA:Vanderbilt University, 1997.
2DUIN RPW, LOOG M, HAEB - UMBACH R. Multi - class Linear Feature Extraction by Nonlinear PCA[ A]. Proceedings of 15th International Conference on Pattern Recognition[ C]. Barcelona, Spain:IEEE Computer Science Press, 2000. 398 -401.
3NGUYEN, SON H. Scalable classification method based on rough sets[ A]. Rough Sets and Current Trends in Computing, Lecture Notes in Computer Science[ C]. PA, USA: Springer, 2002. 433-440.
4夏德麟晏蒲柳.[D].武汉:武汉大学大学电子信息学院,2001.
5ZHOU JG, XIA DL, YAN PL. Incremental Machine Learning Theorem and Algorithm Based on DSM Method[ A]. Proceedings of the Third International Conference on Machine Learning and Cybernetics[C]. Shanghai: IEEE, 2004. 2202-2207.
6AIZAWA A. The Feature Quantity: An Information Theoretic Perspective of Tfidf-like Measures[ A]. Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval[ C]. Tarrytown, NY, USA: Pergamon Press, Inc, 2000. 104 -111.
7Reuters-21578TextCategorizationCollection[ DB/OL] . http://kdd. ics. uci. edu/databases/reuters21578/reuters21578, html,2004.

引证文献1

1黄晓春,晏蒲柳,夏德麟,陈健.基于差异—相似矩阵的文本降维方法[J].计算机应用,2005,25(8):1821-1823. 被引量：1

二级引证文献1

1何秋红,余滨,李绍滋,苏松志.决策表属性约简算法及其在行人检测中的应用[J].数码设计,2019,8(9):5-12.

1WU Ming YAN Puliu.Feature Selection Based on Difference and Similitude in Data Mining[J].Wuhan University Journal of Natural Sciences,2007,12(3):467-470.
2白秀玲,王平,普杰信.一种粗糙集值约简算法及其应用[J].微计算机信息,2006,22(11X):207-209. 被引量：15
3李三乐.基于邻域粗糙集模型的属性约简算法改进[J].微计算机信息,2010,26(36):268-269. 被引量：3
4李三乐,肖政宏.基于差别矩阵的启发式属性约简算法及其应用[J].广东技术师范学院学报,2010,31(6):11-14.
5李侃,刘玉树,王蕾.一种粗糙集属性约简算法[J].计算机工程与应用,2002,38(5):15-19. 被引量：25
6姜万录,刘思远.粗糙集及主元分析的机械故障诊断研究[J].机床与液压,2009,37(12):215-218. 被引量：1
7常犁云,263.net,王国胤,263.net,吴渝,263.net.一种基于Rough Set理论的属性约简及规则提取方法[J].软件学报,1999,10(11):1206-1211. 被引量：285
8李楠,谢娟英.基于邻域粗糙集的增量特征选择[J].计算机技术与发展,2011,21(11):149-152. 被引量：7
9JiangHao YanPu-liu ChenXiao WuJing.Network Fault Diagnosis Using DSM[J].Wuhan University Journal of Natural Sciences,2004,9(1):63-67. 被引量：1
10刘军,卢炎生.一种粗集与灰理论结合算法在柴油机故障诊断系统中的应用[J].小型微型计算机系统,2010,31(4):797-800. 被引量：2

武汉大学学报（理学版）

2003年第3期

浏览历史

内容加载中请稍等...

基于DSM的知识约简方法研究被引量：1

参考文献1

二级参考文献4

共引文献238

同被引文献7

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于DSM的知识约简方法研究 被引量：1

参考文献1

二级参考文献4

共引文献238

同被引文献7

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于DSM的知识约简方法研究被引量：1