期刊文献+

基于粒化可拓决策的属性约简算法研究 被引量:3

Research on Attribute Reduction Algorithm Based on Granulation Extension Decision
下载PDF
导出
摘要 针对传统属性约简算法利用等价关系计算过程烦琐、运行时间较长的问题,定义能体现属性间相关程度的绝对关联度,提出一种基于粒化可拓决策的属性约简算法。首先,利用K-means聚类算法,对原始数据集进行粒化,得到各簇中心;其次,运用可拓决策理论确定经典域、节域和待评物元,通过计算各簇中属性之间的关联度构建指示矩阵,并计算各属性的指示值;最后,根据指示值,从大到小依次选择属性,实现样本集属性约简。实验结果表明,算法运算速度较快,约简后对数据集分类精度影响小,部分数据集分类精度有所提升,验证了算法的有效性。 Aiming to deal with the cumbersome calculation process and long running time of traditional attribute reduction algorithms using equivalence relations,an absolute correlation degree that could reflect the degree of correlation between attributes was defined,and an attribute reduction algorithm based on granulation extension decision was proposed.Firstly,K-means clustering algorithm was used to granulate the original data set to obtain the centers of each cluster.Secondly,the extension decision theory was used to determine the classical domain,the node domain and the matter to be evaluated.The indicator matrix was constructed by calculating the correlation degree between the attributes in each cluster,and the indicator value of each attribute was calculated.Finally,according to the indicated value,the attributes were selected in order from the largest to the smallest to realize the attribute reduction of the sample set.The experimental results showed that the algorithm had a faster operation speed.The reduction had small impact on the classification accuracy of the data set,while the classification accuracy of some data set had been improved,which verified the effectiveness of the algorithm.
作者 王君宇 杨亚锋 赵佳亮 代琪 李丽红 WANG Junyu;YANG Yafeng;ZHAO Jialiang;DAI Qi;LI Lihong(College of Science,North China University of Science and Technology,Tangshan 063210,China;Hebei Province Key Laboratory of Data Science and Application,Tangshan 063210,China;Tangshan Key Laboratory of Engineering Computing,Tangshan 063210,China;Department of Automation,China University of Petroleum,Beijing 102249,China)
出处 《郑州大学学报(理学版)》 CAS 北大核心 2022年第5期72-81,共10页 Journal of Zhengzhou University:Natural Science Edition
基金 河北省数据科学与应用重点实验室项目(10120201) 唐山市数据科学重点实验室项目(10120301)。
关键词 属性约简 粒化 K-MEANS聚类 可拓决策 关联度 attribute reduction granulation K-means clustering extension decision correlation degree
  • 相关文献

参考文献17

二级参考文献178

共引文献405

同被引文献11

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部