期刊文献+

基于邻域信息熵度量数值属性快速约简算法 被引量:7

An effective continuous attributes reduction algorithm based on neighborhood entropy-based measurement
下载PDF
导出
摘要 阐述邻域粗糙集和邻域信息熵的基本定义及性质,为避免数值属性信息系统属性约简过程中,属性离散化造成特征信息的丢失,提出一种新的基于邻域信息熵度量数值属性约简算法。扩展邻域信息系统核属性集生成约简属性集,邻域信息熵度量不仅关注约简属性集正域变化,而且考察负域样本空间约简属性邻域等价类在决策属性划分的分布,具备更好的邻域关系度量细粒度。实验表明,对比邻域粗糙集近似度量、邻域有效信息率度量、邻域软间隔度量的属性约简方法,该算法能有效进行邻域信息系统属性约简的同时,也保持了约简属性集更好的分类精度。 The paper elaborates the basic definitions and properties of neighborhood rough sets and neighborhood entropy. To avoid losing feature information caused by diseretization of continuous attri- butions while reducing attributions, we present a new algorithm of continuous attributions reduction based on neighborhood entropy-based measurement. In the process of expending from core attribute sets to the reduction of attribute sets in neighborhood information system (NIS), neighborhood entropybased measurement is not only concerned with the positive field change of the reduction of attribute sets, but examines the distribution characteristics of the neighborhood equivalence classes of sample space in negative field in the decision attribute partition, which possess the finer granularity in the measurement of neighborhood relationship. Experimental results with UCI standard datasets show that compared with those attributions reduction algorithms based on neighborhood approximation measurement, neighborhood effective information ratio measurement, and neighborhood soft margin measurement, the proposed algorithm can effectively reduce continuous attributions in NIS, and at the same time, it maintains better classification accuracy of the reduction of attribute sets.
机构地区 中南大学商学院
出处 《计算机工程与科学》 CSCD 北大核心 2016年第2期350-355,共6页 Computer Engineering & Science
基金 国家自然科学基金委创新群体项目(70921001) 中国移动通信集团业务支撑重点联合研发项目(2014_LH_21)
关键词 属性约简 邻域信息熵度量 核属性 邻域信息系统 负域样本空间 分类精度 attribute reduction neighborhood entropy-based measurement core attribute neighborhood information system sample space in negative field classification accuracy
  • 相关文献

参考文献2

二级参考文献28

  • 1徐章艳,刘作鹏,杨炳儒,宋威.一个复杂度为max(O(|C||U|),O(|C^2|U/C|))的快速属性约简算法[J].计算机学报,2006,29(3):391-399. 被引量:234
  • 2Wilson D R, Martinez T R. Improved Heterogeneous Distance Functions. Journal of Artificial Intelligence Research, 1997, 6( 1 ) : 1 - 34
  • 3Hu Qinghua, Yu Daren, Xie Zongxia. Neighborhood Classifiers. Expert Systems with Applications: An International Journal, 2008, 34 (2) : 866 - 876
  • 4LIU H,YU L. Toward integrating feature selection algorithms for classification and clustering[J].IEEE Transactions on Knowledge and Data Engineering,2005,(04):491-502.doi:10.1109/TKDE.2005.66.
  • 5GUYON I,ELISSEEFF A. An introduction to variable and feature selection[J].The Journal of Machine Learning Research,2003,(7/8):1157-1182.
  • 6MITRA P,MURTHY C,PAL S. Unsupervised feature selection using feature similarity[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,(03):301-312.
  • 7DASH M,LIU H. Consistency-based search in feature selection[J].Artificial Intelligence,2003,(1/2):155-176.
  • 8DASH M,CHOI K,SCHEUERMANN P. Feature selection for clustering:a filter solution[A].Piscataway,NJ,USA:IEEE,2002.115-122.
  • 9HO T,BASU M. Complexity measures of supervised classification problems[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,(03):289-300.
  • 10CHING J,WONG A,CHAN K. Class-dependent discretization for inductive learning from continuous and mixed-mode data[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1995,(07):641-651.

共引文献82

同被引文献81

引证文献7

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部