期刊文献+

一种基于ReliefF特征加权的R-NIC算法 被引量:6

A R-NIC Algorithm Based on Relief F Feature Weighting
下载PDF
导出
摘要 非参数信息理论聚类(NIC)算法通过计算数据点与簇间的互信息来实现聚类,利用无参估计法计算集群平均熵,从而降低人为参与的成本,但该算法假定待分析样本的所有特征对分类具有相同的贡献,与目前已有的研究结果相悖。为此,提出一种特征加权的R-NIC算法,该算法考虑各维特征对模式分类的不同影响,使用Relief F对特征进行加权变换,抑制冗余特征,加强有效特征,利用NIC算法在变换后的特征空间中进行聚类以提高聚类效果。在UCI数据集上的实验结果表明,该算法具有较高的聚类性能,聚类效果优于NIC算法。 Nonparametric Information theoretic Clustering( NIC) utilizes a non-parametric estimation of the average cluster entropies to maximize the estimated mutual information betw een data points and clusters,w hich effectively reduces the cost of participation. How ever,the algorithm assumes that all features of the sample to be analyzed plays a uniform contribution in the process of cluster analysis. Obviously,the hypothesis is inconsistent w ith a lot of practices. Therefore,this paper proposes a novel non-parametric feature w eighting clustering algorithm based on Relief F,w hich is named RNIC,to consider of different feature. It adopts Relief F to transform and w eighting features,R-NIC can inhibit redundant features,improves the clustering results by clustering in the transformed feature space. Experimental results on UCI datasets show that the performance of the proposed R-NIC algorithm is superior to the NIC algorithm.
出处 《计算机工程》 CAS CSCD 北大核心 2015年第4期161-165,共5页 Computer Engineering
基金 国家自然科学基金资助项目"多变量IB方法及算法的研究"(61170223) 国家自然科学基金联合基金资助项目"可扩展迁移学习中跨媒体复杂问题自动映射研究"(U1204610)
关键词 非监督 聚类 互信息 非参数信息理论聚类算法 准确率 特征加权 unsupervised clustering mutual information Nonparametric Information theoretic Clustering(NIC) algorithm accuracy feature w eighting
  • 相关文献

参考文献12

  • 1樊晓平,盛荣芬,廖志芳,刘丽敏.一种选择性加权聚类融合算法[J].计算机工程与应用,2012,48(22):195-200. 被引量:3
  • 2何清.模糊聚类分析理论与应用研究进展[J].模糊系统与数学,1998,12(2):89-94. 被引量:113
  • 3Faivishevsky L, Goldberger J. A Nonparametric Info- rmation Theoretic Clustering Algorithm [ C ]//Proc- eedings of the 27th International Conference on Machine Learning. Washington D. C., USA: IEEE Press,2010: 351-358.
  • 4李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:113
  • 5Faivishevsky L,Goldberger J. ICA Based on a Smooth Estimation of the Differential Entropy I C ]//Proceedings of Advances in Neural Information Processing Systems. Washington D. C. , USA : IEEE Press,2008:433-440.
  • 6Kozachenko L F, Leonenko N N, Sample Estimate of Entropy of a Random Vector [ J ]. Problems of Information Transmission, 1987,23 ( 2 ) : 9-16.
  • 7Kimura M, Sugiyama M. Dependence-maximization Clustering with Least-square Mutual Information [ J ]. Journal of Advance Computational Intelligence and Intelligent Informatics, 2011,15 ( 7 ) : 800-805.
  • 8Spola6r N, Cherman E A, Monard M C, et al. A Comparison of Multi-label Feature Selection Methods Using the Problem Transformation Approach [ J ]. Electronic Notes in Theoretical Computer Science,2013, 292:135-151.
  • 9Kira K,Rendell L A. A Practical Approach to Feature Selection [ C ]//Proceedings of the 9th International Workshop on Machine Learning. San Francisco, USA: Morgan Kaufmann Publishers Inc:1992:249-256.
  • 10Lim S C, Jang S J, Lee S P. Music Genre/mood Classification Using a Feature-based Modulation Spectrum [ C ]//Proceedings of International Conference on Mobile IT Convergence. Washington D. C., USA: IEEE Computer Society ,2011 : 133-136.

二级参考文献35

  • 1阳琳贇,王文渊.聚类融合方法综述[J].计算机应用研究,2005,22(12):8-10. 被引量:28
  • 2Zhexue Huang,Michael K Ng.A fuzzy k-modes algorithm for clustering categorical data[J].IEEE Trans on Fuzzy Systems,August,1999,7(4):446-452.
  • 3Zhexue Huang.A fast clustering algorithm to cluster very large categorical data sets in data mining[A].Proceedings of the SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery[C].USA:ACM Press,1997.1-8.
  • 4Kononenko I.Estimating attributes:Analysis and extensions of Relief[A].Proceedings of the 7th European Conference on Machine Learning[C].Berlin:Springer,1994.171-182.
  • 5Kira K,Rendell L A.A practical approach to feature selection[A].Proceedings of the 9th International Workshop on Machine Leaning[C].San Francisco,CA:Morgan Kaufmann,1992.249-256.
  • 6Duda R O,Hart P E.Pattern classification and scene analysis[M].New York:John Wiley & Sons,1973.89-91.
  • 7Hathaway R J,Bezdek J C.Nerf C-means:Non-Euclidean relation fuzzy clustering[J].Pattern recognition,1994,27(3):429-437.
  • 8Michalski R S,Stepp R E.Automated construction of classifications:Conceptual clustering versus numerical taxonomy[J].IEEE PAMI,1983,5:396-410.
  • 9Jollois F X,Nadif M.Clustering large categorical data[A].Advances in Knowledge Discovery and Data Mining[C].Heidelberg:Springer-Verlag,2002.257-263.
  • 10Fred A L.Finding consistent clusters in data partitions[C]// Volume 2096 of Lecture Notes in Computer Science: Proceeding of the 2nd International Workshop on Multi- ple Classifier Systems. [S.1.] : Springer, 2001 : 309-318.

共引文献260

同被引文献55

引证文献6

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部