期刊文献+

基于粗糙集理论的聚类融合加权迭代模型 被引量:1

Iterative weighted cluster ensemble model based on rough set theory
原文传递
导出
摘要 针对聚类融合问题,考虑了聚类成员的质量和噪声对聚类结果的影响,提出一种加权迭代的聚类融合模型,利用粗糙集理论中的决策表属性重要性的信息熵来衡量聚类成员的重要性,迭代更新聚类成员的权重。该文在模拟和真实数据集上进行了校验。结果表明,该模型能较好地处理聚类成员间的质量差异,并能有效地消减噪声对融合的影响,从而得到更好的聚类融合结果。 An iterative weighted cluster ensemble (IWCE) model was developed taking into account the qualities of the cluster members and the noise in the cluster ensemble. The model evaluates the significance of each cluster member using information measuring the attribute significance in the rough set and iteratively updates the weight values. Experiments on several synthetic and real data sets show that the model can handle different-quality cluster members and effectively lessens the effect of noise. Therefore, the model provides better ensemble results than general cluster ensemble methods.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2009年第8期1106-1108,1121,共4页 Journal of Tsinghua University(Science and Technology)
关键词 聚类融合 共生矩阵 信息熵 加权迭代模型 cluster ensemble co-association matrix information entropy iterative weighted model
  • 相关文献

参考文献7

  • 1Strehl A, Ghosh J. Cluster ensembles--A knowledge reuse framework for combining multiple partitions [J]. Journal of Machine Learning Research, 2003, 3(3) : 583 - 617.
  • 2Karypis G, Kumar V. A fast and high quality multilevel scheme for partitioning irregular graphs[J]. SIAM Journal on Scientific Computing, 1998, 20(1) : 359 - 392.
  • 3Fred A, Jain A K. Data clustering using evidence accumulation [C]// Proceedings of the 16th International Conference on Pattern Recognition (ICPR 2002). 2002, 4: 276 - 280.
  • 4Ayad H, Kamel M. Finding natural clusters using multi-clusterer combiner based on shared nearest neighbors[C]// Volume 2709 of Lecture Notes in Computer Science. Springer, 2003:166 - 175.
  • 5Ayad H, Kamel M. Refined shared nearest neighbors graph for combining multiple data clusterings, advances in intelligent data analysis[C]//Volume 2810 of Lecture Notes in Computer Science. Springer, 2003:307 - 318.
  • 6Merz C, Murphy B. UCI repository of machine learning databases [D/OLd. http: //www. ics. uei. edu/#mlearn/ mlrepository, html. 1996.
  • 7Larson B, Aone C. Fast and effective text mining using linear time document clustering [C]// Conference on Knowledge Discovery in Data, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining. San Diego, California, United States, 1999: 16-22.

同被引文献2

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部