期刊文献+

层次聚类的簇集成方法研究 被引量:11

Research on cluster ensembles methods based on hierarchical clustering
下载PDF
导出
摘要 聚类集成比单个聚类方法具有更高的鲁棒性和精确性,它主要由两部分组成,即个体成员的产生和结果的融合。针对聚类集成,首先用k-means聚类算法得到个体成员,然后使用层次聚类中的单连接法、全连接法与平均连接法进行融合。为了评价聚类集成方法的性能,实验中使用了ARI(Adjusted Rand Index)。实验结果表明,平均连接法的聚类集成性能优于单连接法和全连接法。研究并讨论了融合方法的聚类正确率和集成规模的关系。 Cluster ensembles method is considered as a robust and accurate alternative to single clustering runs.It mainly consists of both generation of individual member and fusion methods.In this paper,the cluster ensembles are studied where individual members are obtained based on k-means clustering algorithm and fusion method of hierarchical clustering is used. Three consensus functions, which are single linkage, complete linkage and average linkage, respectively, is studied and discussed in hierarchical clustering fusion.For evaluating performance of cluster ensembles,Adjusted Rand Index is considered. Experimental results show that performance of cluster ensembles with the average linkage is superior to one with single linkage and complete linkage.Moreover, the relationship between accuracy and ensemble size of the three fusion methods is also studied and discussed.
作者 李凯 王兰
出处 《计算机工程与应用》 CSCD 北大核心 2010年第27期120-123,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60773062 河北省自然科学基金No.F2009000236~~
关键词 聚类集成 融合函数 聚类 ARI cluster ensembles consensus functions clustering Adjusted Rand Index(ARI)
  • 相关文献

参考文献9

  • 1Strehl A,Ghosh J.Claster ensembles-A knowledge reuse framework for combining multiple partitions[J].The Journal of Machine Learning Research,2003,3(3):583-617.
  • 2Hadjitodorov S T,Kuncheva L I,Todorova L P.Moderate diversity for better cluster ensembles[J].Information Fusion,2005,7(3):264-275.
  • 3Hubert L,Arabie P.Comparing partitions[J].Journal of Classification,1985,2(1):193-218.
  • 4Hu X,Yoo I.Cluster ensemble and its applications in gene expression analysis[C]//Chen Y P P.Proc 2nd Asia-Pacific Bioinformatics Conference,Dunedin,New Zealand,2004:297-302.
  • 5Topchy A,Jain A,Punch W.Combining multiple weak clusterings[C]//Proc Third IEEE International Conference on Data Mining,Melbourne Florida,2003:331-338.
  • 6Minaei B,Topchy A,Punch W.Ensembles of partitions via data Resampling[C]//Proceedings of the International Conference on Information Technology on Coding and Computing,Las Vegas,NV,2004,2:188-192.
  • 7Kuncheva L I.Evaluation of stability of K-means cluster ensembias with respect to random initialization[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28 (11):1798-1808.
  • 8Rand W M.Objective criteria for the evaluation of clustering methods[J].Journal of the American Statistical Association,1971,66(336):846-850.
  • 9Topchy A,Jain A K,Punch W.A mixture model for clustering ensembles[C]//Proceedings of SIAM Conference on Data Mining,2004:379-390.

同被引文献76

引证文献11

二级引证文献85

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部