期刊文献+

基于信息理论的合作聚类算法研究 被引量:8

Study on New Information Theory Based Cooperative Clustering Algorithm
下载PDF
导出
摘要 传统的聚类算法是针对一个独立数据集的学习分类算法,如FCM(Fuzzy-C-Means)聚类算法.在现实生活中,一个数据集独立于其它数据集,而往往通过与别的数据集交换信息与之相互合作.因此在聚类过程中,需要考虑来自其它数据集的影响,从而得到更能反映现实的数据结构.该文提出了一种基于信息理论的信息增益方法来建模并定量分析多个数据集间的合作关系.在此基础上,导出了相应的新合作聚类算法CCA(CooperativeCluste-ringAlgorithm).理论分析表明该算法最终收敛.实验结果也进一步表明了该合作聚类算法的可行性与有效性. Conventional clustering algorithms are designed for a single independent dataset, e.g.Fuzzy C-Means (FCM) clustering algorithm. In real world, a dataset is independent of other datasets but sometimes can be cooperative with others by exchanging information, such as the relationship between the subsidiary companies. So the influence from other relative collaborative datasets should be considered while performing clustering learning under such collaborative circumstances. Two different collaborative models are discussed and new proper methods are proposed to quantitatively measure such collaboration between datasets in this paper, e.g. information gain. The corresponding collaborative clustering algorithms are presented accordingly and the theoretic analysis shows that the new cooperative clustering algorithms can finally converge to local minimum. Experimental results demonstrate that the clustering structures obtained by new cooperative algorithms are different from those of conventional algorithms for the consideration of collaboration and the performances of these collaborative clustering algorithms can be much better than those conventional “single” clustering algorithms under the cooperating circumstances.
出处 《计算机学报》 EI CSCD 北大核心 2005年第8期1287-1294,共8页 Chinese Journal of Computers
基金 中法先进计划项目基金(PRASI03-02)资助
关键词 信息论 聚类 模糊 模式识别 information theory clustering fuzzy pattern recognition
  • 相关文献

参考文献20

  • 1Hopper F.. Fuzzy Cluster Analysis. Chichester: John Wiley, 1999.
  • 2Han Jia-Wei.,Kamber M.. Data Mining: Concept and Techniques. San Mateo: Morgan Kanfmann, 2001.
  • 3Bezdek J.C.. Pattern Recognition with Fuzzy Objective Function Algorithms. New York: Plenum Press, 1981.
  • 4沈红斌,王士同,吴小俊.离群模糊核聚类算法[J].软件学报,2004,15(7):1021-1029. 被引量:37
  • 5Shen Hong-Bin, Yang Jie, Wang Shi-Tong. Outlier detecting in fuzzy switching regression models. In: Proceedings of the AIMSA'04, Varna, Bulgaria, 2004, 208~215.
  • 6Wu K.L., Yang M.S.. Alternative c-means clustering algorithms. Pattern Recognition, 2002, 35(10): 2267~2278.
  • 7Sun Ying, Zhu Qiu-Ming, Chen Zheng-Xin. An iterative initial-points refinement algorithm for categorical data clustering. Pattern Recognition Letters, 2002, 23(7):875~884.
  • 8Hathaway R., Benzdek J.. Switching regression models and fuzzy clustering. IEEE Transactions on Fuzzy Systems, 1993, 1(3): 195~204.
  • 9Merz P.. Analysis of gene expression profiles: An application of memetic algorithms to the minimum sum-of-squares clustering problem. BioSystems, 2003, 72(11): 99~109.
  • 10Eppstein D.. Fast hierarchical clustering and other applications of dynamic closest pairs. In: Proceedings of the 9th Symposium Discrete Algorithms, San Francisco, 1998, 619~628.

二级参考文献1

共引文献36

同被引文献98

引证文献8

二级引证文献58

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部