期刊文献+

引入信息增益的层次聚类算法 被引量:6

New hierarchical clustering method using information gain
下载PDF
导出
摘要 层次聚类分析是模式识别和数据挖掘领域中一个非常重要的研究课题,具有广泛的应用前景。受决策树学习中选择最佳分类属性的启发,提出一种引入信息增益的层次聚类方法,该方法利用信息增益指导层次聚类中的属性加权,从而提高聚类结果质量。在UCI数据集上的实验结果表明,该算法性能明显优于原层次聚类算法。 Hierarchical clustering analysis is a very important subject in the fields of pattern recognition and data mining, and has a broad application prospect. Inspired by the idea of selecting the best classification attributes in decision tree algorithm, a novel hierarchical clustering algorithm using information gain is proposed. This algorithm directs the attribute weighting in a hierarchical clustering by computing the information gains, thereby improving the quality of clustering results. The experiment results on UCI machine learning data sets indicate that it yields better stability compared with the quondam hierarchical clustering algorithm.
出处 《计算机工程与应用》 CSCD 2012年第1期142-144,共3页 Computer Engineering and Applications
基金 山东省科技研究计划项目(No.ZR2010FM021 2010G0020115 2008B0026) 山东省教育厅科研项目(No.J09LG02)
关键词 层次聚类 信息增益 属性加权 hierarchical clustering information gain attribute weighting
  • 相关文献

参考文献10

  • 1Theodoridis S, Koutroumbas K.Pattem recognition[M].[S.l.]: Academic Press,2006.
  • 2Xu R.Survey of clustering algorithms[J].IEEE Transactions on Neural Networks 12005,16 (3) : 645-678.
  • 3Xu R, Wunsch D.Clustering[M].New York:IEEE Press,2009: 18-23.
  • 4Ng A Y, Jordan M I, Weiss Y..On spectral clustering: analysis and an algorithm[C]//Advances in Neural International Processing Systems, 2001 : 849-856.
  • 5Zhang T, Ramakrishnan R, Eivny M.BIRCH: an efficient data clustering method for very large databases[C]//Proc of ACM SIGMOD International Conference on Management of Data.Canada:ACM Press, 1996: 103-114.
  • 6Guha S,Rastogi R, Shim K.CURE:an efficient clustering algorithm of large databases[C]//Proc of ACM SIGMOD International Conference on Management of Data.Seattle: ACM Press, 1998: 73-84.
  • 7Karypis G, Han E,Kumar V.CHAMELEON:a hierarchical clustering algorithm using dynamic modeling[J].IEEE Computer, 1999,27(3) :329-341.
  • 8Topchy A, Law M H, Jain A K, et al.Analysis of consensus partition in cluster ensemble[C]//Proc of the 5th IEEE International Conference on Data Mining,2004:225-232.
  • 9Mitchell T M.Machine learning[M].[S.l.]: McGraw-Hill Science/ Engineering/Math, 1997: 40-45.
  • 10Blake C, Keogh E,Merz C J.UCI repository of machine learning databases[EB/OL].(2008).http ://www.ics.uci.edu/-mlearn/MLRepository, html.

同被引文献58

引证文献6

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部