期刊文献+

对聚类算法普遍存在问题的解决办法 被引量:10

Solutions to General Clustering Algorithmic Issues
下载PDF
导出
摘要 聚类广泛应用于统计、机器学习、模式识别、数据分析等领域并越来越受重视。本文研究了各种聚类算法共同面临的五个问题:聚类效果评估、类数目估计、数据预处理、样本间相似性测量、抗干扰性能,分析了对这些问题的有代表性的解决方法,总结并预测了未来聚类算法在这五个方面的研究方向。 Clustering is widely used in several fields such as statistics, machine learning, pattern recognition and numerical analysis. Recently, more and more attention has been paid to it. In this paper, five issues commonly concerned are discussed, they are: assessment of clustering results, estimation of total number of clusters, data preparation, measures of data proximity and outlier handling. Representative solutions to these issues are surveyed, conclusions are summed up, development trend of algorithms to deal with these five issues is forecasted.
出处 《电路与系统学报》 CSCD 2004年第3期92-99,共8页 Journal of Circuits and Systems
基金 国家自然科学基金资助项目(60002003)
关键词 聚类 效果评估 类数目估计 预处理 相似性测量 抗干扰性能 clustering assessment of results estimation of total number of clusters data preparation proximity measure outlier handling
  • 相关文献

参考文献68

  • 1FISHER D. Knowledge acquisition via incremental conceptual clustering [J]. Machine Learning, 1987, (2): 139-172.
  • 2HARTIGAN J. Clustering Algorithms [M]. New York: John Wiley & Sons, 1975.
  • 3HARTIGAN J, WONG M. Algorithm AS136: A k-means clustering algorithm [J]. Applied Statistics, 1979, 28: 100-108.
  • 4Kaufman L, Rousseeuw P. Finding Groups in Data: An Introduction to Cluster Analysis [M]. New York: John Wiley and Sons, 1990.
  • 5Sheikholeslami G, Chatterjee S, Zhang A. WaveCluster: A multi-resolution clustering approach for very large spatial databases [A]. Proceedings of the 24th Conference on VLDB [C]. New York, 1998, 428-439.
  • 6Barbara D, Chen P. Using the fractal dimension to cluster datasets [A]. Proceedings of the 6th ACM SIGKDD [C]. Boston, MA., 2000, 260-264.
  • 7俞蓓,王军,叶施仁.基于近邻方法的高维数据可视化聚类发现[J].计算机研究与发展,2000,37(6):714-720. 被引量:7
  • 8Kandogan E. Visualizing multi-dimensional clusters, trends and outliers using star coordinates [A]. Proceedings of the 7th ACM SIGKDD [C]. San Francisco, CA., 2001, 107-116.
  • 9沈越泓,益晓新,徐发强,李兴国.模糊聚类和模糊模式识别技术在通信设备抗干扰性能评估系统中的应用[J].电子科学学刊,2000,22(2):210-217. 被引量:11
  • 10Bezdek J C. Pattern Recognition With Fuzzy Objective Function Algorithms [M]. New York: Plenums Press, 1981, 95-107.

二级参考文献111

共引文献1142

同被引文献94

引证文献10

二级引证文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部