期刊文献+

一种优化初始中心点的K-means算法 被引量:139

A K-means Algorithm Based on Optimized Initial Center Points
原文传递
导出
摘要 针对K-means算法所存在的问题,提出一种优化初始中心点的算法.采用密度敏感的相似性度量来计算对象的密度,启发式地生成样本初始中心.然后设计一种评价函数——均衡化函数,并以均衡化函数为准则自动生成聚类数目.与传统算法相比,本文算法可得到较高质量的初始中心和较稳定的聚类结果.实验结果表明该算法的有效性和可行性. Aiming at the problems of K-means algorithm, a method is proposed to optimize the initial center points through computing the density of objects. Thus, the initial center of the samples can be built in a heuristic way. Then, a new evaluation function is proposed, namely equalization function, and consequently the cluster number is generated automatically. Compared with the traditional algorithms, the proposed algorithm can get initial centers with higher quality and steadier cluster results. Experimental results show the effectiveness and feasibility of the proposed algorithm.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2009年第2期299-304,共6页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金资助项目(No.60775037)
关键词 K-MEANS算法 密度 初始中心点 均衡化函数 K-means Algorithm, Density, Initial Center Point, Equalization Function
  • 相关文献

参考文献13

  • 1Han J, Kamber M. Data Mining Concepts and Techniques. Orlando, USA: Morgan Kaufmann Publishers, 2001
  • 2史忠植.知识发现[M].北京:清华大学出版社,2004.137-140.
  • 3Huang J Z, Ng M K, Rang Hongqiang, et al. Automated Variable Weighting in K-means Type Clustering. IEEE Trans on Pattern Analysis and Machine Intelligence, 2005, 27 (5) : 657 - 668
  • 4Dhillon I S, Guan Yuqiang, Kogan J. Refining Clusters in High Dimensional Text Data//Proc of the 2nd SIAM Workshop on Clustering High Dimensional Data. Arlington, USA, 2002 : 59 - 66
  • 5Zhang B. Generalized K-Harmonic Means: Dynamic Weighting of Data in Unsupervised Learning//Proc of the 1 st SIAM International Conference on Data Mining. Chicago, USA, 2001 : 1 - 13
  • 6杨风召,朱扬勇.一种有效的量化交易数据相似性搜索方法[J].计算机研究与发展,2004,41(2):361-368. 被引量:26
  • 7Sarafis I, Zalzala A M S, Trinder P W. A Genetic Rule-Based Data Clustering Toolkit//Proc of the Congress on Evolutionary Computation. Honolulu, USA, 2002 : 1238 - 1243
  • 8Ma J, Perkins S. Time-Series Novelty Detection Using One-Class Support Vector Machines// Proc of the International Joint Conference on Neural Networks. Portland, USA, 2003, Ⅲ: 1741 - 1745
  • 9Kaufman L,Rousseeuw P J. Finding Groups in Data: An Introduction to Cluster Analysis. New York, USA: John Wiley & Sons, 1990
  • 10钱线,黄萱菁,吴立德.初始化K-means的谱方法[J].自动化学报,2007,33(4):342-346. 被引量:32

二级参考文献45

  • 1李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:113
  • 2A Guttman. R-Tree: A dynamic index structure for spatial searching. The ACM SIGMOD Int'l Conf on Management of Data, Boston, MA, 1984
  • 3T Sellis, N Roussopoulos, C Faloutsos. The R+ tree: A dynamic index for multidimensional objects. The 13th Int'l Conf on Very Large Data Bases, Brighton, England, 1987
  • 4N Beckman, H-P Kriegel, R Schneider et al. The R*-tree: An efficient and robust method for points and rectangles. The ACM SIGMOD Int'l Conf on Management of Data, Atlantic City, NJ, 1990
  • 5N Katayama, S Satoh. The SR-tree: An index structure for high dimensional nearest neighbor queries. The ACM SIGMOD Int'l Conf on Management of Data, Tucson, Arizona, USA, 1997
  • 6S Berchtold, D Keim, H-P Kriegel. The X-tree: An index structure for high-dimensional data. The 22nd Int'l Conf on Very Large Data Bases, Bombay, India, 1996
  • 7S Berchtold, C Bhm, H V Jagadish et al. Independent quantization: An index compression technique for high-dimensional data spaces. The 16th Int'l Conf on Data Engineering, San Diego, California, USA, 2000
  • 8Y Sakurai, M Yoshikawa, S Uemura et al. The A-tree: An index structure for high-dimensional spaces using relative approximation. The 26th Int'l Conf on Very Large Data Bases, Cairo, Egypt, 2000
  • 9R Weber, H J Scheck, S Blott. A quantitative analysis and performance study for similarity search methods in high dimensional spaces. The 24th Int'l Conf on Very Large Data Bases, New York City, New York, USA, 1998
  • 10K Beyer, J Goldstein, R Ramakrishnan et al. When is nearest neighbors meaningful? The 7th Int'l Conf on Database Theory, Jerusalem, Israel, 1999

共引文献152

同被引文献1035

引证文献139

二级引证文献1116

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部