期刊文献+

K-means聚类算法的研究 被引量:44

Research of K-means Algorithm
下载PDF
导出
摘要 为解决原始K-means算法随机选取初始聚类中心对聚类结果的影响较大的不足,提出了改进算法。采取基于采样选取聚类中心距离的规则,进行多次选择决定最终的初始聚类中心,使得改进后的算法受初始聚类中心选择的影响达到最小;同时,在选取初始聚类中心后,对初值进行数据标准化处理。将改进的K-means算法应用于销售行业,结果显示,改进后的算法比原始的算法在效率上得到了提高。 K-means clustering algorithm has a few deficiencies. For instance, the result of the algorithm is affected by initial clustering centre. In this paper an improvement was made in the selection of initial cluster center in K-means algorithm. By adopting the principle of selecting cluster center by sampling, this improved algorithm ultimately defines the initial cluster center after multiple choices, thus minimizing the effect of initial cluster center. Simultaneously, the initial data is standadized once the initial cluster center is selected. On this basis, the improved algorithm was applied to analyze the sell industry data to produce different characteristics of the sell areas. For different sell areas, enterprise can offer differentiated services or products according to the different characteristics of each area.
作者 韩晓红 胡彧
出处 《太原理工大学学报》 CAS 北大核心 2009年第3期236-239,共4页 Journal of Taiyuan University of Technology
关键词 数据挖掘 K—means算法 初始聚类中心 聚类分析 Clustering algorithm K-Means algorithm sell industry
  • 相关文献

参考文献4

二级参考文献33

  • 1余建桥,张帆.基于数据场改进的PAM聚类算法[J].计算机科学,2005,32(1):165-167. 被引量:15
  • 2[2]Jianwei Han, M Kamber. Data Mining: Concepts and Techniques. San Francisco: Morgan Kaufmann Publishers, 2000
  • 3[3]J Grabmeier, A Rudolph. Techniques of cluster algorithms in data mining. Data Mining and Knowledge Discovery, 2002, 6(4): 303~360
  • 4[4]A K Jain, M N Murty, P J Flynn. Data clustering: A review. ACM Computing Surveys, 1999, 31(3): 264~323
  • 5[5]J MacQueen. Some methods for classification and analysis of multivariate observations. In: L M Le Cam, J Neyman eds. Proc of the 5th Berkeley Symp on Mathematics, Statics and Probability, Vol 1. Berkeley: Berkeley University of California Press, 1967. 281~298
  • 6[6]J C Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. New York: Plenum Press, 1981
  • 7[7]L Kaufman, P J Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. New York: John Wiley & Sons, 1990
  • 8[8]M Ester, H P Kriegel, J Sander et al. A density-based algorithm for discovering clusters in large spatial databases with noise. In: E Simoudis, J Han, U Fayyad eds. Proc of the 2nd Int'l Conf on Knowledge Discovery and Data Mining (KDD-96). Menlo Park: AAAI Press, 1996, 226~231
  • 9[9]M Ankerst, M Breuning, H P Kriegel et al. OPTICS: Ordering points to identify the clustering structure. In: A Delis, C Faloutsos, S Ghandeharizadeh eds. Proc of the 1999 ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1999. 49~60
  • 10[10]W Wang, J Wang, R Muntz. STING: A statistical information grid approach to spatial data mining. In: M Jarke, M J Carey, K R Dittrich et al eds. Proc of the 23rd IEEE Int'l Conf on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 1997, 186~195

共引文献98

同被引文献393

引证文献44

二级引证文献248

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部