期刊文献+

基于Python的K-means算法实现方式对比研究 被引量:3

Research of K-means Algorithm Programming Based on Python
下载PDF
导出
摘要 大数据时代的到来使Python语言受到越来越多的关注。在国际上,IEEE颁布的顶级编程语言交互排行榜中,Python已连续多年名列榜首,在国内,Python已经进入义务教育阶段小学课程。Python以其可读性强、使用范围广受到越来越多计算机使用人员的欢迎。Python在数据处理方面光彩夺目的表现得益于和其他过程控制语言的巨大不同,本文以经典K-means算法的实现为切入点,通过不同的编程方式实现同样的聚类过程,在UCI和生成数据集上分别运行不同程序,发现采用Numpy数据处理库可以显著提升程序运行效率,减少运行时间,展现出Python向量式数据计算的巨大优势。 With the advent of the big data era,python language has attracted more and more attention.Internationally,in the top programming language interaction ranking released by IEEE,python has been ranked first for many years.In China,python has entered primary school.Python is widely used by more and more computer users because of its readability.However,Python's advantages in data processing are also shown out of the huge differences of other process control languages.This paper takes the implementation of the classic k-means algorithm as an examples,program the same clustering process by different programming methods,run the program on the UCI and generating data set respectively,we found that using numpy data processing library can significantly improve the running efficiency of the program and reduce the running time,so then show the huge advantages of Python vector data computing.
作者 王习涛 WANG Xi-tao(Statistics Bureau Data Management Center of Henan Province,Henan Zhengzhou 410018)
出处 《软件》 2020年第8期87-88,128,共3页 Software
关键词 PYTHON K-MEANS Numpy 聚类 Python K-means Numpy Cluster
  • 相关文献

参考文献5

二级参考文献54

  • 1郝占刚,王正欧.基于遗传算法和k-medoids算法的聚类新算法[J].现代图书情报技术,2006(5):44-46. 被引量:5
  • 2Tan Pang-Ning,Steinbach M,Kuma V.Introduction to DataMining[M].北京:人民邮电出版社,2006:5-28.
  • 3Hand D J,Vinciotti V.Choosing k for two-class nearest neighbor classifiers with unbalance classes[J].Pattern Recognition Letter,2003,24(9):1555-1562.
  • 4Cuba S,Rastogi R,Shim K.CURE:An efficient clustering algorithm for large databases[C]//In:Hass L M,Tiwary A.Proc.of the ACM SIGMOD Int'1 Conf.on Management of Data.New York:ACM Press,1998:73-84.
  • 5Harmer P K,Williams P D,Gunsch G H.An Artificial Immune System Architecture for Computer Security Applications[J].IEEE Transactions on Evolutionary Computation,2002,6(3):252-280.
  • 6Yang M S,Hu Y J,Lin K C R,et al.Segmenttation techniques for tissue differentiation in MRI of ophthalmology using fuzzy clustering algorithm[J].Magnetic Resonance Imaging,2002(20):173-179.
  • 7Han Jiawei, Micheline Kamber. Data mining concepts and techniques[M].北京:机械工业出版社,2006.
  • 8徐克圣,王澜.一种自动获得k值的聚类算法[J].大连交通大学学报,2007,28(4):68-71. 被引量:3
  • 9孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1070
  • 10雷小锋,谢昆青,林帆,夏征义.一种基于K-Means局部最优性的高效聚类算法[J].软件学报,2008,19(7):1683-1692. 被引量:113

共引文献729

同被引文献10

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部