期刊文献+

一种基于数据包含度的自动聚类算法 被引量:1

An Automatic Clustering Algorithm Based on Data Contained Ratio
下载PDF
导出
摘要 聚类分析是机器学习和模式识别领域的一个重要问题,聚类算法常用于解决这类问题。针对传统聚类算法运算量大、不适应任意分布数据聚类的不足,提出了一种基于数据包含度的自动聚类算法。该算法引入数据包含度的概念,能够自动确定聚类个数和聚类中心,并进一步采用跟随策略实现聚类。多组数据的实验验证了自动聚类算法的有效性。对不同分布的数据进行了自动聚类算法与K-means聚类算法的聚类结果比较,实验结果表明自动聚类算法具有很好的聚类性能。 Cluster analysis is an important issue for machine learning and pattern recognition. Clustering algorithm is usually used in solving these problems. A novel automatic clustering algorithm is developed based on data contained ratio. In automatic clustering algorithm which is presented in this paper,the concept of data contained ratio is proposed,the cluster number can be determined automatically based on the data contained ratio,and the relative cluster centers are found similarly Several groups data are used to testify and demonstrate the validity and effectiveness of the cluster algorithm. In addition,the comparison between the traditional K-means cluster algorithm and automatic cluster algorithm is processed. The results demonstrate that the automatic cluster algorithm has high performance in clustering random distribution data set.
作者 马云红 王成汗 江腾蛟 张堃 Ma Yunhong Wang Chenghan Jiang Tengjiao Zhang Kun(School of Electronics and Information, Northwestern Polytechnic University, Xi'an 710072, China)
出处 《西北工业大学学报》 EI CAS CSCD 北大核心 2016年第5期863-866,共4页 Journal of Northwestern Polytechnical University
基金 西北工业大学研究生创意创新种子基金(G2015KY0407) 国家自然科学基金青年基金项目(61401363)资助
关键词 聚类算法 数据包含度 数据局部密度 clustering algorithm data contained ratio data local density
  • 相关文献

参考文献2

二级参考文献27

  • 1刘高军,朱嬿.基于数据挖掘技术的建筑企业信用评价[J].中国矿业大学学报,2005,34(4):494-499. 被引量:21
  • 2王玲,薄列峰,焦李成.密度敏感的谱聚类[J].电子学报,2007,35(8):1577-1581. 被引量:61
  • 3ESTER M,KRIEGEL H,SANDER J,et al.A density-based algorithm for discovering clusters in large spatial databases with noise[C]// Proc of the 1996 2nd Int'l Conf on Knowledge Discovery and Data Mining.Portland:AAAI Press,1996:226-231.
  • 4ESTER M,KRIEGEL H P,SANDER J,et al.Incremental clustering for mining in a data warehousing environment[C]// Proceedings of the Twenty-Fourth Very Large Data Bases(VLDB) Conference.New York:[s.n.],1998:323-333.
  • 5JIN R,YANG G,AGRAWAL G,et al.Shared memory parallelization of data mining algorithms:Techniques,programming interface,and performance[J].IEEE Transactions on Knowledge and Data Engineering,2005,17(1):71-89.
  • 6SHI Yong,SONG Yu-qing,ZHANG Ai-dong.A shrinking-based clustering approach for multidimensional data[J].IEEE Transactions on Knowledge and Data Engineering,2005,17(10):1389-1403.
  • 7CRISTIANINI N, SHAWE-TAYLOR J, KANDOLA J. Spectral kernel methods for clustering[A]. Proceedings of the 13th Advances in Neu- ral Information Processing Systems(NIPS 2001)[C]. Vancouver, Brit- ish Columbia, Canada, 2001. 649-655.
  • 8NG A Y, JORDAN M I, WEISS Y. On spectral clustering: analysis and an algorithm[A]. Proceedings of the 14th Advances in Neural Informa- tion Processing Systems(NIPS 2002)[C]. Vancouver, British Columbia, Canada, 2002. 849-856.
  • 9DUCOURNAU A, BRETI'O A, RITAL S, et al. A reductive approach to hypergraph clustering: an application to image segmentation[J]. Pattern Recognition, 2012, 45(7):2788-2803.
  • 10ZHANG T P, TANG Y Y, .FANG B, et al. Document clustering incorrelation similarity measure space[J]. IEEE Transactions on Knowl- edge and Data Engineering, 2012, 24(6): 1002-1013.

共引文献99

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部