期刊文献+

基于核K-means的增量多视图聚类算法 被引量:1

Incremental multi-view clustering algorithm based on kernel K-means
原文传递
导出
摘要 针对基于核的多视图聚类算法(kernel based multi-view clustering method,MVKKM)在处理大规模数据集时运行时间长的缺点,引入增量聚类模型的概念,将MVKKM算法与增量聚类模型相结合,提出基于核K-means的多视图增量聚类算法(incremental multi-view clustering algorithm based on kernel K-means,IMVCKM)。通过将数据集分块,在每个数据块中使用M VKKM算法聚类,并将每个数据块的聚类中心作为下个数据块的初始聚类中心。将所有块的聚类中心进行整合后再次进行多视图聚类,得到最终的聚类结果。试验结果表明,在3个大规模数据集上,IMVCKM算法相较于MVKKM算法在3个评价指标上具有更好的聚类结果,且运行时间更短。该算法在保证聚类性能的基础上大大降低算法的运行时间。 Because of the defect of long running time in the kernel based multi-view clustering algorithm( MVKKM) when dealing with large-scale datasets,the concept of incremental clustering model was introduced. The incremental multi-view clustering algorithm based on kernel K-means( IMVKKM) was proposed by combining MVKKM algorithm and incremental clustering framework.The dataset was divided into chunks and the MVKKM method was used in each data chunk to obtain a set of cluster centers,which was regarded as the initial cluster center of the next chunk. The cluster centers of all the chunks were combined and the final set of cluster result was identified by using MVKKM. The experimental results showed that IMVKKM algorithm had better clustering results and shorter running time than MVKKM algorithm on three large-scale datasets. The proposed approach could reduce the running time while keeping the clustering performance.
作者 张佩瑞 杨燕 邢焕来 喻琇瑛 ZHANG Peirui;YANG Yan;XING Huanlai;YU Xiuying(School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, Sichuan, China)
出处 《山东大学学报(工学版)》 CAS 北大核心 2018年第3期48-53,共6页 Journal of Shandong University(Engineering Science)
基金 国家自然科学基金资助项目(61572407) 国家科技支撑计划课题资助项目(2015BAH19F02)
关键词 多视图聚类 核函数 多视图核K-means 增量聚类 数据块 聚类中心 multi-view clusterting kernel function multi-view kernel K-means incremental clustering dataset chunk cluster center
  • 相关文献

参考文献4

二级参考文献64

  • 1李洁,高新波,焦李成.一种基于修正划分模糊度的聚类有效性函数[J].系统工程与电子技术,2005,27(4):723-726. 被引量:8
  • 2张惟皎,刘春煌,李芳玉.聚类质量的评价方法[J].计算机工程,2005,31(20):10-12. 被引量:60
  • 3何婷婷,徐超,李晶,赵君喆.基于种子自扩展的命名实体关系抽取方法[J].计算机工程,2006,32(21):183-184. 被引量:25
  • 4普运伟,金炜东,朱明,胡来招.核模糊C均值算法的聚类有效性研究[J].计算机科学,2007,34(2):207-210. 被引量:28
  • 5HALKIDI M, VAZIRGIANNIS M, BATISTAKIS Y. Quality scheme assessment in the clustering process [ C ]//Proc of the 4th Eur Conf Principles and Practice of Knowledge Discovery in Databases. 2000: 165-276.
  • 6THEODORIDIS S, KOUTROUBAS K. Pattern recognition[ M]. [S.l. ] :Academic Press, 1999.
  • 7HALKIDI M, BATISTAKIS Y, VAZIRGIANNIS M. On clustering validation techniques [ J ]. Intelligent Information Systems, 2001, 17 (2-3) :107-145.
  • 8HALKIDI M, VAZIRGIANNIS M. Clustering validity assessment using multi representatives[ C]//Proc of SETN Conference. 2002.
  • 9YANG Yan, KAMEL M, JIN Fan. A model of document clustering using ant colony algorithm and validity index [ C ]//Proc of IEEE International Joint Conference on Neural Networks. Montreal: [ s. n. ], 2005 : 2730- 2735.
  • 10RESSOM H, WANG D, NATARAJAN P. Adaptive double self-organizing maps for clustering gene expression profiles [ J ]. Neural Networks ,2003,16(5-6) :633-640.

共引文献140

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部