Adaptive Time-Smoothed Evolutionary Spectral Clustering
Abstract: Traditional clustering algorithms are generally suited only to static data, whereas real-world data are often large and change over time, so static clustering algorithms cannot analyze or learn the evolution patterns of dynamic data. Clustering evolutionary data must, on the one hand, produce a reasonable cluster partition of the data at each snapshot and, on the other hand, keep the clustering results as smooth as possible as they evolve. This paper proposes an adaptive time-smoothed evolutionary clustering framework that takes into account the unknown relationship between the current data and the historical data. By limiting the backtracking range to a time window, it adaptively finds the historical snapshot most relevant to the current snapshot. It then fuses a static similarity based on the Itakura-Saito distance with a dynamic similarity based on time series to compute the similarity matrix for each snapshot. Under this framework, the paper further proposes two adaptive time-smoothed evolutionary spectral clustering algorithms, which define the temporal cost from different perspectives and therefore yield different evolutionary clustering results. Experiments on real datasets show that both algorithms make effective use of historical data and achieve higher clustering accuracy as well as better temporal smoothness.
Authors: HE Ping; JIANG Yulin; XU Xiaohua; LIN Huihui; GE Fangyi; FANG Wei; REN Xiang (College of Information Engineering, Yangzhou University, Yangzhou 225009, China)
Source: Journal of Nanjing University of Aeronautics & Astronautics (indexed in CAS, CSCD, and the Peking University Core Journals list), 2021, No. 5, pp. 700-707 (8 pages)
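Funding: Supported by the National Natural Science Foundation of China (61402395) and the Natural Science Foundation of Jiangsu Province (BK20201430, BK20151314, BK20140492).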
Keywords: evolutionary data; time smoothness; Bregman divergence; spectral clustering
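The pipeline outlined in the abstract (an Itakura-Saito-based similarity, a windowed search for the most relevant historical snapshot, and a spectral partition of the smoothed similarity matrix) can be illustrated with a minimal sketch. This is not the paper's algorithm: the exponential kernel, the Frobenius-norm relevance criterion, the linear blending weight `alpha`, and all function names below are assumptions made for illustration only, since the record does not give the actual formulas or the paper's two temporal-cost definitions.

```python
import numpy as np

def itakura_saito(p, q):
    """Itakura-Saito distance between two strictly positive vectors."""
    r = np.asarray(p, dtype=float) / np.asarray(q, dtype=float)
    return float(np.sum(r - np.log(r) - 1.0))

def static_similarity(X, sigma=1.0):
    """Similarity matrix from the symmetrized IS distance.

    Assumption: an exponential kernel exp(-d / sigma); the paper's
    exact mapping from distance to similarity is not given here.
    """
    n = len(X)
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            d = itakura_saito(X[i], X[j]) + itakura_saito(X[j], X[i])
            W[i, j] = np.exp(-d / sigma)
    return W

def most_relevant_snapshot(W_now, history, window):
    """Among the last `window` similarity matrices, return the one
    closest to W_now in Frobenius norm (a hypothetical criterion)."""
    candidates = history[-window:]
    dists = [np.linalg.norm(W_now - W_old) for W_old in candidates]
    return candidates[int(np.argmin(dists))]

def smoothed_similarity(W_now, history, window=3, alpha=0.3):
    """Blend the current similarity with the selected historical one.

    Assumption: a simple convex combination with weight `alpha`.
    """
    if not history:
        return W_now
    W_hist = most_relevant_snapshot(W_now, history, window)
    return (1.0 - alpha) * W_now + alpha * W_hist

def spectral_bipartition(W):
    """Two-way spectral partition: sign of the second-smallest
    eigenvector of the symmetric normalized Laplacian."""
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    _, vecs = np.linalg.eigh(L)  # eigenvalues in ascending order
    return (vecs[:, 1] > 0).astype(int)
```

For a sequence of snapshots, one would compute `static_similarity` at each time step, pass the list of earlier matrices as `history`, and partition the result of `smoothed_similarity`. Extending the bipartition to k clusters would typically mean running k-means on the first k eigenvectors.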