期刊文献+

一种改进的CURE的事件聚类方法

An event clustering approach on the improved CURE algorithm
下载PDF
导出
摘要 一个文档往往包含多个主题的事件,把分散在多个文本中的同一主题事件组织起来依靠传统的文本聚类是无法实现的.本文通过对已有的CURE算法进行分析,根据事件的特征,对代表点的选取和小类合并机制进行改进,实现了一个改进的CURE算法.实验结果表明:改进后的方法在保证执行效率的情况下取得了更好的聚类效果. A document commonly contains many events with different topics, so it' s really hard for tradition- al clustering algorithms to organize such events with the same topic in multi - documents. Through the analy- sis of the feature of traditional CURE algorithm, and according to the feature of the events. This paper pro- poses an improved CURE algorithm that improved the selecting of representative points and clusters nesting mechanism. The experimental results show that our approach can provide better performance than that of other methods.
出处 《重庆文理学院学报(社会科学版)》 2015年第5期121-124,共4页 Journal of Chongqing University of Arts and Sciences(Social Sciences Edition)
基金 安徽省级质量工程项目(2013TSZY088)
关键词 层次聚类 CURE 代表点 事件聚类 hierarchical clustering CURE representative points event clustering
  • 相关文献

参考文献6

二级参考文献17

  • 1穗志方 俞士汶.基于骨架依存树的语句相似度计算模型[A]..中文信息处理国际会议(ICCIP''98)[C].,1998..
  • 2[1]Han Jiawei,Kamber Micheline . Data Mining Concepts and Techniques . New York: Academic Press, 2001
  • 3[2]Guha S, Rastogi R, Shim K. Cure: An Efficient Clustering Algorithm for Large Database. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data,Seattle,Washington, 1998. 73-84
  • 4闪四清译.数据挖掘概念、模型、方法和算法[M].清华大学出版社,2003..
  • 5Guha S,Rastogi R,Shim K. CURE:an efficient clustering algorithm for large databases. In:Proc. of 1998 ACM-SIGMOD Int.Conf. Management of Data, 1998
  • 6Karypis G, Han E-H, Kumar V. CHAMELEON: a hierarchical clustering algorithm using dynamic modeling. COMPUTER,1999,32:68~75
  • 7Karypis G, Kumar V. Mulitilevel k-way hypergraph partitioning.In:Proc. of the Design and Automation Conf. ,1999
  • 8Jarvis R A, Patrick E A. Clustering using a similarity measure based on shared nearest neighbors. IEEE Trans on Computers,1973,C-22(11)
  • 9O'Callaghan L, Mishra N, Meyerson A, Guha S, Motwani R.Streaming-Data algorithms for high-quality clustering. In: 18th Intl. Conf. on Data Engineering, 2002
  • 10Ng R T,Han Jiawei. Efficient and effective clustering methods for spatial data mining. In:Proc. of the 20thVLDB Conf. Santiago,Chile, 1994

共引文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部