期刊文献+

利用广义后缀树的最大相似度优先聚类方法

Maximum Similarity Priority Clustering Method Based on Generalized Suffix Tree
下载PDF
导出
摘要 本文提出了利用后缀树模抽的最大相似度优先聚类方法,通过构造文档集的广义后缀树模型抽取短语作为特征项并映射到M维向量空间模型;计算文档间的相似度矩阵,对任意两个文档之间的相似度进行降序排列,优先合并具备最大相似度的文档对形成初始聚类;合并初始聚类得到最终聚类结果。 A novel clustering method called Maximum Similarity Priority Clustering based on generalized suffix tree is proposed.Each phrase extracted from generalized suffix tree of documents collection is regarded as a unique feature term in vector space model.Similarities matrix is computed and the similarities are sorted in descend order.Then,according to maximum similarity priority,documents pairs are merged into initial clusters which can be merged into final clusters.
作者 蒋程 张建武
出处 《中国科技信息》 2013年第3期89-91,共3页 China Science and Technology Information
基金 重庆市科委(编号cstc2012gg-yyjsB40006)
关键词 聚类方法 后缀树 最大相似度 向量空间模型 clustering algorithms suffix tree maximum similarity vector space model
  • 相关文献

参考文献3

二级参考文献17

  • 1张敏,马少平,宋睿华.DF还是IDF?主特征模型在Web信息检索中的使用[J].软件学报,2005,16(5):1012-1020. 被引量:13
  • 2刘远超,王晓龙,刘秉权.一种改进的k-means文档聚类初值选择算法[J].高技术通讯,2006,16(1):11-15. 被引量:23
  • 3吴文丽,刘玉树,赵基海.一种新的混合聚类算法[J].系统仿真学报,2007,19(1):16-18. 被引量:18
  • 4Manning C D,Raghavan P, Schiitze H.An introduction to information retrieval[M].Cambridge, England: Cambridge University Press, 2009 : 349-400.
  • 5Huang J Z, Ng M K, Rong H, et al.Automated variable weighting in K-means type clustering[J].IEEE Transactions on Pattem Analysis and Machine Intelligence,2005,27(5):657-668.
  • 6Chim H, Deng Xiao-tie.Efficient phrase-based document similarity for clustering[J].IEEE Transactions on Knowledge and Data Engineering,2008,20(9) : 1217-1229.
  • 7Zamir O, Etzioni O, Madani O, et al.Fast and intuitive clustering of Web documents[C]//Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, 1997: 287-290.
  • 8Zamir O,Etzioni O.Web document clustering:A feasibility demonstration[C]//Proceedings of the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998 : 46-54.
  • 9Ukkonen E.On-line construction of suffix trees[J].Algorithmica, 1995,14(3) :249-260.
  • 10Wang Jian-hua,Li Rui-xu.A new cluster merging algorithm of suffix tree clustering[J].Intelligent Information Processing III, 2007: 197-203.

共引文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部