期刊文献+

一种基于频繁概念集的文本聚类方法

A Text Clustering Method Based on Frequent Concept-Sets
下载PDF
导出
摘要 针对传统文本表示模型的不足以及文本向量的"高维诅咒"问题,本文提出一种基于频繁概念集的文本聚类方法(CFC)。该方法利用HowNet将文本中的关键词映射为概念,然后使用Apriori算法找出概念文本集中的频繁特征项,我们称之为频繁概念,最后利用CFC算法实现文本聚类。实验表明,较传统的基于频繁特征项的同类方法,该方法能获得更好的聚类效果。
出处 《计算机系统应用》 2009年第5期81-84,共4页 Computer Systems & Applications
  • 相关文献

参考文献6

  • 1刘远超,王晓龙,徐志明,关毅.文档聚类综述[J].中文信息学报,2006,20(3):55-62. 被引量:65
  • 2Liu YC, Wang XL, Wu C. ConSOM: A conceptional self-organizing map model for text. Clustering Neuroc- omputing, 2008(71):857 - 862.
  • 3Hotho A, Staab S, Stumme G. Ontologies improve text document clustering. Proceedings of the 3rd IEEE International Conference on Data Mining, 2003:541 - 544.
  • 4Li Y J, Chung SM, Holt J. Text Document Clustering Based on Frequent Word Meaning Sequences. Data and Knowledge Engineering, 2008, 64(1):381 - 404.
  • 5Fung BCM, Wang K, Ester M. Hierarchical document clustering using frequent itemsets. Proceedings of SIAM Internatio'nal Conference on Data Mining, 2003.
  • 6Bellare M, Rogaway P. The game-playing technique. Cryptology ePrint Archive Report. 2004. http://eprint. iacr.org/.

二级参考文献39

  • 1陈浩,何婷婷,姬东鸿.基于k-means聚类的无导词义消歧[J].中文信息学报,2005,19(4):10-16. 被引量:16
  • 2Regina Barzilay,Min-Yen Kan,and Kathleen R.McKeown.Simfinder:A Flexible Clustering Tool for Summarization[A].In proceedings of the Workshop on Summarization in NAACL 01[C].Pittsburg,Pennsylvania,USA:June 2001.
  • 3Zheng Chen,Wei-Ying Ma,Jinwen Ma.Learning to Cluster Web Search Results[A].In:proceedings of the 27th Annual International ACM SIGIR Conference[C].Sheffield,South Yorkshire,UK,July 2004,210 -217.
  • 4Y.C.Fang,S.Parthasarathy,F.Schwartz.Using Clustering to Boost Text Classification[J].In:proceedings of the IEEE ICDM Workshop on Text Mining,Maebashi City,Japan,2002.
  • 5A.Rauber,and M.Frühwirth.Automatically Analyzing and Organizing Music Archives[A].In:proceedings of the 5.European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2001)[C].Darmstadt,Germany,2001.
  • 6Cutting,D.,Karger,D.,and etc.Scatter/Gather:A Cluster-based Approach to Browsing Large Document Collections[A].SIGIR ‘ 92,1992[C].318-329.
  • 7JR Wen,JY Nie,HJ Zhang.Clustering User Queries of a Search Engine[A].The Tenth International World Wide Web Conference[C].Hong Kong.May 1 -5,2001.
  • 8Anton Leuski and James Allan.Improving Interactive Retrieval by Combining Ranked Lists and Clustering[A].In:proceedings of RIAO2000[C].Paris,France,April 12-14,2000,665 -681.
  • 9Anton V.Leouski and W.Bruce Croft.An Evaluation of Techniques for Clustering Search Results[A].Technical Report IR-76,Department of Computer Science,University of Massachusetts,Amherst,1996.
  • 10Htttp://www.cs.washington.edu/research/clustering.

共引文献64

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部