期刊文献+

搜索引擎中的聚类浏览技术 被引量:9

Clustering Method of Web Search Results
下载PDF
导出
摘要 搜索引擎大多以文档列表的形式将搜索结果显示给用户,随着Web文档数量的剧增,使得用户查找相关信息变得越来越困难,一种解决方法是对搜索结果进行聚类提高其可浏览性。搜索引擎的聚类浏览技术能使用户在更高的主题层次上查看搜索结果,方便地找到感兴趣的信息。本文介绍了搜索引擎的聚类浏览技术对聚类算法的基本要求及其分类方法,研究分析了主要聚类算法及其改进方法的特点,讨论了对聚类质量的评价,最后指出了聚类浏览技术的发展趋势。 Most search engines return ranked lists of document snippets, which makes the user difficult to find the relevant information. One method is that the snippets returned by the search engine are grouped into clusters, which may help the user quickly and efficiently navigate the results of a query at a topic level and locate the relevant information. This paper first introduces, several key requirements for Web research results clustering methods and the classification of the clustering methods. Then it probes into the major clustering algorithms and their improved method at present, and discusses the evaluation of,clustering quality. Finally, this paper summarizes the future developments of clustering search engine results.
出处 《中文信息学报》 CSCD 北大核心 2008年第3期56-63,共8页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60603098)
关键词 计算机应用 中文信息处理 搜索引擎 文档聚类 信息检索 聚类标识 computer application Chinese information processing search engine document clustering information retrieval cluster label
  • 相关文献

参考文献41

  • 1Pretschner A, Gauch S. Ontology Based Personalized Search[A]. In: Proceedings of the Eleventh IEEE International Conference on Tools with Artificial Intelligence[C]. 1999:391-398.
  • 2Jansen B J, Spink A, Bateman J, Saracevic T. Real Life Information Retrieval: A Study of User Queries on the Web[J]. ACM SIGIR Forum, 1998, 32(1) : 5- 17.
  • 3Franzen K, Karlgren J. Verbosity and Interface Design[A]. Technical Report T2000:04, Swedish Institute of Computer Science(SICS)[C]. 2000.
  • 4Chen H, Dumais S. Bringing Order to the Web:Automatically Categorizing Search Results [A]. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems[C]. New York: ACM Press, 2000. 145-152.
  • 5Kules B, Kustanowitz J, Shneiderman B. Categorizing Web Search Results into Meaningful and Stable Categories Using Fast-Feature Techniques [A]. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries [C]. New York: ACM Press, 2006. 210-219.
  • 6Cui H, Zalane O R. Hierarchical Structural Approach to Improving the Browsability of Web Search Engine Results[J]. IEEE, 2001,956-960.
  • 7Griffiths A, Luchhurst H, Willett P. Using Inter-Document Similarity Information in Document Retrieval Systems[J]. Journal of the American Society for Information Sciences, 1986,37:3-11.
  • 8Salton G. The SMART Retrieval Systems[M]. Prentice Hall, Englewood Cliffs, N.J.,1971.
  • 9Zamir O, Etzioni O. Grouper: A Dynamic Clustering Iinterface to Web Search Results[A]. In:Proceedings of the Eighth International World Wide Web Conference(WWWS) [C]. 1999.
  • 10Zamir O, Etzioni O. Web Document Clustering: A Feasibility Demonstration[A]. In: Proceeding of the 19^th International ACM SIGIR Conference on Research and Development of Information Retrieval (SIGIR'98)[C]. New York: ACMPress, 1998, 46-54.

二级参考文献49

  • 1陈浩,何婷婷,姬东鸿.基于k-means聚类的无导词义消歧[J].中文信息学报,2005,19(4):10-16. 被引量:16
  • 2[1]Zamir O, Etzioni O. Web Document Clustering : A Feasibility Demos- tration. SIGIR'98,1998
  • 3[2]Zamir O, Etzioni O. Grouper: A Dynamic Clustering Interface to Web Search Results. In: Proceedings of 8th WWW Conference, Toronto Canada, 1999
  • 4JiaweiHan MiehelineKamber.数据挖掘:概念与技术[M].机械工业出版社,2001.237-251.
  • 5Regina Barzilay,Min-Yen Kan,and Kathleen R.McKeown.Simfinder:A Flexible Clustering Tool for Summarization[A].In proceedings of the Workshop on Summarization in NAACL 01[C].Pittsburg,Pennsylvania,USA:June 2001.
  • 6Zheng Chen,Wei-Ying Ma,Jinwen Ma.Learning to Cluster Web Search Results[A].In:proceedings of the 27th Annual International ACM SIGIR Conference[C].Sheffield,South Yorkshire,UK,July 2004,210 -217.
  • 7Y.C.Fang,S.Parthasarathy,F.Schwartz.Using Clustering to Boost Text Classification[J].In:proceedings of the IEEE ICDM Workshop on Text Mining,Maebashi City,Japan,2002.
  • 8A.Rauber,and M.Frühwirth.Automatically Analyzing and Organizing Music Archives[A].In:proceedings of the 5.European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2001)[C].Darmstadt,Germany,2001.
  • 9Cutting,D.,Karger,D.,and etc.Scatter/Gather:A Cluster-based Approach to Browsing Large Document Collections[A].SIGIR ‘ 92,1992[C].318-329.
  • 10JR Wen,JY Nie,HJ Zhang.Clustering User Queries of a Search Engine[A].The Tenth International World Wide Web Conference[C].Hong Kong.May 1 -5,2001.

共引文献81

同被引文献152

引证文献9

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部