摘要
随着Web文档数量的剧增,搜索引擎也暴露了许多问题,用户不得不在搜索引擎返回的大量文档摘要列表中查找。而对搜索引擎结果聚类能使用户在更高的主题层次上来查看搜索引擎返回的结果。该文提出了搜索引擎结果聚类的几个重要指标并给出了一个新的基于PAT-tree的搜索引擎结果聚类算法。
Web search engines have become increasingly ineffective as the number of document on the Web is proliferated. Users of Web search engines are often forced to shift through the long ordered list of document 'snippets' returned by the engines. Clustering search engine results help users more quickly and efficiently to navigate the results of a query at a more topical level. The paper articulates the key requirements for document clustering of search engine results and proposes a PAT-tree-based approach to cluster Chinese search engine results.
出处
《计算机工程》
CAS
CSCD
北大核心
2004年第5期95-97,共3页
Computer Engineering
基金
黑龙江省自然科学基金资助项目(F01-06)