
An Improved Rank Merging Algorithm for Meta Search (一种改进的元搜索排序合成算法)
Abstract To improve the precision of meta search engines, an improved algorithm for merging meta search results is proposed. First, the text information contained in the search result lists is analyzed, and both the completeness of the match between a result and the query and the relevance of the result are considered, which yields a normalization method for text analysis. The relevance scores of documents are then computed by combining this text analysis with the ranks assigned by the search engines, and the local similarities are adjusted accordingly. Next, based on a performance evaluation of the component search engines, an improved shadow document method is presented to estimate the relevance scores of non-relevant documents. Finally, a merging method based on group decision making is used to produce a consistent ranking of the search results. Tests in a real Web environment show that the proposed algorithm returns more relevant search results than existing merging algorithms.
Source Journal of South China University of Technology (Natural Science Edition) (《华南理工大学学报(自然科学版)》), 2008, No. 9, pp. 48-51 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding National Natural Science Foundation of China (60603098)
Keywords information retrieval; meta search; search-result merging; text analysis; group decision making
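
The abstract names the stages of the algorithm but gives no formulas, so the following is only a rough Python sketch of how such a pipeline might fit together, not the authors' method. Everything specific in it is an assumption made for illustration: the term-overlap measure in `text_score`, the linear rank score, the mixing weight `alpha`, the `shadow_factor` used for documents an engine did not return, and the weighted average that stands in for the group decision step. The per-engine weights are assumed to come from some separate performance evaluation.

```python
from typing import Dict, List, Tuple

Result = Tuple[str, str]  # (url, title-plus-snippet text); hypothetical record shape


def text_score(query: str, text: str) -> float:
    """Fraction of distinct query terms that appear in the result text.

    Assumed stand-in for the paper's text analysis, which is not specified.
    """
    terms = set(query.lower().split())
    if not terms:
        return 0.0
    words = set(text.lower().split())
    return len(terms & words) / len(terms)


def local_scores(query: str, results: List[Result], alpha: float = 0.5) -> Dict[str, float]:
    """Score one engine's list by mixing a rank score with the text score."""
    n = len(results)
    scores: Dict[str, float] = {}
    for pos, (url, text) in enumerate(results):
        rank_score = 1.0 - pos / n  # first result gets 1.0, later ones less
        scores[url] = alpha * rank_score + (1 - alpha) * text_score(query, text)
    return scores


def merge(
    query: str,
    engine_results: Dict[str, List[Result]],
    engine_weights: Dict[str, float],
    shadow_factor: float = 0.5,
    alpha: float = 0.5,
) -> List[str]:
    """Merge several engines' lists into one ranking of URLs.

    A document missing from an engine's list receives a "shadow" score:
    shadow_factor times its mean score in the lists that do contain it
    (an assumed variant of the shadow document idea).
    """
    per_engine = {
        name: local_scores(query, results, alpha)
        for name, results in engine_results.items()
    }
    total_weight = sum(engine_weights[name] for name in per_engine)
    if total_weight <= 0:
        raise ValueError("engine weights must sum to a positive value")

    all_urls = {url for scores in per_engine.values() for url in scores}
    merged: Dict[str, float] = {}
    for url in all_urls:
        seen = [scores[url] for scores in per_engine.values() if url in scores]
        shadow = shadow_factor * sum(seen) / len(seen)
        # Weighted average across engines, used here in place of the
        # paper's group decision making step.
        merged[url] = sum(
            engine_weights[name] * scores.get(url, shadow)
            for name, scores in per_engine.items()
        ) / total_weight

    # Highest merged score first; ties broken by URL for determinism.
    return sorted(merged, key=lambda u: (-merged[u], u))


if __name__ == "__main__":
    results = {
        "A": [("u1", "meta search engine"), ("u2", "cooking recipes")],
        "B": [("u1", "meta search merging"), ("u3", "search tips")],
    }
    print(merge("meta search", results, {"A": 0.6, "B": 0.4}))
```

In this toy input, `u1` is ranked first by both engines and matches every query term, so it leads the merged list. `u3` and `u2` each appear in only one list and are ordered by their own scores plus the shadow scores filled in for the engine that missed them.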