期刊文献+

基于查询日志的局部共现查询扩展 被引量:4

QUERY EXPANSION OF LOCAL CO-OCCURRENCE BASED ON QUERY LOG
下载PDF
导出
摘要 查询扩展是信息检索中的一个关键问题,查询扩展的有效性决定了检索系统的检索性能。大多数的查询扩展基于全局分析或者局部分析法,虽然准确率有了很大的提高,但是都有各自的局限性。查询日志是大量用户长期查询行为的记录。提出了基于查询日志的局部共现查询扩展方法,该方法通过挖掘用户初始查询与查询日志之间的联系,构建一个用户初始查询与用户文档的关联关系图,并且使用局部共现的方法构建相关词表,从而实现查询扩展。在50 000篇语料上的测试表明,该方法相对于未扩展时准确率提高了44%以上。 Query extension is a key issue in information retrieval, the efficiency of query expansion determines the retrieval performance of retrieval system. Most of the query expansions are based on global analysis or local analysis, though the accuracies have been greatly improved, but they all have their own limitations. Query log is the record of long term query behaviour by a great quantity of users. In this paper, we propose a query log-based expansion method of local co-occurrence, through which we can build an associated diagram of user initial query and user document through mining the link between user's initial query and user logs, and construct the related word list using local co-occurrence method, thus to realise the query expansion. The test on 50,000 corpora shows that the precision has about 44% improvement after using this method.
出处 《计算机应用与软件》 CSCD 北大核心 2013年第12期22-27,共6页 Computer Applications and Software
基金 国家自然科学基金项目(61003126)
关键词 全局分析 局部分析 查询扩展 查询日志 局部共现 Global analysis Local analysis Query expansion Query log Local co-occurrence
  • 相关文献

参考文献20

  • 1余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 2Furnas G W,Landauer T K,Gomez LM. The vocabulary problem in human-system communication[J].{H}Communications of the ACM,1987,(11):964-971.
  • 3Mei Kobayashi,Koichi Takeda. Information on retrieval on the web[J].ACM Computing Survey,2000,(2):328-354.
  • 4Nekrestyanov I S,Panteleeva N V. Text Retrieval Systems for the Web[J].{H}PROGRAMMING AND COMPUTER SOFTWARE,2002,(4):207-225.
  • 5Cui Hang,Wen Jirong,Nie Jianyun. Probabilistic Query Expan-sion Using Query Logs[A].2002.325-332.
  • 6Buckley C,Singhal A,Mitra M. New retrieval approaches using SMART[A].National Institute of Standards and Tech-nology,Gaithersburg,MD,1995.25-48.
  • 7Xu J X,Croft W B. Improving the Effectiveness of Information Retrieval with Local Context Analysis[J].ACM Transactions on Information Sys-tems,2000,(1):79-112.
  • 8崔航,文继荣,李敏强.基于用户日志的查询扩展统计模型[J].软件学报,2003,14(9):1593-1599. 被引量:61
  • 9吴京慧;于珊珊;王明文.基于用户日志聚类的查询扩展模型[A]第三届全国信息检索与内容安全学术会议,2007540-542.
  • 10熊忠阳,向海燕,张玉芳.结合用户日志的局部上下文分析方法[J].计算机工程与应用,2012,48(12):74-77. 被引量:3

二级参考文献53

  • 1蒋辉,阳小华.基于文档与搜索结果上下文的查询扩展方法[J].计算机应用,2009,29(3):852-853. 被引量:6
  • 2王继民,彭波.搜索引擎用户点击行为分析[J].情报学报,2006,25(2):154-162. 被引量:45
  • 3Furnas GW, Landauer TK, Gomez LM, Dumais ST. The vocabulary problem in human-system communication. Communication of ACM, 1987,30(11):964~971.
  • 4Wen JR, Nie JY, Zhang HJ. Clustering user queries of a search engine. In: Proceedings of the 10th International World Wide Web Conference (WWW10). New York: ACM Press, 2001. 162~168.
  • 5Xu JX, Croft WB. Query expansion using local and global document analysis. In: Frei HP, Harman D, Schauble P, Wilkinson R,eds. Proceedings of the 19th Annual International SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1996. 4~11.
  • 6Xu JX, Croft WB. Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems, 2000,18(1):79~112.
  • 7Deerwester S, Dumai ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. Journal of ACM Transactions on Information Systems, 2000,18(1):79~112.
  • 8Qiu Y, Frei H. Concept based query expansion. In: Korfhage R, Rasmussen EM, Willett P, eds. Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1993.160~169.
  • 9Attar R, Fraenkel AS. Local feedback in full-text retrieval systems. Journal of the ACM, 1977,24(3):397~417.
  • 10Buckley C, Salton G, Allan J, Singhal A. Automatic query expansion using SMART. Technical Report, TREC-3, 1995. 69~80.

共引文献214

同被引文献54

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部