期刊文献+

基于大规模问答对数据的查询扩展技术研究 被引量:2

The Study on the Query Expansion Technique Based on Q&A Data
下载PDF
导出
摘要 查询扩展是信息检索领域中的一个热门话题,其目标是将与初始查询词相关的其他单词添加到初始查询请求中,以更详细地描述用户的信息需求.本文将查询过程视为特殊的问答过程,并基于此思想提出一种新的查询扩展方法.本文的贡献主要有以下几点:① 借助统计语言模型从大规模问答对数据中挖掘单词之间的扩展关系,并根据单词间的扩展关系对候选扩展词进行评级;② 提出一个新的查询扩展词选取策略,以克服已有查询扩展方法仅依赖评级的扩展词选取策略的不足.通过在真实数据集合上的实验,证明本文提出的查询扩展方法可以取得优于传统方法的性能,具有一定的实用性. Query expansion technique is a hot topic in information retrieval. The aim of this technique is add terms in the original query to form a suitable query. In this article, we view the question part and the answer part of Q&A pairs in a huge Q&A archive as query and the webpages that satisfied the information need in the query. The contribution of this paper is: (1) using a statistical language model to mine the expansion probability between words, and (2) propose a candidate expansion words selection approach to form the new query to avoid the shortcomings of the prior query expansion methods. The experimental results on a real data set show that our approach performs better than the traditional query expansion techniques.
出处 《情报学报》 CSSCI 北大核心 2012年第4期407-415,共9页 Journal of the China Society for Scientific and Technical Information
基金 国家社科基金项目(10BTQ046) 国家科技支撑计划(2009BAK65B05) 中国博士后科学基金资助项目(20110491139).
关键词 查询扩展 信息检索 问答数据 语言模型 query expansion, information retrieval, Q&A data, language model
  • 相关文献

参考文献27

  • 1李卫疆,赵铁军,王宪刚.基于上下文的查询扩展[J].计算机研究与发展,2010,47(2):300-304. 被引量:32
  • 2Pitkow J, SchUtze H, Cass T, et al. Personalized Search [ J ]. Communications of the ACM, 2002,45 ( 9 ) : 50-55.
  • 3Cuerzan S, White R W. Query Suggestion based on Landing Pages [ C ]//Proceedings of SIGIR, 2007 : 875-876.
  • 4Jensen E C, Beitzel S M, Chowdhury A, et al. Query Phrase Suggestion From Topically Tagged Session Logs [ C ]//Proceedings of FQAS ,2006 : 185-196.
  • 5Kurland O, Lee L, Domshlak C. Better than the Real Thing? Iterative Pseudo-Query Processing using Cluster- Based Language Models [ C ]//Proceedings of SIGIR, 2005 : 19-26.
  • 6Rocchio J. Relevance Feedback in Information Retrieval [ M ]//the SMART Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall Inc, 1971 : 313-323.
  • 7Liu X, Croft W B. Cluster-based Retrieval Using Language Models[ C ]//Proceedings of SIGIR ,2004 : 186-193.
  • 8吴丹,何大庆,王惠临.基于伪相关反馈的跨语言查询扩展[J].情报学报,2010,29(2):232-239. 被引量:19
  • 9Mitra M, Singhal A, Buckley C. Improving Automatic Query Expansion [ C ]//Proceedings of SIGIR 1998, 1998 : 206-214.
  • 10Lee K S,Croft W B, and Allan J. A Cluster-based Re- sampling Method For Pseudo-relevance Feedback [ C ]// Proceedings of SIGIR 2008,2008:235-242.

二级参考文献64

共引文献160

同被引文献120

  • 1张亮,黄河燕,胡春玲.基于Ontology的中文问答系统问题分类研究[J].中国图书馆学报,2006,32(2):60-65. 被引量:3
  • 2Mark T M. New directions in question answering [ M ]. AAAI Press; Cambridge, Mass. : Copublished and distributed by The MIT Press. 2004.
  • 3TREC K [ E B/OL ]. [ 2011-04-25 ] http ://trec. nist. gov/.
  • 4Yllias C. Question answering using question classification and document tagging [ J ]. Applied Artificial Intelligence, 2009, 23 (2) : 500-521.
  • 5Surdeanu M, Ciaramita M, Zaragoza H. Learning to rank answers on large online QA collections[ C ]. Proceedings of ACL 2008, 2008:719-727.
  • 6农业中国[EB/OL].[2013-10-15]http://ask.nongye.cn/.
  • 7Zhang J, Zhu X, Zhu G. Classification of agricultural questions based on rule templates and SVM in Chinese [C].第七届科技信息资源共享促进国际会议,2012.
  • 8Luhn P H. Automatic creation of literature abstracts [J]. IBM Journal, 1958,2(4):159-165.
  • 9Gerard S, Amit S l, Mandar M, et al. Automatic text structuring and summarization [ J ]. Information Processing & Management, 1997,33 ( 2 ) : 193-207.
  • 10Das R, Elikkottil A. Automatic summarizer to aid a Q/ A system [ J ]. International Journal of Computer Applications, 2010,1 ( 1 ) : 108-112.

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部