期刊文献+

基于主题语言模型的句子检索算法 被引量:8

Sentence Retrieval with a Topic-Based Language Model
下载PDF
导出
摘要 提出了基于主题语言模型的汉语问答系统句子检索算法,该算法利用问答系统中特有的提问分类信息(即提问的答案语义信息)对句子初检结果进行主题聚类,通过AspectModel将句子所属的主题信息引入到语言模型中,从而获得对句子语言模型更精确的描述.对于初检结果的聚类,提出了“一个句子多个主题”和“一个句子一个主题”两种算法.相对于PLSI算法的主题空间维度,提出的主题空间具有更加明确的物理意义;由于不需要迭代运算,运行速度更具优势.对比实验的结果表明,与标准语言模型方法相比,基于主题语言模型的方法可以明显地提高汉语问答系统句子检索模块的性能. A novel topic-based language model for sentence retrieval in Chinese question answering is presented in this paper. The main idea is to make use of the peculiar characteristics in question answering scenario, that is, the semantic category of the expected answer, to conduct topic segmentation, and then incorporate the topic information of the sentence into the standard language model. For the topic segmentation, two approaches are presented, that is, one-sentence-one-topic and one-sentence-multi-topics. The experimental results show that the performance of sentence retrieval based on the proposed topic-based language model is improved significantly.
出处 《计算机研究与发展》 EI CSCD 北大核心 2007年第2期288-295,共8页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60372016) 北京市自然科学基金项目(4052027)
关键词 汉语问答系统 语言模型 句子检索 Chinese question answering language model sentence retrieval
  • 相关文献

参考文献14

  • 1A Ittycheriah,S Roukos.IBM's statistical question answering system-TREC 11[C].The 11th Text REtrieval Conference,Gaithersburg,Maryland,USA,2002
  • 2H Yang,T S Chua.The integration of lexical knowledge and external resources for question answering[C].The 11th Text REtrieval Conference,Maryland,USA,2002
  • 3A C Emmanuel,W B Croft,V Murdock.Answer passage retrieval for question answering[C].The 27th Annual Int'l Conf on Research and Development in Information Retrieval,Sheffield,UK,2004
  • 4V Murdock,W B Croft.Simple translation models for sentence retrieval in factoid question answering[C].The SIGIR 2004 Workshop on Information Retrieval for Question Answering,Sheffield,UK,2004
  • 5W Bruce Croft,John Lafferty.Language Modeling for Information Retrieval[M].Amsterdam,Netherlands:Kluwer Academic Publishers,2003
  • 6C Zhai,J Lafferty.A study of smoothing techniques for language modeling applied to ad hoc information retrieval[C].The ACM SIGIR Conf on Research and Development in Information Retrieval,New Orleans,USA,2001
  • 7A Berger,R Caruana,D Cohn,et al.Briding the lexical chasm:Statistical approaches to answer-finding[C].The 23rd Annual Conf on Research and Development in Information Retrieval,Athens,Greece,2000
  • 8T Hofmann.Probabilistic latent semantic indexing[C].The 22nd Annual Int'l SIGIR Conf on Research and Development in Information Retrieval,Berkeley,USA,1999
  • 9J Ponte,W Bruce Croft.A language modeling approach to information retrieval[C].The 1998 ACM SIGIR,Melbourne,Australia,1998
  • 10V Lavrenko,W B Croft.Relevance-based language models[C].The 2001 ACM SIGIR Conf on Research and Development in Information Retrieval,New Orleans,USA,2001

同被引文献55

引证文献8

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部