
一种面向实体的演化式多文档摘要生成方法 (An Entity-Oriented Evolutionary Method for Multi-Document Summarization)

A Method for Entity-Oriented Timeline Summarization
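Abstract (translated from the Chinese): Existing multi-document summarization ignores entities and produces only generic summaries. To address this, the paper proposes an entity-oriented evolutionary multi-document summarization method. It first uses a probabilistic topic model to jointly model the evolution of document topics and the participation of entities. It then scores and selects sentences with respect to an entity, so the same sentence may receive different scores for different entities. Extensive experiments and analysis on real-world datasets show that the method can generate personalized summaries of how an event develops for different entities and that, compared with existing methods, it also yields better generic summaries.

Abstract (English, as published): This paper proposes a novel entity-oriented timeline summarization method for multiple documents. It first introduces a topic model that simultaneously models dynamic topics and entity participation, and develops an efficient Gibbs sampler for this model. Each sentence is then assigned a score based on the discovered topics, and the highest-scoring sentences are selected as the summary. Experimental results on real-world datasets show that the proposed model not only generates summaries for entities but also outperforms the baseline model under ROUGE evaluation.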
Source: Journal of Guangxi Normal University: Natural Science Edition (《广西师范大学学报(自然科学版)》; indexed in CAS and the Peking University Core Journals list), 2015, No. 2, pp. 36-41 (6 pages)
Funding: National "863" Program major project grants (2014AA7013033, 2014AA7115061, 2014AA7115028)
Keywords: multi-document summarization; probabilistic topic model; natural language processing
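
The abstract says that sentences are scored with respect to an entity, so one sentence can score differently for different entities. The record does not give the paper's actual scoring formula, so the following is only a minimal sketch of the idea under stated assumptions: the topic-word and entity-topic distributions are taken as already inferred (for example by the Gibbs sampler the abstract mentions), and a sentence's score is assumed to be the entity-weighted average topic probability of its words. All names (`TOPIC_WORD`, `ENTITY_TOPIC`, `score_sentence`, `select_summary`) and all numbers are hypothetical toy values.

```python
from typing import Dict, List

# Hypothetical toy distributions, NOT taken from the paper.
# TOPIC_WORD[k][w] stands in for P(word w | topic k).
TOPIC_WORD: List[Dict[str, float]] = [
    {"earthquake": 0.5, "rescue": 0.3, "aid": 0.2},
    {"election": 0.6, "vote": 0.3, "aid": 0.1},
]
# ENTITY_TOPIC[e][k] stands in for P(topic k | entity e).
ENTITY_TOPIC: Dict[str, List[float]] = {
    "RedCross": [0.9, 0.1],
    "Parliament": [0.2, 0.8],
}


def score_sentence(tokens: List[str], entity: str) -> float:
    """Assumed scoring rule: sum over topics of
    P(topic | entity) * mean P(word | topic) over the sentence's tokens."""
    if not tokens:
        return 0.0
    total = 0.0
    for k, topic_weight in enumerate(ENTITY_TOPIC[entity]):
        mean_word_prob = sum(TOPIC_WORD[k].get(t, 0.0) for t in tokens) / len(tokens)
        total += topic_weight * mean_word_prob
    return total


def select_summary(sentences: List[List[str]], entity: str, budget: int = 1) -> List[List[str]]:
    """Keep the `budget` highest-scoring sentences for the given entity."""
    ranked = sorted(sentences, key=lambda s: score_sentence(s, entity), reverse=True)
    return ranked[:budget]


if __name__ == "__main__":
    sentences = [["rescue", "aid"], ["election", "vote"]]
    for entity in ENTITY_TOPIC:
        print(entity, select_summary(sentences, entity))
```

With these toy values, the sentence about rescue and aid ranks first for the hypothetical entity `RedCross`, while the sentence about the election ranks first for `Parliament`, which is the entity-dependent behavior the abstract describes. A real system would also need the temporal (evolutionary) dimension and redundancy control, which this sketch omits.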