期刊文献+

基于基本要素向量空间的英文多文档自动摘要 被引量:2

English Multi-document Summarization Based on Basic Element Vector Space
下载PDF
导出
摘要 在基于基本要素(BE)向量空间的英文多文档自动文摘中,句子不再用术语向量或词向量来表达,而是用基本要素向量来表示。在用k-均值聚类算法时,采用一种自动探测k值的技术。实验表明,基于基本要素的多文档自动文摘MSBEC比基于词更优越。 This paper proposes a novel multi-document sulmmarization strategy based on basic element(BE) vector clustering. In this strategy, sentences are represented by BE vectors instead of word or term vectors before clustering. The BE-vector clustering is realized by adopting the k-means clustering method, and a novel clustering analysis method is employed to automatically detect the number of clusters, k. The experimental results indicate a superiority of the proposed strategy over the traditional summarization strategy based on word vector clustering.
出处 《计算机工程》 CAS CSCD 北大核心 2007年第14期166-167,170,共3页 Computer Engineering
基金 国家自然科学基金资助重大项目(90104005)
关键词 多文档自动文摘 基本要素 K-均值聚类 multi-document summarization basic element k-means clustering
  • 相关文献

参考文献9

  • 1Dragomir R,Jing Hongyan,Malgorzata B.Centroid-based Summarization of Multiple Documents:Sentence Extraction,Utility-based Evaluation and User Studies[J].Information Processing and Management,2004,40(6):919-938.
  • 2Knight K,Marcu D.Summarization Beyond Sentence Extraction:a Probabilistic Approach to Sentence Compression[J].Artificial Intelligence,2002,139(1):91-107.
  • 3Barzilay R,McKeown K R,Elhadad M.Information Fusion in the Context of Multi-document Summarization[C]//Proceedings of the 37^th Annual Meeting of the Association for Computational Linguistics.New Jersey:Association for Computational Linguistics,1999:550-557.
  • 4Hovy E,Lin Chin-Yew,Zhou Liang,et al.Basic Elements[Z].2005.Http://www.isi.edu/~cyl/BE/index.html.
  • 5Lin Dekang.Minipar[Z].1998.Http://www.cs.ualberta.ca/~lindek/ minipar.htm.
  • 6Baeza-Yates R,Ribeiro-Neto B.Modern Information Retrieval[M].New York:Addison Wesley,1999:27-30.
  • 7Pantel P,Lin Dekang.Document Clustering with Committees[C] //Proceedings of ACM,SIGIR'02.New York:ACM,2002:199-206.
  • 8Webb A R.Statistical Pattern Recognition[M].2^nd ed.John Wiley & Sons,2002:376-379.
  • 9Lin Chin-Yew,Hovy E.Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics[C]//Proceedings of the Human Technology Conference on HLTNAACL′03,Edmonton,Canada.2003.

同被引文献22

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部