期刊文献+

Research on multi-document summarization based on latent semantic indexing

Research on multi-document summarization based on latent semantic indexing
下载PDF
导出
摘要 A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decomposition (SVD) to reduce the dimension of the matrix and extract features, and then the sentence similarity is computed. The sentences are clustered according to similarity of sentences. The centroid sentences are selected from each class. Finally, the selected sentences are ordered to generate the summarization. The evaluation and results are presented, which prove that the proposed methods are efficient. A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decomposition (SVD) to reduce the dimension of the matrix and extract features, and then the sentence similarity is computed. The sentences are clustered according to similarity of sentences. The centroid sentences are selected from each class. Finally, the selected sentences are ordered to generate the summarization. The evaluation and results are presented, which prove that the proposed methods are efficient.
出处 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2005年第1期91-94,共4页 哈尔滨工业大学学报(英文版)
基金 SponsoredbytheNationalNaturalScienceFoundationofChina(GrantNo. 60203020).
关键词 multi-document summarization LSI (latent semantic indexing) CLUSTERING 信息处理技术 索引 多文本摘要 网站 信息过滤系统
  • 相关文献

参考文献7

  • 1FUKUHARATomohiro,TAKEDAHideaki,NISHIDAToyoaki.Multiple textSummarizationforCollectiveKnowledgeFormation[].WorkshoponSocialAspectsofKnowledgeandMemory.1999
  • 2EVANSKD,KLAVANSJL,MCKEOWNKR.Colum bianewsblaster:MultilingualnewssummarizationontheWeb[].HLT NAACL.2004
  • 3RADEVR,JING Hongyan,BUDZIKOWSKA Malgorzata.Centroid-based summarization of multiple documents:Sentence extraction, utility-based evaluation, and user studies[].ANLP-NAACL Workshop.2000
  • 4ANDO R K,BOGURAEV B K,BYRD R J,et al.MultiDocument Summarization by Visualizing Topical Content[].ANLP-NAACL.2000
  • 5CARBONELL J G,GOLDSTEIN J.The use of MMR, diversity-based reranking for reordering documents and producing summaries[].Information Processing and Management.1998
  • 6LIN Chin-Yew,HOVY Eduard.From single to multidocument summarization: A prototype system and its evaluation[].Proceedings of the th Anniversary Meeting of the Association for Computational Linguistics( ACL - ).2002
  • 7REGINA B,ELHADAD N,MCKEOWN K R.Sentence ordering in multidocument summarization[].Proceedings of the st Human Language Technology Conference.2001

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部