期刊文献+

一种基于特征信息的Blog自动文摘研究

Blog automatic summarization based on features information
下载PDF
导出
摘要 为了有效地对Blog进行摘要抽取,以一种合理的方式挑选出对Blog摘要抽取有帮助的评论,然后在考虑句子词频的基础上结合Blog的结构化信息和挑选出的评论信息来计算Blog句子权重。针对基于句子权重选择摘要句容易忽略次要主题的缺陷,提出一种结合Blog段落形式特点进行二次摘要抽取的解决方法。在随机下载的Blog数据中进行了实验,该方法具有较好的覆盖性和概括性。 To help extract the summary of a Blog effectively,first selected a number of comments in the Blog in a reasonable way.Then,based on considering word frequency in the sentence,this paper calculated the weight of the sentence in the Blog,combined with structured information and the selected comments.However,this method was easy to neglect the minor subject.After that,to overcome the drawback,proposed a solution of secondary Abstract extract through the characteristics of paragraph form in the Blog.Finally,an experiment was done with Blog data random downloaded on the Internet,demonstrating the method has a better spreadability and generality.
出处 《计算机应用研究》 CSCD 北大核心 2011年第10期3760-3763,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(60970015 61003054) 2009年江苏省基础研究计划企业博士创新项目(BK2009563) 江苏省高校自然科学研究项目(10KJB520018) 苏州市科技型企业技术创新资金专项项目(SG201043)
关键词 博客摘要 评论 特征信息 主题覆盖 Blog summary comment feature information subject coverage
  • 相关文献

参考文献8

  • 1中国互联网络发展状况统计报告[R]CNNIC,2006.
  • 2DELORT J Y. Identifying commented passages of documents using implicit hyperlinks [ C ]//Proc of the 17th Conference on Hypertext and Hypermedia. 2006:89-98.
  • 3HU Mei-shan, SUN Ai-xin, LIME P. Comments oriented Blog sum- marization by sentence extraction [ C ]//Proc of CIKM. 2007: 901- 904.
  • 4SUN Shuang, HE Liang, LV Zhao, et al. A new approach to Blog post summarization using fast features [ C ]// Proc of the 5th Interna- tional Conference on Fuzzy Systems and Knowledge Discovery. 2008 : 8-13.
  • 5MISHNE G, GLANCE N. Leave a reply: analysis of Web log com- ments[ C]//Proc of the 15th International Conference on World Wide Web. 2006.
  • 6王萌,何婷婷,张伟.基于概念向量空间模型的中文自动文摘系统[J].计算机工程与应用,2005,41(1):107-110. 被引量:5
  • 7王继成,武港山,周源远,张福炎.一种篇章结构指导的中文Web文档自动摘要方法[J].计算机研究与发展,2003,40(3):398-405. 被引量:43
  • 8MORRIS A, KASPER G, ADAMS D. The effects and limitations of automated text condensing on reading comprehension performance [ J ]. Information Systems Research, 1992,3 ( 1 ) : 17- 35.

二级参考文献14

  • 1Luhn H P.The Automatic creation of literature abstracts[J].IBM J Res and Dev, 1958 ;2(2) : 159-165.
  • 2Sahon G,wong A,Yang C S.A vector space model for automatic indexing [J].Communications of ACM, 1995 ; 18:613-620.
  • 3Edmundson H P.New Methods in Automatic Extraction[J].Journal of the ACM, 1968; 16(2).
  • 4Barzilay R,M Elhadad.Using Lexical Chains for Text Summarizer[C]. In :Proceedings of the Workshop on Intelligent Scalable Text Summarization at the ACL/EACL Conference,10-17.Madrid,Spain,1997.
  • 5Yu ShiWen,Duan Huiming,Tian Jianqiu.The theory and implement- ation of automatic evaluation of mechanical abstraction[C].In:Proc of 97"s National Conf on Intelligent Machines(in Chinese).Beijing:Pub- lishing House of Electronics Industry, 1997:230-233.
  • 6J Kupiec. J Pedersen et al. A trainable document summarizer. In: Proc of the 18th Annual Int'l ACM SIGIR Conf on Research and Development in Information Retrieval (SIGIR'95). Seattle, Washington, USA: ACM Press, 1995. 68~73
  • 7R Brandow, K Mitze, L F Rau. Automatic condensation of electronic publication by sentence selection. Information Processing and Management, 1995, 34(5): 575~685
  • 8吴岩,刘挺,王开铸,陈彬.中文自动文摘原理与方法探索[J].中文信息学报,1998,12(2):8-16. 被引量:20
  • 9孙春葵,李蕾,杨晓兰,钟义信.基于知识的文本摘要系统研究与实现[J].计算机研究与发展,2000,37(7):874-881. 被引量:19
  • 10王继成,萧嵘,孙正兴,张福炎.Web信息检索研究进展[J].计算机研究与发展,2001,38(2):187-193. 被引量:118

共引文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部