期刊文献+

基于遗传算法的查询导向式自动文摘

Query-Oriented Summarization Based on Genetic Algorithm
下载PDF
导出
摘要 查询导向式自动文摘是近年来文本挖掘领域的一个热点研究课题,它以自动生成偏向用户查询需求的个性化简洁摘要为目的。本文从优化问题的角度提出一种基于遗传算法的句子抽取型文摘选择策略和方法,可以满足摘要长度限制的不同句子集合构成的随机摘要作为初始种群,将文摘的综合特性评价函数作为适应函数,通过遗传算法的全局寻优能力搜索到整体特性接近最优的句子集合作为摘要。该方法将摘要的查询偏好性与冗余性无缝地集成到遗传算法的适应函数中,因而能使生成的摘要具有更优的综合质量。在新浪网上随机抽取100个不同主题的新闻文本作为摘要测试文本,通过实验,验证了该策略和方法的有效性。 Query-oriented summarization is a hot research issue in text mining, which aims to generate a query-biased concise summary in accordance with user needs. This paper proposes a sentence extractive summarization approach based on genetic algorithm from the perspective of optimization problem. In the method, different sentence sets constituting the random summaries and conforming to specific length limit are selected as the initial population and the evaluation function for a summary's comprehensive characteristics is considered as the fitness function. With the global optimization ability of genetic algorithm, the sentence set with the best overall performance is selected to create the summary. This method seamlessly integrates the query preference and redundancy into the fitness function of the genetic algorithm to ensure the created summary a better quality. Experimental results on one hundred of news documents with different topics randomly selected from Sina website have demonstrated the effectiveness of the proposed approach.
作者 王海 胡珀
出处 《微计算机信息》 2009年第28期23-25,共3页 Control & Automation
关键词 查询导向式自动文摘 遗传算法 句子抽取 query-oriented summarization genetic algorithm sentence extraction
  • 相关文献

参考文献7

  • 1Wauter Bosma. 2005. Query-Based Summarization using Rhetorical Structure Theory. In Proceedings of CLIN04.
  • 2Yu-Chieh Wu, Kun-Chang Tsai, Yue-Shi Lee, Jie-Chi Yang. 2006. Light-Weight Multi-Document Summarization Based on Two-Pass Re-Ranking.In Proceedings of DUC2006.
  • 3Jagadeesh Jagarlamudi,Prasad Pingali,Vasudeva Varma. 2006. Query Independent Sentence Scoring Approach to DUC 2006.In Proceedings of DUC2006.
  • 4B. Favre, F. Bechet, P. Bellot, F. Boudin, M.E1-Beze, L.Gillard, J.-M.Torres-Moreno. 2006. The HA-Thales Summarization System at DUC 2006. In Proceedings of DUC2006.
  • 5Sujian Li, You Ouyang, Bin Sun, Peking University at DUC 2006.In Proceedings of DUC2006.
  • 6刘德喜,何炎祥,姬东鸿,杨华.一种基于演化算法进行句子抽取的多文档自动摘要系统SBGA[J].中文信息学报,2006,20(6):46-53. 被引量:10
  • 7刘海涛,老松杨,韩智广.自动文摘系统中的段落自适应聚类研究[J].微计算机信息,2006,22(06X):288-291. 被引量:6

二级参考文献25

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部