期刊文献+

基于多特征融合模型的自动摘要 被引量:3

Multi-feature combination based automatic summarization
下载PDF
导出
摘要 为解决文本自动摘要任务中特征挖掘不充分的问题,选取句子的词汇、相对位置、长度和句间相似度4个特征,提出一种基于多特征融合模型的摘要系统。基于句法树的词汇特征充分利用语法信息,消除传统方法获取关键词的局限性,相对位置特征通过获取位置的高阶信息对句子进行赋值,长度特征过滤掉过长的句子,基于平滑逆向频率句嵌入方法构造句向量,有效计算句子间的相似度。实验结果表明,该系统提高了文本自动摘要的准确度。 To solve the problem of inadequate feature mining in automatic text summarization task,a summarization system based on multi-feature fusion model was proposed by selecting four features of sentence vocabulary,relative position,length and similarity between sentences.Among them,the lexical features based on syntactic tree made full use of the grammatical information and eliminated the limitation of the traditional method of obtaining keywords.The relative position feature assigned the sentence by obtaining the higher order information of the position.The length feature was used filter the rather long sentences.Based on the smoothing inverse frequency sentence embedding method,the sentence vector was constructed and the similarity between sentences was calculated effectively.Experimental results show that the system improves the accuracy of automatic text summarization.
作者 吴世鑫 黄德根 张云霞 WU Shi-xin;HUANG De-gen;ZHANG Yun-xia(College of Computer Science and Technology,Dalian University of Technology,Dalian 116000,China)
出处 《计算机工程与设计》 北大核心 2020年第3期650-655,共6页 Computer Engineering and Design
关键词 文本摘要 多特征融合 句法树 平滑逆向频率句嵌入 语义相似度 text summarization multi-feature combination syntactic tree smooth inverse frequency(SIF)sentence embedding semantic similarity
  • 相关文献

参考文献4

二级参考文献22

  • 1张奇,黄萱菁,吴立德.一种新的句子相似度度量及其在文本自动摘要中的应用[J].中文信息学报,2005,19(2):93-99. 被引量:34
  • 2刘功中,李建华,李生红.基于类信息的特征选择和加权方法[C]//第一届全国信息检索与内容安全学术会议.上海:上海交通大学出版社,2004.
  • 3Luhn H P.The automatic creation of literature abstract[J].IBM Journal of Research and Development,1958,2(2):159-165.
  • 4Edmundson H P.New methods in automatic extracting[J].Journal of the ACM (JACM),1969,6(2):264-285.
  • 5Erkan G,Radev D R.LexRank:Graph-based lexical centrality as salience in text summarization[J].J.Artif.Intell.Res.(JAIR),2004,22(1):457-479.
  • 6Antiqueira L,Oliveira Jr O N,Costa L F,et al.A complex net-work approach to text summarization[J].Information Sciences,2009,179(5):584-599.
  • 7Salton G,Lesk M E.Computer evaluation of indexing and text processing [J].Journal of the ACM,1968,15(1):8-36.
  • 8Machine B E.Made index for technical literature an experiment[J].IBM Journal of Research and Development,1958,12(4):354-361.
  • 9Ozsoy M G,Alpaslan F N,Cicekli I.Text summarization using latent semantic analysis[J].Journal of Information Science,2011,37(4):405-417.
  • 10王永成,许慧敏.OA中文文献自动摘要系统[J].情报学报,1997,16(2):128-132. 被引量:26

共引文献57

同被引文献18

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部