
Low Resource Summarization Model Based on Latent Structural Semantic Enhancement
Abstract: At present, low-resource summary generation tasks are usually handled by data augmentation or by pre-training combined with fine-tuning, neither of which makes full use of the latent structural semantic information between the source text and the target summary. To address this, this paper proposes a low-resource summarization model based on latent structural semantic enhancement, which uses graph structure alignment to strengthen the model's use of structured information. First, the model obtains the latent structural semantic features of the source text and the predicted summary through a structural feature representation layer. Then, a latent structure alignment module performs node alignment and edge alignment on these features, which helps the model capture the structured information in the semantic features and thus make better use of structured knowledge. Finally, the structured feature alignment distance between the source text and the predicted summary is used as a regularization term in the target loss to assist optimization. In experiments on low-resource datasets from six domains, the ROUGE-1 score improves by 0.58 on average over the baseline model. The results show that using latent structured semantic knowledge can effectively improve low-resource summary generation.
Authors: LIU Yu; LIU Xiaoming; LIU Weiguang; YANG Guan; LIU Jie (School of Computer Science, Zhongyuan University of Technology, Zhengzhou 450007, China; Henan Key Laboratory on Public Opinion Intelligent Analysis, Zhengzhou 450007, China; Software College, Zhongyuan University of Technology, Zhengzhou 450007, China; School of Information Science, North China University of Technology, Beijing 100144, China; China Language Intelligence Research Center, Beijing 102206, China)
Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), indexed in CSCD and the PKU Core list, 2023, No. 8, pp. 1961-1973 (13 pages)
Funding: National Key Research and Development Program of China (2020AAA0109700); National Natural Science Foundation of China (62076167, 61772020).
Keywords: low resource; structured; semantic features; graph structure
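
The abstract describes adding a structural alignment distance to the target loss as a regularization term. The record gives no formulas, so the following is only a minimal sketch of that general idea and not the authors' implementation. Every name here is hypothetical, and the specific choices (hard nearest-neighbour node alignment, cosine similarity as edge weight, a regularization weight of 0.1) are assumptions made purely for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def align_nodes(summary_nodes, source_nodes):
    """Assumption: align each summary node to its most similar source node."""
    return [
        max(range(len(source_nodes)), key=lambda j: cosine(s, source_nodes[j]))
        for s in summary_nodes
    ]


def node_distance(summary_nodes, source_nodes, alignment):
    """Mean cosine distance between each summary node and its aligned source node."""
    dists = [1.0 - cosine(s, source_nodes[j]) for s, j in zip(summary_nodes, alignment)]
    return sum(dists) / len(dists)


def edge_distance(summary_nodes, source_nodes, alignment):
    """Mean absolute gap between each summary edge and the corresponding source edge.

    Assumption: an edge weight is the cosine similarity of its two endpoints.
    """
    gaps = []
    for a in range(len(summary_nodes)):
        for b in range(a + 1, len(summary_nodes)):
            w_summary = cosine(summary_nodes[a], summary_nodes[b])
            w_source = cosine(source_nodes[alignment[a]], source_nodes[alignment[b]])
            gaps.append(abs(w_summary - w_source))
    return sum(gaps) / len(gaps) if gaps else 0.0


def regularized_loss(generation_loss, summary_nodes, source_nodes, weight=0.1):
    """Generation loss plus a weighted node-and-edge alignment distance."""
    alignment = align_nodes(summary_nodes, source_nodes)
    distance = node_distance(summary_nodes, source_nodes, alignment)
    distance += edge_distance(summary_nodes, source_nodes, alignment)
    return generation_loss + weight * distance


# Toy node features: the summary reuses two of the three source nodes exactly,
# so the alignment distance is zero and the loss is left unchanged.
source = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
summary = [[1.0, 0.0], [0.0, 1.0]]
print(regularized_loss(2.0, summary, source))
```

In this sketch a summary whose node features and pairwise relations match part of the source graph incurs no penalty, while a structurally mismatched summary raises the loss. A trainable model would need differentiable (soft) alignment over learned features; the hard alignment here is chosen only to keep the example short.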
