期刊文献+

基于增强序列标注策略的单阶段联合实体关系抽取方法 被引量:3

Single-stage Joint Entity and Relation Extraction Method Based on Enhanced Sequence Annotation Strategy
下载PDF
导出
摘要 从非结构化文本中抽取实体和关系是自动构建知识库的基础工作。现有的工作主要采用联合学习方法来解决嵌套实体、重叠关系、冗余计算和曝光偏差等问题,但单个模型仅在部分问题上表现出色,尚无模型可以同时解决上述问题。因此,提出了一种基于增强序列标注策略的单阶段联合实体关系抽取方法(A Token With Multi-labels Entity and Relation Extraction,ATMREL)。首先,设计了一种增强序列标注策略,将文本中的每个单词标记为多个标签,标签包含每个单词在实体中的位置、关系类型和实体位置信息。然后,将每个单词的标签预测转化为多标签分类任务,同时将联合实体关系抽取转化为序列标注任务。最后,为增强实体对之间的依赖关系,引入实体相关矩阵,用于对抽取结果进行剪枝,以提升模型抽取效果。实验结果表明,与CasRel和TPLinker模型相比,ATMREL模型在NYT和WebNLG数据集上的参数量减少了3.1×10^(6)~5.4×10^(6),平均推理速度提升了2~4.2倍,F1值提升了0.5%~2.1%。 Extracting entities and relations from unstructured text is the fundamental task of automatically constructing know-ledge bases.Existing works mainly adopt joint learning to solve the problems of nested entities,overlapping relations,redundant computation,or exposure bias,but a single model only performs well on some issues,and no model can solve the above problems simultaneously.Therefore,a single-stage joint entity and relation extraction method based on an enhanced sequence annotation strategy called ATMREL is proposed.First,an enhanced sequence annotation strategy is designed to tag each word in the text with multiple labels,and the labels contain information about the position of each word in the entity,the relation type and the entity location.Second,the labels prediction of each word is transformed into a multi-label classification task,while the joint entity and relation extraction is transformed into a sequence annotation task.Finally,to enhance the dependencies between entity pairs,an entity correlation matrix is introduced for pruning the extraction results to improve the model extraction effect.Experimental results show that ATMREL model reduces the parameter volume by 3.1×106~5.4×106,improves the average inference speed by 2~4.2 times,and improves the F1 value by 0.5%~2.1%compared with the CasRel and TPLinker models on the NYT and WebNLG datasets.
作者 朱秀宝 周刚 陈静 卢记仓 向怡馨 ZHU Xiubao;ZHOU Gang;CHEN Jing;LU Jicang;XIANG Yixin(State Key Laboratory of Mathematical Engineering and Advanced Computing,Zhengzhou 450001,China)
出处 《计算机科学》 CSCD 北大核心 2023年第8期184-192,共9页 Computer Science
基金 河南省科技攻关项目(222102210081)。
关键词 联合实体关系抽取 序列标注 组合标签 相关矩阵 Joint entity and relation extraction Sequence annotation Combined labels Correlation matrix
  • 相关文献

参考文献2

二级参考文献16

共引文献916

同被引文献24

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部