基于深度学习的实体关系联合抽取研究综述被引量：6

Joint Extraction of Entities and Relations Based on Deep Learning:A Survey

下载PDF

导出

摘要实体关系抽取是信息抽取领域的核心任务.从文本中抽取的实体关系三元组是构建大规模知识图谱的基础.传统的流水线方法将实体关系抽取分解为独立的命名实体识别和关系抽取两个子任务.首先,构建一个高效的命名实体识别器,从大规模非结构化文本语句中识别实体边界和类型.然后,将该命名实体识别器识别的实体与类型作为关系抽取任务中所用数据的标注.最后,通过关系抽取器得到两个实体之间的关系类别,进而组合成为结构化的实体关系三元组.命名实体识别任务存在的误差会影响后续的关系抽取任务的性能,这使得流水线方法具有错误累积问题.这是因为关系抽取任务中使用的标注数据来自于前面的命名实体识别任务,这会有一定的误差,进而影响关系抽取的结果质量.此外,流水线方法减弱了两个子任务之间的特征关联,这会出现冗余实体的问题.命名实体识别任务和关系抽取任务独立进行学习训练,导致这两个子任务间缺乏交互,使得文本信息没有得到充分利用,限制了流水线方法的性能瓶颈.由于非结构化文本信息没有得到充分利用,流水线方法在抽取实体间长依赖关系时具有一定局限性,很难达到联合抽取模型的性能指标.实际应用中,实体间往往存在多种关系,流水线方法无法充分使用全局文本信息,且命名实体识别会产生冗余实体,在抽取多元重叠关系时,该方法具有一定的局限性.因此,在构建高准确率实体关系抽取模型时,流水线方法具有欠缺之处.本文对实体关系联合抽取的研究发展全景进行了综述,简要阐明整数线性规划、卡片金字塔解析模型、概率图模型和结构化预测模型这四类基于特征工程的联合模型的共同缺点.本文聚焦基于深度学习的实体关系联合抽取技术,根据近年来实体关系联合抽取前沿研究成果,总结了实体关系联合抽取模型的主流构建方法.按照建模思想的特点总结为三种建模方法:多模块-多步骤、多模块-单步骤以及单模块-单步骤.多模块-多步骤建模方法主要包含实体域映射关系域、关系域映射实体域和头实体域映射关系-尾实体域这三种类别.这三类模型的共同特点都是将三元组的提取过程分为多个模块,通过共享参数的方式整合各个模块,逐步迭代得到三元组.这种方法推动联合模型性能提升,初步解决了流水线方法存在的问题.但每个步骤使用独立的解码算法,导致解码误差累积问题.且共享参数整合各个模块的冗余误差会互相影响预测性能,从而产生级联冗余问题.多模块-单步骤建模方法旨在构建一个最优化的联合解码算法,并对其求取最优解进而得到最优超参数.这种方法设计了简单精确的联合解码算法,并加强了多个子模块间的交互性,减弱了因为逐步迭代导致的解码误差和级联冗余对联合模型性能的影响.然而,模块的分离依然会产生冗余错误,具有一定局限性.单模块-单步骤建模方法可以直接从文本语句中抽取三元组,有效缓解了多模块-多步骤和多模块-单步骤建模方法的级联错误和实体冗余等问题.本文以前沿文献中具有代表性的联合模型为例,详细分析了这些模型的建模思路,剖析了各个模型的优缺点,将多个具有共同建模思路的经典模型进行归类,以阐述实体关系联合抽取模型的发展趋势.本文将单模块-单步骤建模方法的代表模型在公开基准数据集上的模型性能与多模块-多步骤和多模块-单步骤的代表模型性能进行对比分析,阐明实体关系联合抽取模型的建模思路正在从基于多模块-多步骤和多模块-单步骤的复杂建模方法,逐渐向单模块-单步骤的高效建模方法转变的客观趋势.最后,本文对三个实体关系联合抽取的研究方向进行了展望.当下主流的联合模型聚焦于限定域的实体关系抽取任务,对于开放域问题研究得不够.开放域实体关系联合抽取任务是未来的研究人员亟待解决的问题之一.在实际工业应用中,文本语料包含多元信息,如时序信息.而当前的实体关系联合抽取模型大多依据单一文本上下文信息进行特征抽取,从而忽略了时序信息.若融入像时序信息这样的多元信息或能进一步提升联合模型性能,这是未来一项具有重大意义的课题.此外,对于跨文本的实体关系联合抽取模型研究较少,这也是该领域未来的一个研究趋势.本文旨在建立一个完整的基于深度学习的实体关系联合抽取领域研究视图,以对相关领域研究者有所帮助. Entity-relation extraction is a core task in the field of information extraction.Entity-relation triples extract-ed from text are the basis for building large-scale knowledge graphs.The traditional pipeline method decomposes entity-re-lation extraction into two subtasks:named entity recognition and relation extraction.First,an efficient named entity recog-nizer is built to identify the entity boundaries and types from large-scale unstructured text sentences.Then,the entities and types are used as labels for the data used in the relation extraction task.Finally,the relationship category between two enti-ties is obtained through the relationship extractor and then combined into a structured entity-relation triplet.However,error in the named entity recognition task will affect the performance of the subsequent relation extraction task,which makes the pipeline method problematic because of error accumulation.This is because the labeled data used in the relation extraction task come from the previous named entity recognition task,which will include certain errors,and this will affect the quality of the relation extraction results.In addition,the pipeline method weakens the feature association between the two subtasks,which will lead to redundant entities.The named entity recognition task and relationship extraction task are independently learned and trained,which leads to a lack of interaction between these two subtasks.As a result,the text information is not fully utilized,which becomes the main reason the performance of the pipeline method is limited.Because unstructured text information is not fully employed,the pipeline method has certain limitations in extracting long dependencies between enti-ties,and it is difficult to achieve high performance in the joint extraction model.In practical applications,there are often multiple relationships between entities,but the pipeline method cannot fully consider the global text information,and hence named entity recognition produces redundant entities,which has disadvantages when extracting multiple overlapping rela-tionships.Therefore,when constructing a high-accuracy entity-relation extraction model,the pipeline approach has short-comings.This paper reviews the research and development of the joint extraction of entity relationships.Furthermore,it briefly clarifies the common shortcomings of four types of joint models based on feature engineering:integer linear pro-gramming,card pyramid analysis models,probabilistic graph models,and structured prediction models.Focusing on the joint extraction techniques for entity relationships based on deep learning,the mainstream construction methods of these models are summarized according to the state-of-the-art results reported in recent years.According to the characteristics of the modeling idea,the modeling methods are categorized into three types:multi-module/multi-step,multi-module/single-step,and single-module/single-step models.Multi-module/multi-step modeling methods consist of three main types:entity domain mapping to the relationship domain,relationship domain mapping to the entity domain,and head-entity domain mapping to the relation-tail domain.The common feature of these three types of models is that they divide the extraction of triples into multiple modules,integrate each module by sharing the parameters,and gradually iterate to obtain triples.This approach improves the performance of the joint model and initially solves the problems of the pipeline method.However,because each step uses an independent decoding algorithm,it leads to the accumulation of decoding errors.Moreover,be-cause the redundant errors of each module integrated with shared parameters affect the prediction performance of the others,this results in cascading redundancies.The multi-module-single-step modeling method aims to construct an optimal joint de-coding algorithm and obtain the optimal solution to determine the optimal hyperparameters.This method designs a simple and accurate joint decoding algorithm and strengthens the interaction between multiple submodules.Therefore,the impact of decoding errors and cascading redundancies caused by gradual iterations on the performance of the joint model is weak-ened.However,the separation of the modules still produces redundancy errors,which cause certain limitations.The single-module/single-step modeling method can extract triples from text directly,which effectively alleviates the cascading error and entity redundancy problems of multi-module/multi-step and multi-module/single-step modeling methods.Taking the representative joint models in the high-impact literature as examples,this paper analyzes the modeling idea,advantages,and disadvantages of each model.It also classifies a number of classical models according to common modeling ideas to illus-trate trends in the development of entity-relationship joint extraction models.This paper compares and analyzes the perfor-mance of the representative single-module,single-step modeling method with multi-module/multi-step and multi-module/single-step models on a public benchmark data set.Moreover,it clarifies the objective trend that the modeling idea of joint extraction models is gradually changing from complex methods based on multi-module/multi-step and multi-module/single-step models to efficient single-module/single-step models Finally,this paper discusses the prospects of research directions in the joint extraction of three-entity relationships.The current mainstream joint model focuses on the entity-relationship ex-traction task of limited domains,and the open-domain entity-relationship joint extraction task is an urgent problem for fu-ture researchers to solve.In practical industrial applications,a text corpus contains multiple types of information,such as timing information.However,most current entity-relationship joint extraction models extract features based on single-text context information,thus ignoring time-series information.If multivariate information such as time-series information could be incorporated,the performance of the joint model would be further improved,and this is a topic of high importance for the future.In addition,there is little research on cross-text entity-relationship joint extraction models,which is also a future research topic in this field.This paper aims to establish a complete deep learning-based view of entity-relationship joint ex-traction research,which will be helpful to researchers in related fields.

作者张仰森刘帅康刘洋任乐辛永辉 ZHANG Yang-sen;LIU Shuai-kang;LIU Yang;REN Le;XIN Yong-hui(Institute of Intelligent Information Processing,Beijing Information Science and Technology University,Beijing 100192,China;Computer Network Emergency Response Technical Team,Coordination Center of China,Beijing 100029,China)

机构地区北京信息科技大学智能信息处理研究所国家计算机网络应急技术处理协调中心

出处《电子学报》 EI CAS CSCD 北大核心 2023年第4期1093-1116,共24页 Acta Electronica Sinica

基金国家自然科学基金(No.62176023)。

关键词信息抽取知识图谱深度学习实体关系联合抽取流水线方法 information extraction knowledge graph deep learning joint extraction of entities and relations pipe-line method

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献20

1刘峤,李杨,段宏,刘瑶,秦志光.知识图谱构建技术综述[J].计算机研究与发展,2016,53(3):582-600. 被引量：956
2徐健,张智雄,吴振新.实体关系抽取的技术方法综述[J].现代图书情报技术,2008(8):18-23. 被引量：54
3甘丽新,万常选,刘德喜,钟青,江腾蛟.基于句法语义特征的中文实体关系抽取[J].计算机研究与发展,2016,53(2):284-302. 被引量：74
4罗凌,杨志豪,宋雅文,李楠,林鸿飞.基于笔画ELMo和多任务学习的中文电子病历命名实体识别研究[J].计算机学报,2020,43(10):1943-1957. 被引量：47
5胡宇,申德荣,聂铁铮,寇月.面向生物医学实体链接的联合式学习方法[J].计算机学报,2022,45(4):748-765. 被引量：7
6郜成胜,张君福,李伟平,赵文,张世琨.一种基于混合神经网络的命名实体识别与共指消解联合模型[J].电子学报,2020,48(3):442-448. 被引量：4
7冯建周,宋沙沙,王元卓,刘亚坤,武红颖,龚昊.基于改进注意力机制的实体关系抽取方法[J].电子学报,2019,47(8):1692-1700. 被引量：18
8李志欣,孙亚茹,唐素勤,张灿龙,马慧芳.双路注意力引导图卷积网络的关系抽取[J].电子学报,2021,49(2):315-323. 被引量：9
9赵超,谢松县,曾道建,郑菲,程琛,彭立宏.融合预训练语言模型和标签依赖知识的关系抽取方法[J].中文信息学报,2022,36(1):75-82. 被引量：1
10鄂海红,张文静,肖思琪,程瑞,胡莺夕,周筱松,牛佩晴.深度学习实体关系抽取研究综述[J].软件学报,2019,30(6):1793-1818. 被引量：164

二级参考文献119

1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量：116
2姜吉发,王树西.一种自举的二元关系和二元关系模式获取方法[J].中文信息学报,2005,19(2):71-77. 被引量：5
3何婷婷,徐超,李晶,赵君喆.基于种子自扩展的命名实体关系抽取方法[J].计算机工程,2006,32(21):183-184. 被引量：25
4邓擘,樊孝忠,杨立公.用语义模式提取实体关系的方法[J].计算机工程,2007,33(10):212-214. 被引量：23
5董静,孙乐,冯元勇,黄瑞红.中文实体关系抽取中的特征选择研究[J].中文信息学报,2007,21(4):80-85. 被引量：55
6刘克彬,李芳,刘磊,韩颖.基于核函数中文关系自动抽取系统的实现[J].计算机研究与发展,2007,44(8):1406-1411. 被引量：59
7Schutz A, Buitelaar P. RelExt:A Tool for Relation Extraction from Text in Ontology Extension [ C ]. 4th International Semantic Web Conference, Galway, Ireland, November 6 - 10, 2005:593 - 606.
8Katrenko S, Adriaans P. Learning Relations from Biomedical Corpora Using Dependency Tree Levels [ C ]. In : Proc. BENELEARN conference( 2006), 2006.
9Relationship Extraction [ EB/OL]. [ 2008 - 05 - 30 ]. http ://en. wikipedia, org/wiki/Relationship_extraction.
10The ACE 2004 Evaluation Plan[ EB/OL]. [ 2008 - 05 - 30 ]. http://www, nist. gov/speech/tests/ace/2004/doc/ace04 - evalplan - v7. pdf.

共引文献1371

1陈财森,向阳霞,寇应展,刘会英.面向装备作战数据的知识图谱平台构建[J].装甲兵学报,2022(5):105-110. 被引量：1
2袁野,刘佳伟,赵惠浞,左志平,葛超,朱晋锐.基于知识图谱的钢厂设备故障智能诊断技术研究与应用[J].冶金设备,2023(S02):20-25.
3何宏,葛张鹏,徐小良,夏一行,王宇翔.基于知识图谱语义查询技术的科技咨询服务研究[J].信息与管理研究,2019,4(4):86-96.
4李华昱,付亚凤,闫阳,李家瑞.基于LEBERT的多模态领域知识图谱构建[J].计算机系统应用,2022,31(11):79-90. 被引量：2
5曹艳琴.基于深度学习的英语自然语言处理系统[J].系统仿真技术,2021,17(4):285-288. 被引量：1
6吴雅娟,杨壮壮,尚福华,解红涛,杜睿山.学习仪表盘在油田射孔取心工培训系统中的应用[J].系统仿真技术,2021,17(1):17-21.
7熊回香,严舞月.基于知识图谱的数字档案服务模式探究[J].知识管理论坛,2021(4):204-212. 被引量：3
8冯鑫,李雪,闫月,李佳培,刘梦瑶,吴晔.基于知识实体的突发公共卫生事件数据平台构建研究[J].知识管理论坛,2020(3):175-190. 被引量：2
9郭嘉欣.基于多源异构数据挖掘的“红色记忆”知识图谱构建[J].知识管理论坛,2020(1):59-68. 被引量：11
10徐安迎,胡孔法,杨涛.基于Neo4j的肺癌中医诊疗知识图谱构建研究[J].世界科学技术-中医药现代化,2023,25(4):1456-1461. 被引量：9

同被引文献46

1贾宝林,尹世群,王宁朝.基于门控多层感知机的端到端实体关系联合抽取[J].中文信息学报,2023,37(3):143-151. 被引量：3
2俞敬松,魏一,张永伟,杨浩.基于非参数贝叶斯模型和深度学习的古文分词研究[J].中文信息学报,2020(6):1-8. 被引量：16
3程宁,李斌,葛四嘉,郝星月,冯敏萱.基于BiLSTM-CRF的古汉语自动断句与词法分析一体化研究[J].中文信息学报,2020(4):1-9. 被引量：21
4陈悦,陈超美,刘则渊,胡志刚,王贤文.CiteSpace知识图谱的方法论功能[J].科学学研究,2015,33(2):242-253. 被引量：7180
5王声培,云雅娟.洛特卡定律、普赖斯定律和我国数学科学文献[J].图书情报工作,1994,38(3):21-24. 被引量：41
6汤建民.基于文献计量的卓越科研机构描绘方法研究——以国内教育学科为例[J].情报杂志,2010,29(4):5-9. 被引量：22
7朱晓,金力.条件随机场图模型在《明史》词性标注研究中的应用效果探索[J].复旦学报（自然科学版）,2014,53(3):297-304. 被引量：9
8郭宇,王晰巍,贺伟,杨梦晴.基于文献计量和知识图谱可视化方法的国内外低碳技术发展动态研究[J].情报科学,2015,33(4):139-148. 被引量：64
9刘峤,李杨,段宏,刘瑶,秦志光.知识图谱构建技术综述[J].计算机研究与发展,2016,53(3):582-600. 被引量：956
10徐增林,盛泳潘,贺丽荣,王雅芳.知识图谱技术综述[J].电子科技大学学报,2016,45(4):589-606. 被引量：506

引证文献6

1王春亮,姚洁仪,李昭.融合MacBERT和Talking⁃Heads Attention实体关系联合抽取模型[J].现代电子技术,2024,47(5):127-131.
2何静,赵睿,张恒硕.知识图谱的可视化文献计量分析[J].计算机科学,2024,51(S01):1-10. 被引量：1
3李智杰,杨盛杰,李昌华,张颉,董玮,介军.基于BERT古文预训练模型的实体关系联合抽取[J].计算机系统应用,2024,33(8):187-195.
4唐贤伦,丁河长,唐瑜泽,谢涛,罗洪平.基于异构图和语义融合的实体关系抽取[J].实验技术与管理,2024,41(8):22-29.
5张强,曾俊玮,陈锐.基于对比学习与梯度惩罚的实体关系联合抽取模型[J].吉林大学学报（理学版）,2024,62(5):1155-1162.
6张宇,李书琴.低资源场景下苹果种植领域实体关系联合抽取模型[J].农业工程学报,2024,40(16):188-195.

二级引证文献1

1盛欣,杜彦春,张英媛.留守儿童健康问题体育干预相关研究的可视化分析[J].当代体育科技,2024,14(18):185-187.

1于小四.数字孪生技术在轨道交通智能建造中的实践探索[J].轨道交通,2023(1):38-40.
2张云飞,郭俊杰.信息抽取赋能地质调查发展综述[J].电脑知识与技术,2023,19(14):102-105.
3张朝阳.基于BERT的非招标采购实体关系抽取研究[J].信息通信技术与政策,2023,49(6):2-9.
4石小明.新课改下小学数学教学方法的创新研究[J].大众文摘,2022(12):77-79.
5苏鑫.基于BERT的远洋运输询盘命名实体识别方法[J].世界海运,2023,46(6):9-13. 被引量：1
6鲍钰清.指向幼儿深度学习的高质量师幼互动要素——基于CLASS评估系统的视角[J].福建教育,2023(20):39-42.
7任小强,王东灿,王浩宇,林慧琼.基于改进LightGBM的车辆碰撞检测模型研究[J].兰州职业技术学院学报,2023,39(3):87-91. 被引量：1
8陈佳威,吴茂念,彭蔚,朱绍军,郑博.山河纵横交错的工业园区能源多目标优化模型[J].黑龙江工业学院学报（综合版）,2023,23(4):78-89.
9周逸云,万新军,胡伏原,陈昊.基于联合注意与特征关联的实例分割算法[J].计算机工程,2023,49(6):217-226. 被引量：2
10毕昌萍,杨吉.新时代乡村优秀传统文化“两创”路径探析[J].安徽商贸职业技术学院学报,2023,22(2):1-5.

电子学报

2023年第4期

浏览历史

内容加载中请稍等...

基于深度学习的实体关系联合抽取研究综述被引量：6

参考文献20

二级参考文献119

共引文献1371

同被引文献46

引证文献6

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于深度学习的实体关系联合抽取研究综述 被引量：6

参考文献20

二级参考文献119

共引文献1371

同被引文献46

引证文献6

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于深度学习的实体关系联合抽取研究综述被引量：6