Survey on Document-Level Relation Extraction (文档级关系抽取方法的研究进展)

Cited by: 1
Abstract: Relation extraction (RE) is a fundamental research task in natural language processing. Its results can be applied to downstream tasks such as knowledge graph construction, question answering, and semantic search, so RE has a wide range of application scenarios and significant research value. In recent years RE research has produced fruitful results, but the vast majority of studies are limited to sentence-level RE. Research shows that a large number of relations cannot be extracted from a single sentence. With the continued development of deep learning and natural language processing, document-level RE faces a new round of opportunities and challenges. This paper classifies and reviews recent advances in document-level RE, distills a general technical roadmap for the task, and analyzes the feature encoding and feature aggregation methods used in existing work. According to the type of feature extracted, document-level RE methods are grouped into three categories: methods based on lexical features, methods based on syntactic features, and methods based on relational features. The paper also introduces the commonly used document-level RE datasets and evaluation metrics and discusses future research trends.
Authors: ZHOU Youhua (周友华); HUANG Han (黄翰); LIU Haolong (刘浩龙); HAO Zhifeng (郝志峰). Affiliations: School of Software Engineering, South China University of Technology, Guangzhou 510006, Guangdong, China; School of Mathematics and Big Data, Foshan University, Foshan 528225, Guangdong, China.
Source: Journal of South China University of Technology (Natural Science Edition) (《华南理工大学学报(自然科学版)》), 2022, No. 4, pp. 10-25 (16 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61876207); Fundamental Research Funds for the Central Universities (2020ZYGXZR014).
Keywords: document-level; relation extraction; feature encoding; feature aggregation