期刊文献+

面向武器装备领域的实体及关系抽取方法研究 被引量:1

Research on Entity and Relation Extraction for Weapons and Equipment Field
下载PDF
导出
摘要 实体及关系抽取是实现海量数据知识化的关键,而现有实体及关系抽取方法应用于垂直领域时,表现出的效果很难达到实装应用水平。针对武器装备领域,文中在分析该领域文本数据特征的基础上,提出基于预训练模型与规则知识结合的武器装备实体及关系抽取方法,由实体抽取和关系抽取两个阶段组成。在实体抽取阶段,首先,利用BERT+BiLSTM+CRF模型完成武器装备实体的识别;然后,通过规则知识对领域性实体补充抽取。在关系抽取阶段,首先,利用BERT+BiGRU+CNN模型抽取武器装备实体间关系;然后,经过滤调模块对实体间关系抽取结果过滤和调整;最后,设计强领域性的关系抽取规则,用于实体间关系的补充抽取。在仿真数据集上对本文方法评测,结果表明在实体识别和关系抽取上的F1值分别为96.4%和95.1%,与基线相比均提升了约10%。同时,文中提出的实体及关系抽取方法可作为一种通用解决方案,推广至其他垂直领域。 Entity and relation extraction is the key to realize the knowledge of massive data.However,when the existing entity and relation extraction methods are applied to the vertical field,the results can hardly reach the level of practical application.Aiming at the field of weapon equipment,this paper proposes a method of extracting weapon equipment entities and relations based on the combination of pre-training model and rule-based knowledge on the analysis of the characteristics of text datasets in this field.The method consists of two stages:entity extraction and relation extraction.In the entity extraction stage,first,using BERT+BiLSTM+CRF model to complete the identification of weapon and equipment entities,secondly using rule-based knowledge to supplement the extraction of domain entities.In the relation extraction stage,first,the BERT+BiGRU+CNN model is used to extract the relation between the entities of weapons and equipmen,secondly,the filtering and adjustment module is used to filter and adjust the relation extracted in the previous step,finally,the strong domain-oriented rules are designed to extract the sematic relation between entities.The proposed extraction method was evaluated on the simulation data set.The F1 values of entity recognition and relationship extraction in the proposed method are 96.4%and 95.1%,respectively,both of which are improved by about 10%compared with the baseline.Meanwhile,the entity and relationship extraction method proposed in this paper can be used as a general solution and extended to other verticals.
作者 段文昱 朱继召 赵浩楠 黄友澎 范纯龙 DUAN Wen-yu;ZHU Ji-zhao;ZHAO Hao-nan;HUANG You-peng;FAN Chun-long(Shenyang Aerospace University,Shenyang 110136,China;Wuhan Digital Engineering Research Institute,Wuhan 430074,China;Key Laboratory of Network Data Science and Technology,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China)
出处 《中国电子科学研究院学报》 北大核心 2022年第12期1165-1172,共8页 Journal of China Academy of Electronics and Information Technology
关键词 知识图谱 武器装备 预训练 实体关系抽取 knowledge graph weaponry pre-training entity and relation extraction
  • 相关文献

参考文献9

二级参考文献46

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2陈康,武港山.基于Ontology的信息检索技术研究[J].中文信息学报,2005,19(2):51-57. 被引量:29
  • 3L I Zhanjun, Ramani K. Ontology-based Design Information Extnnction and Retrieval[J]. Artificial Intelligence for Engineering Design, Analysis and Manufacturing ,2007 ( 2 ) : 137 - 154.
  • 4GATE. Developing Language Processing Components with GATFZ Version 5 ( a User Guide ) [ EB/OL]. 2009 - 11. http ://gate. ae. uk/.
  • 5Protege. user docutnentation [ EB/OL]. http ://protegc. stanford. edu/.
  • 6ICTCLAS. ICTCLAS[ GP]. http://ictelas, org/index, htmt.
  • 7ACE. ACE (Automatic Content Extraction) English Annotation Guidelines for Relations[ EB/OL]. http ://www. itl. hist. gov/iad/ mig/tests/ace/.
  • 8车万翔,刘挺,李生.实体关系自动抽取[A].第一届全国内容安全与信息检索学术会议[C].上海,2004.
  • 9庄成龙,钱龙华,周国栋.基于树核函数的实体语义关系抽取方法研究[J].中文信息学报,2009,23(1):3-8. 被引量:27
  • 10姜文志,顾佼佼,胡文萱,王彦.基于多模型结合的军事命名实体识别[J].兵工自动化,2011,30(10):90-93. 被引量:11

共引文献50

同被引文献26

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部