期刊文献+

基于预训练BERT的军事领域目标实体深度学习识别方法 被引量:4

Deep Learning Recognition Method for Target Entity in Military Field Based on Pre-Trained BERT
下载PDF
导出
摘要 军事领域非结构化文本中的大量目标实体往往包含丰富的军事信息和军事知识,对其准确识别是进行军事信息抽取和军事知识组织的基础性关键任务,也是构建军事知识图谱的重要环节。针对军事领域标注数据不足以及军事实体边界模糊的问题,提出基于预训练BERT模型的深度学习识别方法。利用BERT生成基于当前输入语境特征的动态字向量来增强字的语义表示,融合字的含边界词性特征得到特征融合向量,再连接BiLSTM-CRF神经网络。在自建的军事领域标注数据集上的实验结果表明,相较于另外两种基准方法,该方法在准确率、召回率和F值上获得了更优的表现。 A large number of target entities in unstructured texts of the military field often contain a wealth of military information and military knowledge.Accurate identification of them is a fundamental and key task for military information extraction and military knowledge organization,as well as an important link in the construction of a military knowledge graph.To address the problems of insufficient annotation data in the military field and fuzzy boundaries of military entities,a deep learning recognition method based on the pre-trained BERT model is proposed.This method uses BERT to generate dynamic character vectors based on the current input context features to enhance the semantic representation of the characters,fuses the boundary part-of-speech features of the characters to obtain the feature fusion vectors,and then connects the BiLSTM-CRF neural network.The experimental results on the self-built military annotation data set show that this method has better performance in accuracy,recall and F-value,compared with the other two benchmark methods.
作者 张乐 李健 唐亮 易绵竹 ZHANG Le;LI Jian;TANG Liang;YI Mianzhu(Luoyang Campus,Information Engineering University,Luoyang 471003,China)
出处 《信息工程大学学报》 2021年第3期331-337,共7页 Journal of Information Engineering University
关键词 军事领域 实体识别 预训练BERT 深度学习 military field entity recognition pre-trained BERT deep learning
  • 相关文献

参考文献10

二级参考文献56

  • 1孙宏纲,姚景顺,闫国玉.基于自然语言理解的军事自动标图系统[J].火力与指挥控制,2005,30(S1):27-29. 被引量:7
  • 2胡斌,汤伟,刘晓明.基于自然语言理解的文本标图系统设计与实现[J].解放军理工大学学报(自然科学版),2005,6(2):132-136. 被引量:9
  • 3张晓艳,王挺,陈火旺.命名实体识别研究[J].计算机科学,2005,32(4):44-48. 被引量:65
  • 4刘非凡,赵军,吕碧波,徐波,于浩,夏迎炬.面向商务信息抽取的产品命名实体识别研究[J].中文信息学报,2006,20(1):7-13. 被引量:47
  • 5俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量:153
  • 6Lafferty J, McCallum A, Pereira E Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data[C].In: Proc. ICML 2001,New York:ACM,2001.
  • 7Chang P C, M. Galley, C. D. Manning, Optimizing Chinese word Segmentation for machine translation performance[C].In: Proc of the Third Workshop on Statistical Machine Translation, Columbus, Ohio: Association for Computational Linguistics.2008:224-232.
  • 8季妲,罗振声.基于反比概率模型和规则的中文姓名自动辨识系统[C].自然语言理解与机器翻译全国第六届计算语言学联合学术会议论文集,2001.
  • 9CRF++: Yet another crf toolkit, http://crfpp. sourceforge. net/.
  • 10Tjong Kim Sang E F, De Meulder F. Introduction to the CoNLL- 2003 shared task: Language-lndependent named entity recogni- tion[C] // Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003-Volume 4. Associa- tion for Computational Linguistics, 2003 : 142-147.

共引文献119

同被引文献37

引证文献4

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部