期刊文献+

基于BERT-BLSTM-CRF的政务领域命名实体识别方法 被引量:6

A Method of Named Entity Recognition in Government Affairs Based on BERT-BLSTM-CRF
下载PDF
导出
摘要 政务领域的命名实体通常是一些政务事项名,这类实体与开放域实体比较,具有长度较长、实体并列、别称等特点,目前还未见公开可用的训练数据集。构建了具有25176个句子的政务领域命名实体识别数据集,并提出一种基于BERT-BLSTM-CRF的神经网络识别模型,该模型在不依赖人工特征选择的情况下,使用BERT中文预训练模型,然后采用BLSTM-CRF识别实体。实验结果表明,该模型识别效果优于CRF,BLSTM-CRF,CNN-BLSTM-CRF,F1值达到92.23%。 The named entities in the government affairs are some service items,and they have the characteristics of long length,entity juxtaposition,abbreviations,nicknames,etc.At present,there is no publicly available training data set.In this paper,a government domain named entity recognition data set with 25176 sentences was constructed,and a neural network method based on BERT-BLSTM-CRF was proposed.In this model,BERT Chinese pre-training model was used without relying on the selection of artificial features,and then BLSTM-CRF was used for named entity recognition.The experimental results show that the recognition accuracy is better than that of CRF,BLSTM-CRF,CNN-BLSTM-CRF,and the F1 value reaches 92.23%.
作者 杨春明 魏成志 张晖 赵旭剑 李波 YANG Chunming;WEI Chenzhi;ZHANG Hui;ZHAO Xujian;LI Bo(School of Computer Science and Technology,Southwest University of Science and Technology,Mianyang 621010,Sichuan,China;School of Science,Southwest University of Science and Technology,Mianyang 621010,Sichuan,China;Sichuan Big Data and Intelligent System Engineering Technology Research Center,Mianyang 621010,Sichuan,China)
出处 《西南科技大学学报》 CAS 2020年第3期86-91,共6页 Journal of Southwest University of Science and Technology
基金 教育部人文社科基金(17YJCZH260) 赛尔网络下一代创新项目(NGII20170901,NGII20180403)。
关键词 政务事务 命名实体识别 BLSMT CRF BERT Government affairs Named entity recognition BLSTM CRF BERT
  • 相关文献

参考文献10

二级参考文献83

共引文献312

同被引文献58

引证文献6

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部