期刊文献+

基于参数迁移的领域命名实体识别方法 被引量:2

Domain Named Entity Recognition Method Based on Parameter Transfer Learning
下载PDF
导出
摘要 [目的/意义]命名实体识别是自然语言处理领域中的基础任务,基于深度学习的方法在通用领域的命名实体中取得了显著成果,但在特定领域识别效果不佳。为了解决工业信息化领域标注数据不足,数据特征差异较大、模型难以扩展的问题,首先提出了一种基于Transformer的有限区间命名实体识别模型。[方法/过程]采用预训练模型对文本进行分布式表示,然后利用基于有限区间的标注方法对输入序列进行标注,解决传统标注法在训练过程中可能导致的序列标注不一致的问题。在此基础上,引入迁移学习策略,采用参数共享的方式,将通用领域的命名实体识别模型迁移到工业信息化领域,并在工业信息化领域数据集上进行微调,最终获得在工业信息化领域上表现良好的模型。[结果/结论]实验结果表明,本文提出的有限区间命名实体识别模型在工业信息化领域数据集上的准确率较基线模型提高了8.7%,基于参数迁移的领域命名实体识别方法在人民日报语料和工业信息化领域数据集上的准确率和综合指标F值相较未使用迁移学习的模型分别提高了3.1%和1.1%,证明了迁移策略的有效性。 [Objective/Significance]Named entity recognition is a fundamental task in natural language processing,and deep learning-based methods have achieved remarkable results in general domains,but not in specific domains.Aiming at the problems of insufficient labeling samples,quite differences in data features and difficulty in model expansion,this paper introduces a limited span-based transformer classifier for named entity recognition model(Span-based Transformer Classifier for Named Entity Recognition,STCNER).[Methods/Process]The model takes advantage of the features extraction of Encoder in Transformer and combines with the limited span-based labeling method,which solves the problem of the sequence labeling inconsistency caused by traditional labeling method in the training process.On this basis,then introduce the transfer learning strategy which adopt the parameter sharing method to transfer the named entity recognition model in general domains to the specific domains.After fine-tuning it on the domain-specific dataset,the model performs well in specific domain.[Results/Conclusions]The experimental results show that the accuracy of STCNER model is 8.7%higher than the baseline model on the dataset in the industrial informatization field.Compared with the model without transfer learning,the accuracy and F-scores are improved by 3.1%and 1.1%respectively on the corpus of People's Daily and the data set in the industrial informatization field,which proves the effectiveness of the transfer strategy.
作者 孙新 任翔渝 郑洪超 杨凯歌 SUN Xin;REN XiangYu;ZHENG Hongchao;YANG Kaige(School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China;The Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content,Beijing 100036,China)
出处 《情报工程》 2022年第3期13-27,共15页 Technology Intelligence Engineering
基金 富媒体数字出版内容组织与知识服务重点实验室开放基金项目“基于模糊粗糙集理论的远程监督关系抽取研究”(ZD2021-11/06)。
关键词 命名实体识别 深度学习 迁移学习 预训练语言模型 Named entity recognition deep learning transfer learning pre-trained language model
  • 相关文献

参考文献7

二级参考文献112

  • 1李妮,关焕梅,杨飘,董文永.基于BERT-IDCNN-CRF的中文命名实体识别方法[J].山东大学学报(理学版),2020,55(1):102-109. 被引量:54
  • 2向晓雯,史晓东,曾华琳.一个统计与规则相结合的中文命名实体识别系统[J].计算机应用,2005,25(10):2404-2406. 被引量:37
  • 3姜维,王晓龙,关毅,赵健.基于多知识源的中文词法分析系统[J].计算机学报,2007,30(1):137-145. 被引量:29
  • 4Ben-David S,Blitzer J,Crammer K,Pereira F.Analysis of representations for domain adaptation.In:Platt JC,Koller D,Singer Y,Roweis ST,eds.Proc.of the Advances in Neural Information Processing Systems 19.Cambridge:MIT Press,2007.137-144.
  • 5Blitzer J,McDonald R,Pereira F.Domain adaptation with structural correspondence learning.In:Jurafsky D,Gaussier E,eds.Proc.of the Int’l Conf.on Empirical Methods in Natural Language Processing.Stroudsburg PA:ACL,2006.120-128.
  • 6Dai WY,Xue GR,Yang Q,Yu Y.Co-Clustering based classification for out-of-domain documents.In:Proc.of the 13th ACM Int’l Conf.on Knowledge Discovery and Data Mining.New York:ACM Press,2007.210-219.[doi:10.1145/1281192.1281218].
  • 7Dai WY,Xue GR,Yang Q,Yu Y.Transferring naive Bayes classifiers for text classification.In:Proc.of the 22nd Conf.on Artificial Intelligence.AAAI Press,2007.540-545.
  • 8Liao XJ,Xue Y,Carin L.Logistic regression with an auxiliary data source.In:Proc.of the 22nd lnt*I Conf.on Machine Learning.San Francisco:Morgan Kaufmann Publishers,2005.505-512.[doi:10.1145/1102351.1102415].
  • 9Xing DK,Dai WY,Xue GR,Yu Y.Bridged refinement for transfer learning.In:Proc.of the Ilth European Conf.on Practice of Knowledge Discovery in Databases.Berlin:Springer-Verlag,2007.324-335.[doi:10.1007/978-3-540-74976-9_31].
  • 10Mahmud MMH.On universal transfer learning.In:Proc.of the 18th Int’l Conf.on Algorithmic Learning Theory.Sendai,2007.135-149.[doi:10,1007/978-3-540-75225-7_14].

共引文献583

同被引文献18

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部