期刊文献+

中文分词与命名实体识别的联合学习 被引量:9

Joint learning of Chinese word segmentation and named entity recognition
下载PDF
导出
摘要 将卷积结构引入循环神经网络,从而构建卷积循环神经网络。以此为基础,研究构建了面向中文分词与实体识别联合学习的序列标注模型。该模型依托卷积循环神经网络构建特征编码层,实现中文字序列局部空间特征和长距离时序依赖特征的联合提取;依托改进的循环神经网络构建标签解码层,实现标签序列长距离时序依赖的有效建模;依托统一的分词与实体识别序列标注模式实现分词信息与实体信息的联合学习,避免传统流水线法的误差传播问题。在人民日报语料和微软标注语料上的实验结果显示,该框架较传统统计模型和神经网络模型有显著的性能提升,尤其是在识别字数较多的命名实体时,其效果明显优于其他方法。 The convolutional structure was introduced into the recurrent neural network to construct a convolutional recurrent neural network.Based on this network,a sequence annotation model for joint learning of Chinese word segmentation and entity recognition was constructed.The model relies on the convolutional recurrent neural network to construct feature-encoding layer,which realizes the joint extraction of local spatial features and long-distance time-dependent features of Chinese character sequences;the improved recurrent neural network was relies on the constructing of tag-decoding layer,which realizes the effective modeling of timing-dependent features in the tag sequences;the unified word segmentation and entity recognition annotation mode relies on the achieving of joint learning of word segmentation information and entity information,which avoids the error propagation problem of traditional pipeline methods.Experimental results on the People′s Daily corpus and Microsoft′s annotated corpus show that the framework has significant performance improvement over traditional statistical models and neural network models,especially when identifying entities with multiple characters,and its effect is significantly better than other methods.
作者 黄晓辉 乔立升 余文涛 李京 薛寒 HUANG Xiaohui;QIAO Lisheng;YU Wentao;LI Jing;XUE Han(College of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China;Luoyang Campus of the Information Engineering University of the Strategic Support Force, Luoyang 471003, China)
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2021年第1期86-94,共9页 Journal of National University of Defense Technology
基金 国家重点研发计划资助项目(2016YFB0201402)。
关键词 卷积循环神经网络 局部空间特征 时序依赖特征 分词与实体识别 convolutional recurrent neural network local spatial features time-dependent features word segmentation and entity recognition
  • 相关文献

参考文献6

二级参考文献45

共引文献488

同被引文献107

引证文献9

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部