摘要
将卷积结构引入循环神经网络,从而构建卷积循环神经网络。以此为基础,研究构建了面向中文分词与实体识别联合学习的序列标注模型。该模型依托卷积循环神经网络构建特征编码层,实现中文字序列局部空间特征和长距离时序依赖特征的联合提取;依托改进的循环神经网络构建标签解码层,实现标签序列长距离时序依赖的有效建模;依托统一的分词与实体识别序列标注模式实现分词信息与实体信息的联合学习,避免传统流水线法的误差传播问题。在人民日报语料和微软标注语料上的实验结果显示,该框架较传统统计模型和神经网络模型有显著的性能提升,尤其是在识别字数较多的命名实体时,其效果明显优于其他方法。
The convolutional structure was introduced into the recurrent neural network to construct a convolutional recurrent neural network.Based on this network,a sequence annotation model for joint learning of Chinese word segmentation and entity recognition was constructed.The model relies on the convolutional recurrent neural network to construct feature-encoding layer,which realizes the joint extraction of local spatial features and long-distance time-dependent features of Chinese character sequences;the improved recurrent neural network was relies on the constructing of tag-decoding layer,which realizes the effective modeling of timing-dependent features in the tag sequences;the unified word segmentation and entity recognition annotation mode relies on the achieving of joint learning of word segmentation information and entity information,which avoids the error propagation problem of traditional pipeline methods.Experimental results on the People′s Daily corpus and Microsoft′s annotated corpus show that the framework has significant performance improvement over traditional statistical models and neural network models,especially when identifying entities with multiple characters,and its effect is significantly better than other methods.
作者
黄晓辉
乔立升
余文涛
李京
薛寒
HUANG Xiaohui;QIAO Lisheng;YU Wentao;LI Jing;XUE Han(College of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China;Luoyang Campus of the Information Engineering University of the Strategic Support Force, Luoyang 471003, China)
出处
《国防科技大学学报》
EI
CAS
CSCD
北大核心
2021年第1期86-94,共9页
Journal of National University of Defense Technology
基金
国家重点研发计划资助项目(2016YFB0201402)。
关键词
卷积循环神经网络
局部空间特征
时序依赖特征
分词与实体识别
convolutional recurrent neural network
local spatial features
time-dependent features
word segmentation and entity recognition