摘要
在实体关系抽取任务中,通常采用远程监督(distant supervision,DS)数据集,远程监督方法能通过大规模语料库自动标注数据来扩张数据集,但这无疑会使数据集充满大量的噪声。为此,该文将深度残差网络(deep residual network,ResNet)应用到关系提取的远程监督数据集上,通过加深网络层数来提高模型降噪能力。同时,提出了Gate模块,有效提高了深度残差网络的性能。该模块可以学习到每个特征通道的重要性,通过权重增强或抑制各个特征通道的比重,从而防止过拟合。另外,为了进一步解决数据集降噪问题,还提出了一种双池化层的池化层新方案。实验结果表明所提方法相比于目前效果较好的PCNN+ATT模型,在准确率和召回率上都有3%左右的提升。
In the entity relationship extraction task,the distant supervision data set with substantial noise is often used.This paper applies ResNet to the distant supervision data set of relation extraction,to exploit its denoising ability by deepening the network.This paper also proposes a Gate module that can effectively improve the performance of deep residual networks,which can learn the importance between each feature channel.In addition,in order to further reduce the noise,this paper also proposes a new pooling layer called double pooling layer.The experimental results show that the proposed method achieves an improvement of 3%in precision and recall rate compared with the PCNN+ATT model.
作者
袁祯祺
宋威
陈璟
YUAN Zhenqi;SONG Wei;CHEN Jing(School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China;Engineering Research Center of Internet of Things Technology Applications of Ministry of Education,Jiangnan University,Wuxi,Jiangsu 214122,China)
出处
《中文信息学报》
CSCD
北大核心
2019年第10期57-63,共7页
Journal of Chinese Information Processing
基金
江苏省青年科学基金(BK20150159)
国家自然科学基金(61673193)
中国博士后科学基金(2017M621625)
江苏省自然科学基金(BK20181341)
关键词
实体关系提取
远程监督
深度残差网络
relationship extraction
distant supervision
deep residual network