摘要
针对目前大多数关系抽取模型中局部特征及全局特征利用不充分的缺点,该文提出一种采用多尺度注意力机制的远程监督关系抽取模型。在词语层面,通过在池化层构建权重矩阵来衡量词语与关系的相关程度,从而捕捉句子中重要的语义特征;在句子层面,采用注意力机制将预测关系与句子进行相关性比较,获得句子级别的重要信息。模型在NYT数据集上平均准确率达到78%,表明该模型能够有效地利用多尺度特征,并且提高远程关系抽取任务的准确率。
To best exploit the local and global feature,we proposed a distant supervised relation extraction model based on multi-level attention mechanism.We employ an attention matrix in pooling layer to capture the word-level sematic feature which indicates the relevant relationship between input words and relations.Moreover,we adopted sentence-level attention mechanism to compare the relationship between sentences and predicted relations.Experimental results show that the mean accuracy of the proposed model achieves 78% in the NYT data set,indicating an effective use of multi-level feature and better performance of distant relation extraction task.
作者
蔡强
郝佳云
曹健
李海生
CAI Qiang;HAO Jiayun;CAO Jian;LI Haisheng(School of Computer and Information Engineering, Beijing Technology and Business University, Beijing 100048, China;Beijing Key Laboratory of Big Data Technology for Food Safety,Beijing Technology and Business University, Beijing 100048, China)
出处
《中文信息学报》
CSCD
北大核心
2018年第1期96-101,共6页
Journal of Chinese Information Processing
基金
北京市教委科研计划一般项目(SQKM201610011010)
北京市自然科学基金(4162019)
北京市科技计划课题(Z161100001616004)
关键词
多尺度
注意力机制
远程监督模型
关系抽取
multi-level
attention mechanism
distant supervision model
relation extraction