摘要
中文事件抽取通常使用循环神经网络(recurrent neural network,RNN)来进行事件和事件要素的抽取,但RNN在处理长度较长的词语时容易丢失重要信息,为此提出一种组合卷积神经网络(convolutional neural network,CNN)与双向长短期记忆(bidirectional long short-term memory,Bi-LSTM)网络的中文事件抽取模型CNN-Bi-LSTM-CRF,其中CRF(conditional random field)为条件随机场.采用基于注意力机制和语义特征生成的字词联合向量,使用CNN和Bi-LSTM模型对字词联合向量进行处理,以获取其隐含表示,最后通过CRF得出预测结果.实验结果表明,所提出的方法与其他现有的中文事件抽取方法相比,准确率有明显提升.
The recurrent neural network is widely used in the event extraction of Chinese text to extract events and event elements,but it usually loses essential information when processing long words.In this study,the convolutional neural network(CNN)and the bidirectional long short-term memory(Bi-LSTM)network were combined to develop a novel event extraction model known as CNN-Bi-LSTM-conditional random field(CRF).A joint vector of characters and words was adopted based on the attention mechanism and semantic features,and the CNN and Bi-LSTM models were used to process the vector to obtain its implicit representation.Finally,the CRF was used to obtain the prediction results.The experimental results show that the proposed method is more accurate than other existing event extraction methods in extracting Chinese text.
作者
季忠祥
吴悦
JI Zhongxiang;WU Yue(School of Computer Engineering and Science,Shanghai University,Shanghai 200444,China)
出处
《上海大学学报(自然科学版)》
CAS
CSCD
北大核心
2021年第3期535-543,共9页
Journal of Shanghai University:Natural Science Edition
关键词
事件抽取
卷积神经网络
长短期记忆
event extraction
convolutional neural network
long short-term memory