期刊文献+

BGCN:基于BERT和图卷积网络的触发词检测 被引量:6

BGCN:Trigger Detection Based on BERT and Graph Convolution Network
下载PDF
导出
摘要 触发词检测是事件抽取的一项基本任务,该任务涉及对触发词进行识别和分类。目前,已有工作主要存在两方面的问题:1)用于触发词检测的神经网络模型只考虑了句子的顺序表示,且通过顺序建模的方法在捕捉长距离依赖关系时效率较低;2)基于表示的方法虽然解决了手动提取特征的问题,但用作初始训练特征的词向量对句子的表示程度有所欠缺,难以捕捉深层的双向表征。因此,文中提出了一种基于BERT模型和GCN网络的触发词检测模型BGCN,该模型通过引入BERT词向量来强化特征表示,并引入句法结构来捕捉长距离依赖,对事件触发词进行检测。实验结果表明,所提方法在ACE2005数据集上的表现优于其他现有的神经网络模型。 Trigger word detection is a basic task of event extraction,which involves the recognition and classification of trigger words.There are two main problems in the previous work:(1)the neural network model for trigger word detection only consi-ders the sequential representation of sentences,and the sequential modeling method is inefficient in capturing long-distance dependencies;(2)although the representation-based method overcomes the problem of manual feature extraction,the word vector used as the initial training feature lacks the degree of representation of the sentence,so it is difficult to capture the deep two-way representation.Therefore,we propose a trigger word detection model BGCN,based on BERT model and GCN network.This model strengthens the feature representation by introducing BERT word vector,and introduces syntactic structure to capture long-distance dependencies and detect event trigger words.Experimental results show that our method outperforms other existing neural network models on ACE2005 datasets.
作者 程思伟 葛唯益 王羽 徐建 CHENG Si-wei;GE Wei-yi;WANG Yu;XU Jian(School of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094,China;Key Laboratory of Information System Engineering,28th Research Institute of China Electronic Science and Technology Group Corporation 210007,China)
出处 《计算机科学》 CSCD 北大核心 2021年第7期292-298,共7页 Computer Science
基金 国家自然科学基金(61872186) 信息系统工程重点实验室开放基金(05201901)。
关键词 BERT 双向LSTM 图卷积网络 序列标注 事件触发词 BERT Bi-LSTM Graph convolution network Sequence annotation Event trigger
  • 相关文献

参考文献2

二级参考文献23

  • 1ACE(Automatic Content Extraction) Chinese Annotation Gui - delines for Events [M]. National Institute of Standards and Technology, 2005.
  • 2Surdeanu M, Harabagiu S, Williams J, et al. Using Predicate-Argument Structures for Information Extraction[C]// Proceedings of ACL. 2003,8-15.
  • 3Surdeanu M, Harabagiu S. Infrastructure for open-domain information extraction [C]//Proceedings of the Human Language Technology Conference. 2002 : 325-330.
  • 4Chieu Hal Leong, Ng Hwee Tou. A Maximum entropy Ap - proach to Information Extraction from Semi-Structured and Free Text[C]//Proceedings of the 18th National Conference on Artificial Intelligence. 2002:786-791.
  • 5Ahn D. The Stages of Event Extraction[C]//Proceedings of the Workshop on Annotations and Reasoning about Time and Events. 2006 : 1-8.
  • 6Ding C, He Xiaofeng. Cluster Merging and Splitting in Hierarchical Clustering Algorithms [A] // Proceedings of the 2002 IEEE International Conference on Data Mining[C]. Maebashi City,Japan: Maebashi TERRSA, 2002 : 139-146.
  • 7Ding C, He X, Zha H, et al. A Min-Max Cut Algorithm for Graph Partitioning and Data Clustering[A]//Proceedings of the IEEE Internationl Conference [C]. San Jose, California, USA:Data Mining,2001 ; 107-114.
  • 8Riloff E.Automatically Generating Extraction Patterns fromUntagged Text[C]∥Proceedings of the Thirteenth National Conference on Artificial Intelligence.1996:1044-1049.
  • 9Yangarber R,Grishman R,Tapanainen P,et al.Automatic Acquisition of Domain Knowledge for Information Extraction[C]∥Proceedings of the 18th Conference on Computational linguistics.2000:940-946.
  • 10Yangarber R.Counter-Training in Discovery of Semantic Pat-terns[C]∥Proceedings of ACL 2003.2003:343-350.

共引文献11

同被引文献46

引证文献6

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部