期刊文献+

基于BERT的突发事件文本自动标注方法 被引量:1

Automatic-annotation method for emergency text corpus based on BERT
下载PDF
导出
摘要 信息提取技术是自然语言处理技术的关键技术之一其中最主要的任务是事件元素提取。本文利用深度学习网络模型实现信息提取任务进行了深入研究。训练数据来源于上海大学构建的CEC已标注的语料库。相比于采用手工设立规则的识别方式和BiLSTM网络模型本文通过对数据进行预处理和搭建BERT-BiLSTM-CRF深度网络模型,对文本数据训练实现标注,在时间、报道时间、参与对象的识别准确率上均有所提升。 Information Extraction is one of the most important technology in Natural Language Process,which mainly job is extract the events element.This paper proposes a deep learning network method to solve this task.The training data comes from CEC corpus which was built by Shanghai University.In this experiment,compared with rule-based annotation method and Bi-LSTM network method,showing that using BERT+BiLSTM+CRF model can improve the efficiency of event extraction effectively.
作者 杨芷婷 马汉杰 YANG Zhiting;MA Hanjie(School of Information Science and Technology,Zhejiang Sci-Tech University,Hangzhou 310018,China)
出处 《智能计算机与应用》 2021年第6期14-19,共6页 Intelligent Computer and Applications
关键词 BERT 中文突发事件 自动标注 信息提取 BERT Chinese emergency event automatic-annotation information extraction
  • 相关文献

参考文献7

二级参考文献111

  • 1马建霞,袁慧,蒋翔.基于Bi-LSTM+CRF的科学文献中生态治理技术相关命名实体抽取研究[J].数据分析与知识发现,2020,4(2):78-88. 被引量:8
  • 2Ralph Grishman. 1997. Information Extraction : Tech- niques and Challenges[R]. New York: New York U-niversity, 1997.
  • 3Ralph Grishman, Beth Sundheim. Message Under- standing Conference-6: A Brief History[C]//Proceed- ings of COLING, 1996.
  • 4http://www, itl. nist. gov/iad/mig/tests/ace/[OL].
  • 5http ://www. nist. gov/tac/[OL].
  • 6Martina Naughton, N. Kushmerichand J. Carthy. Event Extraction from Hetergeneous News Sources [C]//Proceedings of AAAI, 2006.
  • 7D. McClosky, M. Surdeanu, C. D. Manning. Event Extraction as Dependency Parsing[C]//Proceedings ofACL-HLT, 2011.
  • 8Yu Hong, Jianfeng Zhang, Bin Ma, Jianmin Yao, Gu- odong Zhou, Qiaoming Zhu. Using Cross-Entity Infer ence to Improve Event Extraction[C]//Proeeedings ofACL-HLT, 2011.
  • 9Jun Zhao, Feifan Liu. Product Named Entity Recog nition in Chinese Texts[J]. International Journal of Language Resource and Evaluation. 2008, 42 (2) :132- 152.
  • 10Richard C. Wang, William Cohen. Automatic Set In- stance Extraction using the Web[C]//Proceedings of ACL-IJCNLP, 2009.

共引文献263

同被引文献18

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部