期刊文献+

基于注意力机制的特征融合序列标注模型 被引量:1

Attention based sequence labeling model with feature fusion
下载PDF
导出
摘要 序列标注任务是自然语言处理领域的重要问题,包括命名实体识别、词性标注、意见抽取等具有重要应用价值的子任务。目前,长短期记忆网络-条件随机场模型(LSTM-CRF)作为序列标注任务的主流框架,虽然取得了很好的性能并被广泛使用,但仍存在局部依赖性以及受限于序列化特征学习的缺点。为了同步建模句子中每个词的局部上下文语义与全局语义,并将两部分语义进行有效融合,提出基于注意力机制的特征融合序列标注模型。具体地,本模型利用多头注意力机制建模句子中任意两个词之间的语义关系,得到每个词应关注的全局语义。考虑到LSTM学习的局部上下文信息和注意力机制学习的全局语义具有互补性,进一步设计了三种特征融合方法将两部分语义深度融合以得到更丰富的语义依赖信息。为验证模型的有效性,在四个数据集上进行了大量的实验,实验结果表明本模型达到较优的性能。 Sequence labeling is a fundamental problem in natural language processing and play a significant role in applications such as named entity recognition,text chunking,opinion extraction and so on.Even though LSTM-CRF is a widely used mainstream framework for sequence labeling in the recent years,it still has several shortcomings including local dependence and limitation of sequential learning.To synchronously model the local context semantics and global semantics of each word,and to effectively merge the two parts of semantics,an attention based sequence labeling model with feature fusion is proposed in this paper.The multi-head attention mechanism to model the semantic relation between any two words is introduced and global semantics is obtained.Considering that the context information of LSTM and the global semantics of attention mechanism are complementary,three feature fusion methods were designed to significantly fuse the two parts of semantics.Extensive experiments conducted demonstrate that our approach achieves state-of-the-art performance on four datasets.
作者 王旭强 岳顺民 张亚行 刘杰 王扬 杨青 WANG Xuqiang;YUE Shunmin;ZHANG Yahang;LIU Jie;WANG Yang;YANG Qing(Information Communication Company,State Grid Tianjin Electric Power Company,Tianjin 300310,China;College of Computer Science,Nankai University,Tianjin 300071,China;College of Artificial Intelligence,Nankai University,Tianjin 300071,China)
出处 《山东科技大学学报(自然科学版)》 CAS 北大核心 2020年第5期79-88,共10页 Journal of Shandong University of Science and Technology(Natural Science)
基金 国家自然科学基金项目(U1633103) 天津市科技计划项目(18ZXZNGX00310) 天津市电力公司科技项目(KJ19-1-38)。
关键词 序列标注 条件随机场 多头注意力机制 特征融合 sequence labeling conditional random field(CRF) multi-head attention feature fusion
  • 相关文献

参考文献4

二级参考文献20

共引文献120

同被引文献7

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部