Chinese Semantic Role Labeling Based on Dynamic Syntax Pruning (基于动态句法剪枝机制的中文语义角色标注)

Cited by: 2

Abstract: Semantic role labeling (SRL), a shallow semantic parsing task, has received extensive research attention in recent years and plays a core role in natural language processing (NLP). SRL aims to identify the predicates in a given sentence and their corresponding semantic arguments, which supports semantic understanding in downstream tasks such as information extraction, question answering, and reading comprehension. Many methods have been proposed for the task, and they fall into two main categories: machine learning methods with hand-crafted discrete features, and deep learning methods with automatically learned distributed features. Early studies largely split SRL into two separate subtasks, predicate disambiguation and argument role labeling. More recently, considerable effort has gone into end-to-end SRL architectures that solve both pipeline steps in one shot with a single unified model.

Recent studies also show that integrating external syntactic features, such as syntactic dependency trees, is highly important for SRL, so designing neural models that capture syntactic features effectively has become an active research topic. Constructing syntactic features is a key step of SRL and largely determines the final performance of the task. He et al. (2018) find that only part of the syntactic structure offers valuable information for SRL, which calls for pruning the syntactic structure features. However, existing neural network models fail to model syntactic features effectively: existing work adopts offline, manually fixed syntactic pruning schemes, which inevitably lead to either the loss of key syntactic information or weakened pruning effectiveness.

To address these issues, we propose an end-to-end neural network model for Chinese SRL based on a dynamic syntactic pruning mechanism. Specifically, we propose two novel methods: a recursive neural network model with dynamic syntactic pruning (Recur-DSP) and a syntax-label graph convolutional network with dynamic syntactic pruning (SGCN-DSP). Recur-DSP uses a recursive neural network to encode and fuse syntactic structure, and discretizes each connection in the syntactic tree with the Gumbel-Softmax function to realize dynamic syntactic pruning. SGCN-DSP uses a graph convolutional network to jointly model the arcs of the syntactic dependency tree and their labels, on top of which we introduce a corresponding dynamic syntactic pruning mechanism.

Experimental results on multiple benchmark datasets show the effectiveness of the proposed methods, which outperform the current best method by a large margin and achieve state-of-the-art performance for Chinese SRL. On the CoNLL09 dataset, SGCN-DSP achieves an F1 score of 86.9% in argument role labeling and 89.1% in predicate identification. Integrating the pre-trained language model BERT (Bidirectional Encoder Representations from Transformers) further improves performance: SGCN-DSP reaches 90.4% F1 in argument role labeling and 90.8% F1 in predicate identification.
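The abstract states that Recur-DSP discretizes each connection of the syntactic tree with the Gumbel-Softmax function. The sketch below illustrates only that general idea in NumPy and is not the authors' implementation. It assumes each arc carries a two-way score `[prune, keep]`; the function names `gumbel_softmax` and `prune_arcs`, the temperature `tau`, and the example logits are all hypothetical choices made for illustration.

```python
import numpy as np


def gumbel_softmax(logits, tau=1.0, hard=True, rng=None):
    """Draw a Gumbel-Softmax sample over the last axis of `logits`."""
    rng = np.random.default_rng() if rng is None else rng
    # Gumbel(0, 1) noise: -log(-log(U)) with U ~ Uniform(0, 1).
    u = rng.uniform(low=1e-10, high=1.0, size=logits.shape)
    gumbel = -np.log(-np.log(u))
    y = (logits + gumbel) / tau
    y = y - y.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    soft = np.exp(y) / np.exp(y).sum(axis=-1, keepdims=True)
    if not hard:
        return soft
    # Discretize to one-hot. In an autodiff framework, a straight-through
    # estimator would typically route gradients through `soft` instead.
    one_hot = np.zeros_like(soft)
    idx = np.expand_dims(soft.argmax(axis=-1), -1)
    np.put_along_axis(one_hot, idx, 1.0, axis=-1)
    return one_hot


def prune_arcs(arc_logits, tau=0.5, rng=None):
    """Return a 0/1 keep mask of shape (num_arcs,).

    Assumption: `arc_logits` has shape (num_arcs, 2), columns [prune, keep].
    """
    return gumbel_softmax(arc_logits, tau=tau, hard=True, rng=rng)[:, 1]


# Hypothetical scores for three dependency arcs.
arc_logits = np.array([[0.2, 2.5], [3.0, -1.0], [0.0, 0.1]])
keep_mask = prune_arcs(arc_logits, rng=np.random.default_rng(0))
print(keep_mask)  # one 0/1 entry per arc; the sample varies with the seed
```

Because the decision is sampled rather than fixed by an offline rule, the keep mask can differ from sentence to sentence and, in a trainable model, the arc scores can be learned jointly with the SRL objective. A lower `tau` makes the soft sample closer to one-hot.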

Authors: FEI Hao (费豪); JI Dong-Hong (姬东鸿); REN Ya-Feng (任亚峰) (School of Cyber Science and Engineering, Wuhan University, Wuhan 430072; School of Interpreting and Translation Studies, Guangdong University of Foreign Studies, Guangzhou 510420)

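Affiliations: School of Cyber Science and Engineering, Wuhan University; School of Interpreting and Translation Studies, Guangdong University of Foreign Studies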

Source: Chinese Journal of Computers (《计算机学报》); indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2022, Issue 8, pp. 1746-1764 (19 pages)

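Funding: Supported by the National Key Research and Development Program of China (2017YFC1200500), the National Natural Science Foundation of China (61702121, 61772378), and the Guangzhou Science and Technology Program (202102020607).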

Keywords: natural language processing; semantic role labeling; syntax pruning; neural network; deep learning

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]

