摘要
语义角色标注旨在标注出句子中所有与谓语相关的语义成分,是进行语义分析的基础和关键技术.使用传统的机器学习方法进行语义角色标注,需要人工设定特征,特征稀疏且工作繁琐沉重,同时传统方法对句法解析精度有较高要求,所以语义角色标注发展缓慢.针对上述情况,采取基于双向长短时记忆(BiLSTM)网络-注意力机制(attention)-条件随机场(CRF)模型进行汉语语义角色标注,同时尝试针对性引入其他资源优化模型性能.在训练阶段,将词性、依存句法特征以及短语结构句法特征组成的多线索特征组共同送入模型.经过多组对照实验论证,相比于BiLSTM-CRF模型,融合注意力机制的模型性能显著提升,并且引入的多线索特征组可以进一步提升模型性能.
Semantic role labeling aims to label all semantic components related to predicates in a sentence,and it constitutes the basis and the key technology for semantic analyses.Semantic role labeling based on traditional machine learning methods requires manual feature setting.Traditional methods have endured problems of sparse features and cumbersome work.At the same time,traditional methods must meet higher requirements on the accuracy of syntactic analyses,rendering the development of semantic role labeling slow.Due to the existence of the situation mentioned above,this paper is based on the BiLSTM-attention-CRF model for Chinese semantic role labeling,and attempts to introduce other targeted resources to optimize the performance of the model.During training,a multi-clue feature group consisting of parts of speech,dependent syntactic parsing features,and phrase structure parsing features are fed into the model.After multiple groups of controlled experiments,compared with the BiLSTM-CRF model,the performance of the model fused with the attention mechanism is significantly improved.Finally,the combined multi-cue feature group can further improve the model performance.
作者
朱傲
万福成
马宁
车郭怡
ZHU Ao;WAN Fucheng;MA Ning;CHE Guoyi(Key Laboratory of China's Ethnic Languages and Information Technology of Ministry of Education,Northwest Minzu University,Lanzhou 730030,China;Key Laboratory of China's Ethnic Languages and Intelligent Processing of Gansu Province,Northwest Minzu University,Lanzhou 730030,China)
出处
《厦门大学学报(自然科学版)》
CAS
CSCD
北大核心
2021年第6期1019-1023,共5页
Journal of Xiamen University:Natural Science
基金
国家自然科学基金(61762076,61602387)。