摘要
事件共指消解是很多自然语言处理任务的基础,旨在识别文本中指代相同真实事件的事件提及。由于中文语法相比英文更复杂,捕获英文文本特征的方法在中文事件共指消解中效果并不明显。为解决文档内中文事件共指,提出了一种门控机制神经网络(Gated Mechanism Neural Network, GMNN)。针对中文具有主语省略、结构松散等特点,引入事件基本属性作为符号特征。在此基础上,提出了一种新的门控去噪机制,对符号特征向量进行微调,过滤符号特征中的噪声,提取在特定上下文语境中的有用信息,进而提高共指事件的识别率。在ACE2005中文数据集上进行了实验,结果表明,GMNN的AVG分数提升了2.66,有效地提高了中文事件共指消解的效果。
Event coreference resolution is the basis of many natural language processing tasks, aiming to identify event mentions in text that refer to the same real event.Since Chinese grammar is much more complex than English, the method of capturing English text features is not effective in Chinese event corefe-rence resolution.To solve the within-document Chinese event corefe-rence, a gated mechanism neural network(GMNN) is proposed.In view of Chinese characteristics with subject omission and loose structure, event attributes are introduced as symbolic features.On this basis, a novel gated mechanism is proposed, which fine-tunes the symbolic feature vector, filters the noise in the symbolic features, extracts useful information in a specific context, and improves the coreference events recognition rate.Experimental results on the ACE2005 Chinese dataset show that the perfor-mance of GMNN improves by 2.66,which effectively improves the effect of Chinese event coreference resolution.
作者
环志刚
蒋国权
张玉健
刘浏
刘姗姗
HUAN Zhigang;JIANG Guoquan;ZHANG Yujian;LIU Liu;LIU Shanshan(School of Cyber Science and Engineering,Southeast University,Nanjing 211189,China;The Sixty-third Research Institute,National University of Defense Technology,Nanjing 210007,China;School of Information Engineering,Suqian University,Suqian,Jiangsu,223800,China)
出处
《计算机科学》
CSCD
北大核心
2023年第3期291-297,共7页
Computer Science
基金
中国博士后科学基金面上资助(2021MD703983)
国防科技大学校科研计划项目(ZK20-46)。
关键词
中文事件共指消解
门控机制
神经网络
预训练语言模型
符号特征
Chinese event coreference resolution
Gated mechanism
Neural network
Pre-trained language models
Symbolic features