Research on a Parallel Corpus Construction Method for Event Factuality Prediction

Parallel Corpus Annotated for Event Factuality Prediction


Abstract: Event factuality prediction (EFP) models factuality assessment as a sentence-level regression task: it determines the degree to which an event mention in a sentence corresponds to a fact in the world. EFP is an important and challenging task in natural language processing. Whereas English has abundant event factuality corpora, Chinese event factuality corpora remain scarce, which clearly hinders further research on Chinese factuality assessment. To address this problem, this paper explores and proposes a semi-automatic method, based on machine translation, for constructing an event factuality parallel corpus. Experimental results show that multi-task learning with Parallel FactBank, the Chinese-English event factuality parallel corpus constructed in this paper, together with the DLEF corpus effectively improves the generalization ability of models on the Chinese EFP task and yields better performance on each dataset than single-task learning models.

Authors: ZHANG Zhen (张禛); XIE Zhipeng (谢志鹏) (Software School, Fudan University, Shanghai 200438, China; School of Computer Science, Fudan University, Shanghai 200438, China)

Affiliations: Software School, Fudan University; School of Computer Science, Fudan University

Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), indexed in CSCD and the Peking University Core Journals list, 2024, Issue 7, pp. 1537-1544 (8 pages)

Funding: Supported by the National Natural Science Foundation of China (62076072).

Keywords: event factuality; factuality assessment; parallel corpus; multi-task learning

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

