
Chinese story ending generation model based on bidirectional contrastive training
Abstract: Chinese Story Ending Generation (SEG) is a downstream task in Natural Language Processing (NLP). CLSEG (Contrastive Learning of Story Ending Generation), which is based on completely wrong endings, performs well in terms of story consistency. However, because a wrong ending also shares some content with the original ending, contrastive training that uses only wrong endings may strip the correct main parts of the original ending out of the generated text. Therefore, forward (positive) ending enhancement training was added on top of CLSEG to preserve the correct parts lost in contrastive training; at the same time, introducing positive endings gives the generated endings stronger diversity and relevance. The proposed Chinese story ending generation model based on bidirectional contrastive training consists of two main parts: 1) multi-ending sampling, in which positively enhanced endings and negatively contrasted wrong endings are obtained by different model-based methods; 2) contrastive training, in which the loss function is modified during training so that the generated ending is drawn close to the positive ending and pushed away from the wrong ending. Experimental results on the publicly available story dataset OutGen show that, compared with models such as GPT2.ft and Della (Deeply fused layer-wise latent variable), the proposed model achieves better results on BERTScore, METEOR, and other metrics, generating more diverse and relevant endings.
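The bidirectional contrastive objective described above can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the function name, the use of average per-token log-likelihoods, and the margin (hinge) form of the contrastive term are assumptions made for clarity.

```python
def bidirectional_contrastive_loss(logp_gold, logp_pos, logp_neg, margin=1.0):
    """Sketch of a bidirectional contrastive training objective (hypothetical form).

    logp_gold, logp_pos, logp_neg: average per-token log-likelihoods the model
    assigns to the gold (original) ending, a positively enhanced ending, and a
    wrong (negative) ending, respectively.

    The total loss keeps the gold ending likely (standard MLE term) while
    pulling the model toward the positive ending and pushing it away from the
    wrong ending (margin term), mirroring the two-part training in the abstract.
    """
    lm_loss = -logp_gold  # standard language-modeling (MLE) term
    # Hinge on the likelihood gap: penalize when the positive ending is not
    # at least `margin` more likely (in log space) than the wrong ending.
    contrastive = max(0.0, margin - (logp_pos - logp_neg))
    return lm_loss + contrastive
```

For example, with a well-separated pair (logp_pos = -1.0, logp_neg = -3.0) the hinge term vanishes and only the MLE term remains; when the wrong ending is more likely than the positive one, the contrastive term adds a penalty.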
Authors: SHUAI Qi; WANG Hairui; ZHU Guifu (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan 650504, China; Information Technology Construction Management Center, Kunming University of Science and Technology, Kunming, Yunnan 650504, China)
Source: Journal of Computer Applications (《计算机应用》, CSCD, Peking University Core Journal), 2024, Issue 9, pp. 2683-2688 (6 pages)
Fund: Supported by the National Natural Science Foundation of China (61863016).
Keywords: Chinese Story Ending Generation (SEG); contrastive training; text sampling; text generation; Natural Language Processing (NLP)