基于对比学习的表达式偏序关系建模

Modeling Partial Order Relation in Math Expression by Contrastive Learning

下载PDF

导出

摘要数学公式解题任务要求模型根据数学问题生成表达式用于解答。该任务的主流方法是将目标表达式当作文本序列来生成。然而,这一设定导致模型忽略了表达式树作为树形结构所带有的偏序关系,如交换律、分配律等。这不仅降低了模型对表达式生成的学习效率,也减弱了模型的泛化能力。为解决这一问题,该文提出一种基于对比学习的表达式偏序关系建模方法。该方法的核心做法是在模型训练时,对表达式树做微调扰动,产生和原有表达式等价和不等价的正样本和负样本,并通过对比学习最小化原式和等价式子之间的距离,且最大化与不等价负样本式子之间的距离。在公开数据集Math23K和MAWPS上的对比实验表明,该文方法相对于基线模型具有显著性能提升。 In math word problem solving task,mainstream methods treat target expression as text sequence.The consequence of this setting is that the model cannot capture the partial order relation of expression trees,such as the law of exchange or the distributive law.This hinders models'learning efficiency in generating expression,and also reduces their generalization capability.To address this problem,this paper proposes a method to model expression tree's partial order relation by contrastive learning.The core practice is to generate both positive and negative sam-ples when training models.The samples are obtained by modifying the original expression tree.The training objec-tive is to minimize the distance between original expression with equivalent samples and maximize the distance with inequivalent ones.Based on experiments performed on Math23k and MAWPS,our model outperforms the baseline models by a significant margin.

作者胡星武桂韬张奇陈运文高翔 HU Xingwu;GUI Tao;ZHANG Qi;CHEN Yunwen;GAO Xiang(School of Computer Science,Fudan University,Shanghai 200433,China;Data Grand,Shanghai 201203,China)

机构地区复旦大学计算机科学技术学院达观数据

出处《中文信息学报》 CSCD 北大核心 2023年第4期166-174,共9页 Journal of Chinese Information Processing

关键词数学公式生成对比学习偏序关系 math word problem solving contrastive learning partial order relation

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1李明捷,王涛,苗容瑜,黄欣宁.航站楼离港旅客聚集影响因素研究[J].重庆理工大学学报（自然科学）,2023,37(6):285-293.
2何伟东,杨志豪,王治政,林鸿飞.融合语义特征与偏序关系的生物医学文档检索[J].山西大学学报（自然科学版）,2023,46(3):491-499.
3王恩昭.基于“问题生成与解决”的物理概念教学研究——以人教版物理必修3“电场电场强度”为例[J].数理天地（高中版）,2023(16):29-31.

中文信息学报

2023年第4期

浏览历史

内容加载中请稍等...

基于对比学习的表达式偏序关系建模

相关作者

相关机构

相关主题

浏览历史