
Dialogue state tracking model based on slot correlation information extraction
Abstract Dialogue State Tracking (DST) is an important module in task-oriented dialogue systems, but existing open-vocabulary DST models do not make full use of slot correlation information or of the structural information of the dataset itself. To address these problems, a DST model named SCEL-DST (SCE and LOW for Dialogue State Tracking) was proposed based on slot correlation information extraction. Firstly, a Slot Correlation Extractor (SCE) was constructed, using the attention mechanism to learn the correlation information between slots. Then, the Learning Optimal sample Weights (LOW) strategy was applied during training to enhance the model's utilization of dataset information without a substantial increase in training time. Finally, the model details were optimized to build the complete SCEL-DST model. Experimental results show that SCE and LOW are critical to the performance of SCEL-DST, which achieves higher joint goal accuracy on both experimental datasets: 1.6 percentage points higher than TripPy (Triple coPy) under the same conditions on the MultiWOZ 2.3 (Wizard-of-OZ 2.3) dataset, and 2.0 percentage points higher than AG-DST (Amendable Generation for Dialogue State Tracking) on the WOZ 2.0 (Wizard-of-OZ 2.0) dataset.
Authors SHI Lifeng (石利锋); NI Zhengwei (倪郑威) (School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou, Zhejiang 310018, China)
Source Journal of Computer Applications (《计算机应用》), indexed in CSCD and the Peking University Core Journals list, 2023, No. 5, pp. 1430-1437 (8 pages)
Funding Zhejiang Provincial Natural Science Foundation (LQ22F010008).
Keywords Dialogue State Tracking (DST); attention mechanism; task-oriented dialogue; Curriculum Learning (CL); pre-trained model
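The abstract reports its results as joint goal accuracy, the fraction of dialogue turns in which every slot value of the predicted dialogue state matches the reference state exactly. The sketch below shows one way that metric could be computed. It is not taken from the paper: the function name `joint_goal_accuracy`, the slot names, and the toy dialogue are hypothetical, and real evaluation scripts usually add value normalization that is omitted here.

```python
from typing import Dict, List

# A dialogue state maps slot names to values, e.g. {"hotel-area": "north"}.
DialogueState = Dict[str, str]


def joint_goal_accuracy(predicted: List[DialogueState],
                        gold: List[DialogueState]) -> float:
    """Return the fraction of turns whose full predicted state equals the gold state."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must contain the same number of turns")
    if not gold:
        return 0.0
    # A turn counts as correct only if all slots and values match exactly.
    correct = sum(1 for p, g in zip(predicted, gold) if p == g)
    return correct / len(gold)


# Hypothetical four-turn dialogue (slot names are illustrative only).
gold_states = [
    {"hotel-area": "north"},
    {"hotel-area": "north", "hotel-stars": "4"},
    {"hotel-area": "north", "hotel-stars": "4", "hotel-parking": "yes"},
    {"hotel-area": "north", "hotel-stars": "4", "hotel-parking": "yes",
     "hotel-book-day": "friday"},
]
predicted_states = [
    {"hotel-area": "north"},
    {"hotel-area": "north", "hotel-stars": "4"},
    # One wrong slot value makes the whole turn incorrect.
    {"hotel-area": "north", "hotel-stars": "3", "hotel-parking": "yes"},
    {"hotel-area": "north", "hotel-stars": "4", "hotel-parking": "yes",
     "hotel-book-day": "friday"},
]

score = joint_goal_accuracy(predicted_states, gold_states)
print(f"joint goal accuracy: {score:.2%}")
```

Because a single wrong slot invalidates the whole turn, the metric is strict. An improvement quoted in percentage points, as in the abstract, is an absolute difference in this fraction multiplied by 100.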