基于槽位相关信息提取的对话状态追踪模型

Dialogue state tracking model based on slot correlation information extraction

下载PDF

导出

摘要对话状态追踪(DST)是任务型对话系统中一个重要的模块,但现有的基于开放词表的DST模型没有充分利用槽位的相关信息以及数据集本身的结构信息。针对上述问题,提出基于槽位相关信息提取的DST模型SCELDST(SCE and LOW for Dialogue State Tracking)。首先,构建槽位相关信息提取器(SCE),利用注意力机制学习槽位之间的相关信息;然后,在训练过程中应用学习最优样本权重(LOW)策略,在未大幅增加训练时间的前提下,加强模型对数据集信息的利用;最后,优化模型细节,搭建完整的SCEL-DST模型。实验结果表明,SCE和LOW对SCEL-DST模型性能的提升至关重要,该模型在两个实验数据集上均取得了更高的联合目标准确率,其中在MultiWOZ 2.3(Wizard-of-OZ 2.3)数据集上与相同条件下的TripPy(Triple coPy)相比提升了1.6个百分点,在WOZ 2.0(Wizard-of-OZ 2.0)数据集上与AG-DST(Amendable Generation for Dialogue State Tracking)相比提升了2.0个百分点。 Dialogue State Tracking(DST)is an important module in task-oriented dialogue systems,but the existing open-vocabulary-based DST models do not make full use of the slot correlation information as well as the structural information of the dataset itself.To solve the above problems,a new DST model named SCEL-DST(SCE and LOW for Dialogue State Tracking)was proposed based on slot correlation information extraction.Firstly,a Slot Correlation Extractor(SCE)was constructed,and the attention mechanism was used to learn the correlation information between slots.Then the Learning Optimal sample Weights(LOW)strategy was applied in the training process to enhance the model􀆳s utilization of the dataset information without substantial increase in training time.Finally,the model details were optimized to build the complete SCEL-DST model.Experimental results show that SCE and LOW are critical to the performance improvement of SCEL-DST model,making SCEL-DST achieve higher joint goal accuracy on both experimental datasets.The SCEL-DST model has the joint goal accuracy improved by 1.6 percentage points on the MultiWOZ 2.3(Wizard-of-OZ 2.3)dataset compared to TripPy(Triple coPy)under the same conditions,and by 2.0 percentage points on the WOZ 2.0(Wizard-of-OZ 2.0)dataset compared to AG-DST(Amendable Generation for Dialogue State Tracking).

作者石利锋倪郑威 SHI Lifeng;NI Zhengwei(School of Information and Electronic Engineering,Zhejiang Gongshang University,Hangzhou Zhejiang 310018,China)

机构地区浙江工商大学信息与电子工程学院

出处《计算机应用》 CSCD 北大核心 2023年第5期1430-1437,共8页 journal of Computer Applications

基金浙江省自然科学基金资助项目(LQ22F010008)。

关键词对话状态追踪注意力机制任务型对话课程学习预训练模型 Dialogue State Tracking(DST) attention mechanism task-oriented dialogue Curriculum Learning(CL) pre-trained model

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1叶正,傅灵,覃俊,刘晶.基于利用BERT不同层信息的微调策略的对话状态追踪[J].中南民族大学学报（自然科学版）,2023,42(3):327-333. 被引量：1
2刘子瑞.基于YOLOV4算法的优化低空无人机的检测与跟踪[J].黑龙江科学,2023,14(6):70-72.
3成明峰,耿晶晶.基于高斯过程粒子滤波的WIFI信号定位追踪[J].三门峡职业技术学院学报,2023,22(1):139-143.
4徐瑞,肖海军,胡琛.基于WGBDT的心衰患者半年内再入院风险预测[J].中南民族大学学报（自然科学版）,2023,42(3):425-432.
5夏瑞玲,李国平,王国中,滕国伟.基于改进蚁群算法的个性化学习路径推荐[J].上海大学学报（自然科学版）,2023,29(1):129-139. 被引量：2
6张宜宝,孙经纬,石绍军,田芙蓉,张清,刘双喜.自动驾驶插秧机控制系统的设计与试验[J].农机化研究,2023,45(7):71-78. 被引量：3
7如风.改用免费小软件管理分区更方便[J].电脑爱好者,2022(21):39-39.
8尼玛珍啦.浅析小学语文教学中学生阅读能力培养策略[J].传奇故事,2023(22):7-8.
9吴仁彪,刘洋,贾云飞,刘闪亮,乔晗.基于改进XGBoost的民航重点旅客风险评估方法[J].安全与环境学报,2023,23(3):651-658. 被引量：5
10罗苑彤.新媒体、新技术助推粤港澳大湾区艺术设计教育改革[J].美术文献,2022(11):84-86.

计算机应用

2023年第5期

浏览历史

内容加载中请稍等...

基于槽位相关信息提取的对话状态追踪模型

相关作者

相关机构

相关主题

浏览历史