This paper focuses on end-to-end task-oriented dialogue systems,which jointly handle dialogue state tracking(DST)and response generation.Traditional methods usually adopt a supervised paradigm to learn DST from a manu...This paper focuses on end-to-end task-oriented dialogue systems,which jointly handle dialogue state tracking(DST)and response generation.Traditional methods usually adopt a supervised paradigm to learn DST from a manually labeled corpus.However,the annotation of the corpus is costly,time-consuming,and cannot cover a wide range of domains in the real world.To solve this problem,we propose a multi-span prediction network(MSPN)that performs unsupervised DST for end-to-end task-oriented dialogue.Specifically,MSPN contains a novel split-merge copy mechanism that captures long-term dependencies in dialogues to automatically extract multiple text spans as keywords.Based on these keywords,MSPN uses a semantic distance based clustering approach to obtain the values of each slot.In addition,we propose an ontology-based reinforcement learning approach,which employs the values of each slot to train MSPN to generate relevant values.Experimental results on single-domain and multi-domain task-oriented dialogue datasets show that MSPN achieves state-of-the-art performance with significant improvements.Besides,we construct a new Chinese dialogue dataset MeDial in the low-resource medical domain,which further demonstrates the adaptability of MSPN.展开更多
Dialogue state tracking(DST)leverages dialogue information to predict dialogues states which are generally represented as slot-value pairs.However,previous work usually has limitations to efficiently predict values du...Dialogue state tracking(DST)leverages dialogue information to predict dialogues states which are generally represented as slot-value pairs.However,previous work usually has limitations to efficiently predict values due to the lack of a powerful strategy for generating values from both the dialogue history and the predefined values.By predicting values from the predefined value set,previous discriminative DST methods are difficult to handle unknown values.Previous generative DST methods determine values based on mentions in the dialogue history,which makes it difficult for them to handle uncovered and non-pointable mentions.Besides,existing generative DST methods usually ignore the unlabeled instances and suffer from the label noise problem,which limits the generation of mentions and eventually hurts performance.In this paper,we propose a unified shared-private network(USPN)to generate values from both the dialogue history and the predefined values through a unified strategy.Specifically,USPN uses an encoder to construct a complete generative space for each slot and to discern shared information between slots through a shared-private architecture.Then,our model predicts values from the generative space through a shared-private decoder.We further utilize reinforcement learning to alleviate the label noise problem by learning indirect supervision from semantic relations between conversational words and predefined slot-value pairs.Experimental results on three public datasets show the effectiveness of USPN by outperforming state-of-the-art baselines in both supervised and unsupervised DST tasks.展开更多
Statistical dialogue management is the core of cognitive spoken dialogue systems (SDS) and has attracted great research interest. In recent years, SDS with the ability of evolution is of particular interest and beco...Statistical dialogue management is the core of cognitive spoken dialogue systems (SDS) and has attracted great research interest. In recent years, SDS with the ability of evolution is of particular interest and becomes the cuttingedge of SDS research. Dialogue state tracking (DST) is a process to estimate the distribution of the dialogue states at each dialogue turn, given the previous interaction history. It plays an important role in statistical dialogue management. To provide a common testbed for advancing the research of DST, international DST challenges (DSTC) have been organised and well-attended by major SDS groups in the world. This paper reviews recent progresses on rule-based and statistical approaches during the challenges. In particular, this paper is focused on evolvable DST approaches for dialogue domain extension. The two primary aspects for evolution, semantic parsing and tracker, are discussed. Semantic enhancement and a DST framework which bridges rule-based and statistical models are introduced in detail. By effectively incorporating prior knowledge of dialogue state transition and the ability of being data-driven, the new framework supports reliable domain extension with little data and can continuously improve with more data available. This makes it excellent candidate for DST evolution. Experiments show that the evolvable DST approaches can achieve the state-of-the-art performance and outperform all previously submitted trackers in the third DSTC.展开更多
基金supported by the National Key Research and Development Program of China under Grant No.2020AAA0106400the National Natural Science Foundation of China under Grant Nos.61922085 and 61976211+2 种基金the Independent Research Project of National Laboratory of Pattern Recognition under Grant No.Z-2018013the Key Research Program of Chinese Academy of Sciences(CAS)under Grant No.ZDBS-SSW-JSC006the Youth Innovation Promotion Association CAS under Grant No.201912.
文摘This paper focuses on end-to-end task-oriented dialogue systems,which jointly handle dialogue state tracking(DST)and response generation.Traditional methods usually adopt a supervised paradigm to learn DST from a manually labeled corpus.However,the annotation of the corpus is costly,time-consuming,and cannot cover a wide range of domains in the real world.To solve this problem,we propose a multi-span prediction network(MSPN)that performs unsupervised DST for end-to-end task-oriented dialogue.Specifically,MSPN contains a novel split-merge copy mechanism that captures long-term dependencies in dialogues to automatically extract multiple text spans as keywords.Based on these keywords,MSPN uses a semantic distance based clustering approach to obtain the values of each slot.In addition,we propose an ontology-based reinforcement learning approach,which employs the values of each slot to train MSPN to generate relevant values.Experimental results on single-domain and multi-domain task-oriented dialogue datasets show that MSPN achieves state-of-the-art performance with significant improvements.Besides,we construct a new Chinese dialogue dataset MeDial in the low-resource medical domain,which further demonstrates the adaptability of MSPN.
基金supported by the National Natural Science Foundation of China under Grant Nos.61533018,U1936207,61976211,and 61702512the Independent Research Project of National Laboratory of Pattern Recognition under Grant No.Z-2018013+1 种基金the National Key Research and Development Program of China under Grant No.2020AAA0106400the Youth Innovation Promotion Association of Chinese Academy of Sciences under Grant No.201912.
文摘Dialogue state tracking(DST)leverages dialogue information to predict dialogues states which are generally represented as slot-value pairs.However,previous work usually has limitations to efficiently predict values due to the lack of a powerful strategy for generating values from both the dialogue history and the predefined values.By predicting values from the predefined value set,previous discriminative DST methods are difficult to handle unknown values.Previous generative DST methods determine values based on mentions in the dialogue history,which makes it difficult for them to handle uncovered and non-pointable mentions.Besides,existing generative DST methods usually ignore the unlabeled instances and suffer from the label noise problem,which limits the generation of mentions and eventually hurts performance.In this paper,we propose a unified shared-private network(USPN)to generate values from both the dialogue history and the predefined values through a unified strategy.Specifically,USPN uses an encoder to construct a complete generative space for each slot and to discern shared information between slots through a shared-private architecture.Then,our model predicts values from the generative space through a shared-private decoder.We further utilize reinforcement learning to alleviate the label noise problem by learning indirect supervision from semantic relations between conversational words and predefined slot-value pairs.Experimental results on three public datasets show the effectiveness of USPN by outperforming state-of-the-art baselines in both supervised and unsupervised DST tasks.
文摘Statistical dialogue management is the core of cognitive spoken dialogue systems (SDS) and has attracted great research interest. In recent years, SDS with the ability of evolution is of particular interest and becomes the cuttingedge of SDS research. Dialogue state tracking (DST) is a process to estimate the distribution of the dialogue states at each dialogue turn, given the previous interaction history. It plays an important role in statistical dialogue management. To provide a common testbed for advancing the research of DST, international DST challenges (DSTC) have been organised and well-attended by major SDS groups in the world. This paper reviews recent progresses on rule-based and statistical approaches during the challenges. In particular, this paper is focused on evolvable DST approaches for dialogue domain extension. The two primary aspects for evolution, semantic parsing and tracker, are discussed. Semantic enhancement and a DST framework which bridges rule-based and statistical models are introduced in detail. By effectively incorporating prior knowledge of dialogue state transition and the ability of being data-driven, the new framework supports reliable domain extension with little data and can continuously improve with more data available. This makes it excellent candidate for DST evolution. Experiments show that the evolvable DST approaches can achieve the state-of-the-art performance and outperform all previously submitted trackers in the third DSTC.