期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Language Model Using Differentiable Neural Computer Based on Forget Gate-Based Memory Deallocation
1
作者 Donghyun Lee Hosung Park +4 位作者 Soonshin Seo Changmin Kim Hyunsoo Son Gyujin Kim Ji-Hwan Kim 《Computers, Materials & Continua》 SCIE EI 2021年第7期537-551,共15页
A differentiable neural computer(DNC)is analogous to the Von Neumann machine with a neural network controller that interacts with an external memory through an attention mechanism.Such DNC’s offer a generalized metho... A differentiable neural computer(DNC)is analogous to the Von Neumann machine with a neural network controller that interacts with an external memory through an attention mechanism.Such DNC’s offer a generalized method for task-specific deep learning models and have demonstrated reliability with reasoning problems.In this study,we apply a DNC to a language model(LM)task.The LM task is one of the reasoning problems,because it can predict the next word using the previous word sequence.However,memory deallocation is a problem in DNCs as some information unrelated to the input sequence is not allocated and remains in the external memory,which degrades performance.Therefore,we propose a forget gatebased memory deallocation(FMD)method,which searches for the minimum value of elements in a forget gate-based retention vector.The forget gatebased retention vector indicates the retention degree of information stored in each external memory address.In experiments,we applied our proposed NTM architecture to LM tasks as a task-specific example and to rescoring for speech recognition as a general-purpose example.For LM tasks,we evaluated DNC using the Penn Treebank and enwik8 LM tasks.Although it does not yield SOTA results in LM tasks,the FMD method exhibits relatively improved performance compared with DNC in terms of bits-per-character.For the speech recognition rescoring tasks,FMD again showed a relative improvement using the LibriSpeech data in terms of word error rate. 展开更多
关键词 Forget gate-based memory deallocation differentiable neural computer language model forget gate-based retention vector
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部