
Template-Driven Neural Machine Translation

Cited by: 11
Abstract: Nowadays, neural machine translation (NMT) has become the most prominent approach to machine translation (MT) due to its simplicity, generality, and effectiveness. The principle of neural machine translation is to directly maximize the conditional probability of the target sentence given the source sentence in an end-to-end fashion. One of the most widely used neural machine translation models follows the encoder-decoder framework: it encodes the source sentence into a dense context representation using a recurrent neural network (RNN), and the decoder produces the target translation from the context vector. By exploiting gating and attention mechanisms, neural machine translation models have been shown to surpass the previously dominant statistical machine translation (SMT) approach on many well-established translation tasks. Recently, researchers have shown increasing interest in incorporating external lexical translation tables and phrase translation tables into neural machine translation, obtaining impressive translation performance. However, there is little study in the literature on incorporating translation templates, which are manually constructed or automatically induced from parallel corpora by heuristic algorithms, into the neural translation model.

In this paper, we propose a novel architecture, the template-driven neural machine translation model, which incorporates an additional translation template into the neural machine translation model. In contrast to the conventional neural machine translation model, on the source side we use an additional recurrent neural network encoder (the template encoder) to encode the translation template in parallel with the encoder for the source sentence. In our proposed template-driven NMT model, we first propose a gating mechanism, the knowledge gate, to balance the information between the source sentence and the translation template so as to induce the source sentence representation best suited for initializing the decoder. Second, to effectively leverage this knowledge representation when predicting target words, we propose a weighted variant of the attention mechanism, the attention gate, in which a time-dependent gating scalar controls the ratio of conditional information drawn from the source sentence versus the translation template.

To evaluate the effectiveness of our proposal, we experiment with three kinds of translation templates: 1) head templates, which preserve the n leftmost words of a sentence and blank out the rest as slots to be predicted and filled by the neural machine translation model; 2) tail templates, which blank out the leftmost words while keeping the m rightmost words; and 3) normal templates, in which words are discarded arbitrarily to create slots to be filled by the translation model. Experimental results demonstrate that our proposed model makes effective use of the additional information from the translation template: the translation accuracy for normal templates containing 20% of the target words of a sentence reaches 93.6% and 95.1% on the Chinese-to-English and English-to-Chinese translation tasks, respectively. When 20% of the target words are used as a translation template, we observe significant improvements of 4.2 to 7.2 BLEU points over the baseline systems on the Chinese-to-English and English-to-Chinese translation tasks. Experiments also show that translation performance improves further as more real context words are included in the translation template.
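As a concrete illustration of the ideas described above, the sketch below builds the three template types (head, tail, normal) from a target sentence and applies a scalar attention gate that mixes a source context vector with a template context vector. This is a minimal pure-Python sketch under stated assumptions: the function names, the slot symbol `<X>`, and the dot-product form of the gate are illustrative simplifications, not taken from the paper.

```python
import math
import random

def make_template(words, keep_ratio=0.2, kind="normal", slot="<X>", seed=0):
    """Build a translation template by keeping a fraction of the target
    words and blanking the rest with a slot symbol, mirroring the three
    template types in the abstract (head / tail / normal)."""
    n_keep = max(1, round(len(words) * keep_ratio))
    if kind == "head":      # keep the n leftmost words
        keep = set(range(n_keep))
    elif kind == "tail":    # keep the m rightmost words
        keep = set(range(len(words) - n_keep, len(words)))
    else:                   # "normal": keep arbitrarily chosen words
        keep = set(random.Random(seed).sample(range(len(words)), n_keep))
    return [w if i in keep else slot for i, w in enumerate(words)]

def attention_gate(state, c_src, c_tmpl, w):
    """Time-dependent scalar gate g_t = sigmoid(w . s_t); the decoder
    context is g_t * c_src + (1 - g_t) * c_tmpl (illustrative form)."""
    g = 1.0 / (1.0 + math.exp(-sum(wi * si for wi, si in zip(w, state))))
    return [g * s + (1.0 - g) * t for s, t in zip(c_src, c_tmpl)]

# A head template keeping 40% of a 5-word sentence:
# make_template(["a", "b", "c", "d", "e"], 0.4, "head")
# -> ["a", "b", "<X>", "<X>", "<X>"]
```

The gate collapses to an even 50/50 mixture when the gating weights are zero, and shifts toward the source or the template context as the decoder state pushes the sigmoid toward 1 or 0, which is the intuition behind letting the gate vary per decoding step.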
Authors: LI Qiang; WONG Fai; CHAO Sam; HAN Ya-Qian; XIAO Tong; ZHU Jing-Bo (Natural Language Processing Laboratory, Northeastern University, Shenyang 110000; Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory, University of Macao, Macao 999078; Shenyang Yatrans Network Technology Co., Ltd., Shenyang 110000)
Source: Chinese Journal of Computers (《计算机学报》), 2019, Issue 3, pp. 566-581 (16 pages); indexed in EI and CSCD, Peking University core journal.
Funding: National Natural Science Foundation of China (61432013, 61732005, 61672555); the Fundamental Research Funds for the Central Universities; University of Macau Multi-Year Research Grants (MYRG2017-00087-FST, MYRG2015-00175-FST, MYRG2015-00188-FST); Macao Science and Technology Development Fund and NSFC Joint Research Project (045/2017/AFJ).
Keywords: artificial intelligence; natural language processing; neural machine translation; translation template; gate unit