基于双语词典的远距离语对无监督神经机器翻译方法

Bilingual dictionary based unsupervised neural machine translation method for distant language pairs

下载PDF

导出

摘要为了缓解大型平行语料库稀缺性对机器翻译质量的影响,无监督方法在神经机器翻译领域备受关注,但其在远距离语言对上的翻译表现仍有待提高。因此,文中引入了翻译语言模型(TLM)并提出了Dict-TLM方法。该方法的核心思想是结合单语语料和无监督双语词典训练语言模型。具体而言,模型首先接受源语言句子作为输入,然后,不同于传统TLM只接受平行语料,Dict-TLM模型还接受源语言句子通过无监督双语词典处理后的数据作为输入,在这种输入中,模型将源语言句子中在双语词典中出现的单词替换为相应的目标语言翻译词,重要的是,该方法中的双语词典是无监督获得的。实验表明,Dict-TLM相对于传统无监督机器翻译在中英语言对上提高了3个BLEU分数。 Unsupervised methods,which strives to alleviate the impact of the scarcity of large parallel corpora on the quality of machine translation,have attracted much attention in the field of neural machine translation.However,their translation performances in distant language pairs still need to be improved.Therefore,the translation language model(TLM)is introduced and the Dict-TLM method is proposed.The core idea of this method is to train language models by combining monolingual corpora and unsupervised bilingual dictionaries.Specifically,the model accepts source language sentences and takes them as the input first,and then,unlike the traditional TLM that only accepts parallel corpora,the Dict-TLM model even accepts data from source language sentences processed by unsupervised bilingual dictionaries and takes them as the input.In this input,the proposed model replaces the words that appear in the bilingual dictionary in the source language sentence with the corresponding target language translation words.Importantly,the bilingual dictionary is obtained in an unsupervised manner.The experiment shows that the Dict-TLM improves the BLEU score by 3%in comparison with the traditional unsupervised machine translation in Chinese English language pairs.

作者黄孟钦 HUANG Mengqin(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China)

机构地区昆明理工大学信息工程与自动化学院

出处《现代电子技术》北大核心 2024年第7期161-164,共4页 Modern Electronics Technique

关键词无监督神经机器翻译远距离语言对预训练 TLM 双语词典双语词嵌入 unsupervised neural machine translation distant language pairs pre-training TLM bilingual dictionary bilingual word embedding

分类号 TN99-34 [电子电信—信号与信息处理] TP389.1 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

1朱志国,郭军军,余正涛.一种Mask交互融合预训练知识的低资源神经机器翻译方法[J].小型微型计算机系统,2024,45(3):591-597.
2周纭加,赵民,付继伟.固体火箭电缆线路雷电感应仿真研究[J].计算机仿真,2024,41(1):58-63.
3郑伟鑫,田锋,刘诚,李毅.正常踝关节影像学角度分析[J].美中国际创伤杂志,2023,22(4):50-53.
4王桂莲.文化理解导向下的翻译教学——评《中英语言文化对比与翻译》[J].中国教育学刊,2024(2).
5成桂红,郑爱燕,丁洁,邹琴燕,许咏乐,朱蕊,王馥新,吴惠华,李红,孟庆霞.IVF/ICSI-ET中胚胎实时监测系统与常规形态学评估选择性单胚胎移植累积活产率分析[J].实用妇产科杂志,2024,40(2):130-135.
6占思琦,徐志展,杨威,谢抢来.基于深度编码注意力的XLNet-Transformer汉-马低资源神经机器翻译优化方法[J].计算机应用研究,2024,41(3):799-804. 被引量：1
7郑咏滟,李文纯.数据驱动的国际奥林匹克委员会语言政策价值取向分析[J].语言文字应用,2023(4):34-49.
8沈骑,刘思琪.基于数据驱动方法的国际组织语言政策研究——以联合国关注的语言问题为例[J].语言文字应用,2023(4):20-33.
9滕梅,王一平,王小雨.铁路知识术语在近代中国的译介与地方化研究[J].中国翻译,2024,45(2):45-52.
10马杰森.罗什本《金刚经》术语英译历时研究[J].湘南学院学报,2024,45(1):74-79.

现代电子技术

2024年第7期

浏览历史

内容加载中请稍等...

基于双语词典的远距离语对无监督神经机器翻译方法

相关作者

相关机构

相关主题

浏览历史