
Research on Improved Method of Transformer Based on Differentiated Learning
Abstract: Neural machine translation models represented by the Transformer are a current research hotspot in machine translation. The multi-head attention mechanism is a key component of the Transformer: it strengthens the model's ability to extract different kinds of information and improves its generalization. However, some self-attention heads in the multi-head attention mechanism become ineffective. To address this problem, this paper proposes an improved Transformer based on differentiated learning, which applies a novel differentiated learning method during training to make the self-attention heads fully effective. Experimental results on several machine translation tasks show that, compared with the original Transformer, the Transformer improved with differentiated learning achieves higher BLEU scores.
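The abstract does not specify the exact form of the differentiated learning objective. The sketch below is one plausible reading, assuming the method adds a regularization term to the translation loss that penalizes similarity between the attention distributions of different heads, pushing redundant ("failed") heads toward distinct behavior. The function `head_diversity_penalty`, the `(logits, attn_weights)` model interface, and the `diversity_weight` coefficient are all illustrative assumptions, not the paper's actual formulation.

```python
# Sketch of a head-diversity regularizer for multi-head attention.
# ASSUMPTION: the paper's "differentiated learning" is modeled here as a
# pairwise-similarity penalty between heads; the real method may differ.
import torch
import torch.nn.functional as F


def head_diversity_penalty(attn_weights: torch.Tensor) -> torch.Tensor:
    """Mean pairwise cosine similarity between attention heads.

    attn_weights: (batch, num_heads, tgt_len, src_len) softmax outputs
    of one multi-head attention layer. Lower values mean more diverse heads.
    """
    b, h, t, s = attn_weights.shape
    flat = attn_weights.reshape(b, h, t * s)        # one vector per head
    flat = F.normalize(flat, dim=-1)                # unit-length vectors
    sim = torch.matmul(flat, flat.transpose(1, 2))  # (b, h, h) cosine sims
    # Zero the diagonal (each head's similarity with itself), keep pairs.
    off_diag = sim - torch.diag_embed(torch.diagonal(sim, dim1=1, dim2=2))
    return off_diag.sum() / (b * h * (h - 1))       # mean over head pairs


def training_loss(model, src, tgt, diversity_weight=0.01):
    """Hypothetical training step: translation loss plus diversity term.

    `model` is assumed to return (logits, attn_weights); for brevity only
    one layer's attention weights are penalized here, though in practice
    the term could be summed over all layers.
    """
    logits, attn_weights = model(src, tgt[:, :-1])
    nll = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)), tgt[:, 1:].reshape(-1)
    )
    return nll + diversity_weight * head_diversity_penalty(attn_weights)
```

Under this reading, the cross-entropy term trains the translation model as usual, while the added penalty discourages any two heads from attending to the same positions, which is one common way to keep otherwise-redundant heads useful.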
Author: DING Yi (丁义), Dezhou University, Dezhou, Shandong 253023
Institution: Dezhou University
Source: Software (《软件》), 2024, No. 7, pp. 91-94
Keywords: machine translation; Transformer; multi-head attention; differentiated learning