结合向量化方法与掩码机制的术语干预翻译模型

Terminology Intervention Translation Model Combining Vectorization Method and Mask Mechanism

下载PDF

导出

摘要术语干预神经机器翻译模型通常借助人为给定的术语翻译来改变译文,从而改善翻译质量。向量化干预方法为术语干预任务提供了新的范式,但仅考虑将术语与句子信息以向量的形式融合,没有关注术语信息对术语翻译效果的影响。为此,构建一种结合向量化方法与掩码机制的术语干预机器翻译模型,将人为给定的源端术语与目标端术语编码为特征向量,显式地融入机器翻译模型的编码器、解码器以及输出层。在训练阶段,借助掩码机制屏蔽注意力机制中源端术语对应的关键字,增强模型编码器与解码器对术语特征向量的关注。在推理阶段,利用掩码机制优化术语干预输出层的概率分布,进一步提高术语字符的翻译准确率。在WMT 2014德英和WMT 2021英中数据集上的实验结果表明,相较于基于原始向量化方法的Code-Switching机器翻译模型,所提模型的术语翻译准确率分别提升了9.27和2.95个百分点,并且能大幅度提升长术语的翻译准确率。 The terminology intervention Neural Machine Translation(NMT)model optimizes translations with the help of human-provided translations;this improves the translation quality.Recently,vectorization methods have emerged to provide a new paradigm for terminology intervention tasks;however,these methods consider only fusing terminology information with sentence information and neglect the low contribution of terminology vectors to terminology translation.To address these issues,a terminology intervention machine translation model combining the vectorization method and mask mechanism is built.This model encodes human-provided source terminology and target terminology into feature vectors and integrates them into the encoder,decoder,and output layers of the machine translation model.To enhance its attention to term feature vectors,the model uses a mask mechanism to mask the keys corresponding to the source-side terminologies in the attention mechanism during the training phase.In the inference phase,the probability distribution of the output layer is optimized to improve terminology generation.The experimental results on the WMT 2014 German-English and WMT2021 English-Chinese datasets show that,compared with the Code-Switching machine translation model based on the original vectorization method,the proposed model has improved the terminology translation accuracy by 9.27 and 2.95 percentage points,respectively,and can significantly improve the translation accuracy of long-terms.

作者张金鹏段湘煜 ZHANG Jinpeng;DUAN Xiangyu(School of Computer Science and Technology,Soochow University,Suzhou 215000,Jiangsu,China)

机构地区苏州大学计算机科学与技术学院

出处《计算机工程》 CAS CSCD 北大核心 2023年第11期70-76,84,共8页 Computer Engineering

基金国家自然科学基金(61673289)。

关键词机器翻译术语干预向量化注意力机制掩码机制 machine translation terminology intervention vectorization attention mechanism mask mechanism

分类号 TP391.2 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1冯洋,邵晨泽.神经机器翻译前沿综述[J].中文信息学报,2020(7):1-18. 被引量：36
2张知行,张佳影,高大启,阮彤,王俊,何萍,姚华彦.临床检验指标术语库的构建与病历挖掘应用[J].中文信息学报,2020,34(12):100-110. 被引量：1
3游新冬,杨海翔,陈海涛,孙甜,吕学强.融合术语信息的新能源专利机器翻译研究[J].中文信息学报,2021,35(12):76-83. 被引量：1
4张泽锋,毛存礼,余正涛,黄于欣,刘奕洋.融入领域术语词典的司法舆情敏感信息识别[J].中文信息学报,2022,36(9):76-83. 被引量：10
5董兴华,陈丽娟,周喜,周俊林,吐尔洪.吾司曼.汉维统计机器翻译中的形态学处理[J].计算机工程,2011,37(12):150-152. 被引量：5

二级参考文献24

1韩鹏宇,高盛祥,余正涛,黄于欣,郭军军.基于案件要素指导的涉案舆情新闻文本摘要方法[J].中文信息学报,2020,34(5):56-63. 被引量：8
2Arianna B, Marcello F. Morphological Pre-processing for Turkish to English Statistical Machine Translation[C] //Proc. of IWSLT’09. Tokyo, Japan:[s. n.] , 2009.
3Durgar E K, Oflazer K. Initial Explorations in English to Turkish Statistical Machine Translation[C] //Proc. of IEEE Int’l Conf. on Statistical Machine Translation. New York, USA:[s. n.] , 2006.
4Oflazer K, Durgar E K. Exploring Different Representational Units in English to Statistical Machine Translation[C] //Proc. of the 2nd Workshop on Statistical Machine Translation. Prague, Czech Republic:[s. n.] , 2007.
5Habash N, Sadat F. Arabic Preprocessing Schemes for Statistical Machine Translation[C] //Proc. of the Human Language Technology Conference.[S. l.] : IEEE Press, 2006.
6Zollmann A, Venugopal A, Vogel S. Bridging the Inflection Morphology Gap for Arabic Statistical Machine Translation[C] // Proc. of the Human Language Technology Conference. New York, USA:[s. n.] , 2006.
7李国臣, 孟静. 利用主语和谓语的句法关系识别谓语中心词[D]. 太原: 山西大学, 2005.
8Mathias C, Krista L. Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.0. Publications[EB/OL]. (2005-07-12). http:// www.cis.hut.fi/projects/morpho/.
9董兴华,周俊林,郭树盛,吐尔洪.吾司曼.基于短语的汉维/维汉统计机器翻译[J].计算机工程,2011,37(9):16-18. 被引量：15
10晋耀红.一种混合策略的专利机器翻译系统研究[J].计算机工程与应用,2012,48(4):29-32. 被引量：12