期刊文献+

基于复述增广的医疗领域机器翻译 被引量:1

Paraphrase Based Data Augmentation For Chinese-English Medical Machine Translation
下载PDF
导出
摘要 医疗机器翻译对于跨境医疗、医疗文献翻译等应用具有重要价值。汉英神经机器翻译依靠深度学习强大的建模能力和大规模双语平行数据取得了长足的进步。神经机器翻译通常依赖于大规模的平行句对训练翻译模型。目前,汉英翻译数据主要以新闻、政策等领域数据为主,缺少医疗领域的数据,导致医疗领域的汉英机器翻译效果不佳。针对医疗垂直领域机器翻译训练数据不足的问题,该文提出利用复述生成技术对汉英医疗机器翻译数据进行增广,扩大汉英机器翻译的规模。通过多种主流的神经机器翻译模型的实验结果表明,通过复述生成对数据进行增广可以有效地提升机器翻译的性能,在RNNSearch,Transformer等多个主流模型上均取得了6个点以上的BLEU值提升,验证了复述增广方法对领域机器翻译的有效性。同时,基于MT5等大规模预训练语言模型可以进一步地提升机器翻译的性能。 Medical machine translation is of great value for cross-border medical translation.Chinese to English neural machine translation has made great progress based on deep learning,powerful modeling ability and large-scale bilingual parallel data.Neural machine translation relies usually on large-scale parallel sentence pairs to train translation models.At present,Chinese-English translation data are mainly in the fields of news,policy and so on.Due to the lack of parallel data in the medical field,the performance of Chinese to English machine translation in the medical field is not compromising.To reduce the size of parallel data for training medical machine translation models,this paper proposes a paraphrase based data augmentation mechanism.The experimental results on a variety of neural machine translation models show that data augmentation through paraphrase augmentation can effectively improve the performance of medical machine translation,and has achieved consistency improvements on mainstream models such as RNNSearch and Transformers,which verifies the effectiveness of paraphrase augmentation method for domain machine translation.Meanwhile,the medical machine translation performances could be further improved based on large-scale pre-training language model,such as MT5.
作者 安波 龙从军 AN Bo;LONG Congjun(Institute of Ethnology and Anthropology,Chinese Academy of Social Sciences,Beijing,100081,China)
出处 《电子与信息学报》 EI CSCD 北大核心 2022年第1期118-126,共9页 Journal of Electronics & Information Technology
基金 国家自然科学基金(62076233) 中国社会科学院重大创新工程项目(2020YZDZX01-2)。
关键词 神经机器翻译 汉英翻译 复述生成 数据增广 大规模预训练语言模型 Neural machine translation Chinese to English translation Paraphrase generation Data augmentation Large scale pre-traing language model
  • 相关文献

参考文献2

二级参考文献15

  • 1俞士汶等.机器翻译译文质量自动评估系统[A]..中国中文信息学会1991年会论文集[C].,.314—319.
  • 2Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Fredrick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin, A Statistical Approach to Machine Translation [J],Computational Linguistics, 1990.
  • 3Peter. F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, Robert L. Mercer, The Mathematics of Statistical Machine Translation: Parameter Estimation [J], Computational Linguiatics, 19,(2), 1993.
  • 4F. J. Och, C. Tillmann, and H. Ney. Improved alignment models for statistical machine translation[A]. In Proc. of the Joint SIGDAT Conf. On Empirical Methods in Natural Language Processing and Very Large Corpora, pages 20-28, University of Maryland, College Park, MD, June 1999.
  • 5Franz Josef Och, Hermann Ney. What Can Machine Translation Learn from Speech Recognition? [A]In: proceedings of MT 2001 Workshop: Towards a Road Map for MT, 26-31, Santiago de Compostels,Spain, September 2001.
  • 6Franz Josef Och, Hermann Ney, Discriminative Training and Maximum Entropy Models for Statistical Machine Translation [A], ACL2002.
  • 7K. A. Papineni, S. Roukos, and R. T. Ward. Feature-based language understanding[A]. In European Conf. on Speech Communication and Technology, 1435-1438, Rhodes, Greece, September,1997.
  • 8K. A. Papineni, S. Roukos, and R. T. Ward. Maximum likelihood and discriminative training of direct translation models [A] In Proc. Int. Conf. on Accoustics, Speech, and Signal Processing,pages,189-192, Seattle, WA, May, 1998.
  • 9Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu, Bleu: a Method for Automatic Evaluation of Machine Translation [R], IBM Research, RC22176 (W0109-022) September 17, 2001.
  • 10Ye-Yi Wang, Grammar Inference and Statistical Machine Translation [D], Ph.D Thesis, Carnegie Mellon University, 1998.

共引文献167

同被引文献3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部