期刊文献+

ROUGE-SN:基于跨越N元语法的机器翻译评测方法 被引量:2

ROUGE-SN: A Machine Translation Evaluation Method Based on Skip-Ngram
下载PDF
导出
摘要 本文在ROUGE-S的基础上提出一种基于跨越N元语法的ROUGE-SN机器翻译评测方法,在跨越二元语法(Skip-bigram)的基础上尽量延长N元语法的长度,使更多的句子连贯信息得以体现。并通过设置N元语法的阈值、综合系统运行代价和译文匹配效率等因素选定ROUGE-S6对ROUGE-S进行改进。在俄汉双语句子数据集上对谷歌、百度、必应、有道在线翻译系统的俄汉翻译输出译文进行评测,ROUGE-S6方法与传统ROUGE-S以及BLEUS的评测结果一致且性能优于ROUGE-S和BLEUS;且基于跨越N元语法的ROUGE-S6使得ROUGE-S的性能得以提升,对于百度系统而言,ROUGE-S性能提升44.52%,对于谷歌系统而言,提升50.45%,对必应系统提升42.19%,有道系统中ROUGE-S性能提升40.01%。 This paper addressed a new machine translation evaluation method ROUGE-SN based on Skip-Ngram. This new method tried to extend the length of N-gram on the basis of Skip-bigram, to embody more sentence coherence information. What's more, it set Nth as the threshold value of N-gram, and selected ROUGE-S6 to improve the ROUGE-S performance considering the running cost of the system and the translations matching efficiency. Experiments were performed in Russian and Chinese bilingual sentence data set and it evaluated the output translations of online translation system such as Google, Baidu, Bing and Youdao. The evaluation results of ROUGE-SN were consistent with that of ROUGE-S and BLEUS, and the performance of ROUGE-SN was the best in the three. The experiment results also showed that the ROUGE-S6 based on Skip-Ngram improve the ROUGE-S performance. For Baidu, the ROUGE-S performance improved 44.52%; Google was 50.45%; Bing 42.19% and Youdao 40.01%.
出处 《数码设计》 2017年第3期1-5,共5页 Peak Data Science
基金 国家语委重点项目(ZDI135-26) 广东省高校特色创新项目(2015KTSCX035)
关键词 跨越N元语法 ROUGE-SN ROUGE-S 机器翻译评测 Skip-Ngram ROUGE-SN ROUGE-S machine translation evaluation
  • 相关文献

参考文献1

二级参考文献16

  • 1C Brew,H S Thompson.Automatic evaluation of computer generated text:A progress report on the TextEval project.Human Language Technology Workshop,Arpa,Isto,1994
  • 2Shiwen Yu.Automatic evaluation of quality for machine translation systems.Machine Translation,1993,8(1/2):117~126
  • 3A Guessoum,R Zantout.Semi-automatic evaluation of the grammatical coverage of machine translation systems.MT Summit Conf,Santiago de Compostela,2001
  • 4Sungryong Koh,Jinee Maeng,et al.A test suite for evaluation of English-to-Korean machine translation systems.MT Summit Conf,Santiago de Compostela,2001
  • 5Shoichi Yokoyama,Hideki Kashioka,et al.An automatic evaluation method for machine translation using two-way MT.MT Summit Conf,Santiago de Compostela,2001
  • 6Keiji Yasuda,Fumiaki Sugaya,et al.An automatic evaluation method of translation quality using translation answer candidates queried from a parallel corpus.MT Summit Conf,Santiago de Compostela,2001
  • 7Yasuhiro Akiba,Kenji Imamura,Eiichiro Sumita.Using multiple edit distances to automatically rank machine translation output.MT Summit Conf,Santiago de Compostela,2001
  • 8Jianmin Yao,Ming Zhou,et al.An automatic evaluation method for localization oriented lexicalised EBMT system.The 19th Int'l Conf on Computational Linguistics (COLING-2002),Taipei,2002
  • 9V I Levenshtein.Binary codes capable of correcting deletions,insertions and reversals.Doklady Akademii Nauk SSSR,1965,163(4):845~848
  • 10A R Wagner,M Fischer.The string-to-string correction problem.Journal of the ACM,1974,21(1):168~173

共引文献16

同被引文献34

引证文献2

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部