期刊文献+

N-Best句法知识增强的统计机器翻译预调序模型 被引量:1

N-best syntactic knowledge enhanced pre-reordering model for statistical machine translation
下载PDF
导出
摘要 源语言和目标语言的句法异构性对统计机器翻译(SMT)性能有重要影响。在基于短语的汉英统计机器翻译基础上,提出了一种基于N-best句法知识增强的源语言预调序方法。首先对源语言输入句子进行N-best句法分析,计算统计概率得到高可靠性子树结构,再根据词对齐信息从可靠性子树结构中抽取初始调序规则集。两种优化策略用于对初始规则集进行优化:基于中英文句法知识规则推导筛选和规则概率阈值控制机制。然后为减少短语内部调序,保证短语局部流利性,采用源语言短语翻译表为约束,使调序控制在短语块之间进行。最后根据获取的优化规则集和短语表约束条件对源语言端句子的句法分析树进行预调序。在基于NIST 2005和2008测试数据集上的汉英统计机器翻译实验结果表明,所提基于N-best句法知识增强的统计机器翻译预调序方法相对于基线系统,自动评价准则BLEU得分分别提高了0.68和0.83。 The syntactic heterogeneity between source and target languages has a significant impact on Statistical Machine Translation(SMT)performance. Based on the Chinese-English SMT system, an N-best syntactic knowledge enhanced method is proposed to pre-order the source-side sentences. Firstly, syntactic N-best parsed trees are generated, and highly reliable sub-trees are obtained by computing their posterior probabilities and then initial reordering rule set is extracted according to the word alignment links and sub-trees. Two optimization strategies are utilized to process the initial rule set,namely the bilingually syntactic knowledge-based and probability threshold-based. Secondly, in order to guarantee the local fluency of phrases, the phrase table is used to constrain the reordering only taking place between phrases rather than inside phrases. Finally, the optimized reordering rule set constrained by the phrase table is utilized to perform pre-reordering in source-side sentences. Experimental results on NIST 2005 and 2008 test sets show that the BLEU score improves 0.68 and 0.83 respectively compared to the baseline system.
作者 郭俊博 张喜媛 杜金华 GUO Junbo;ZHANG Xiyuan;DU Jinhua(Faculty of Higher Vocational and Technical Education, Xi’an University of Technology, Xi’an 710048, China;Faculty of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, China)
出处 《计算机工程与应用》 CSCD 北大核心 2016年第17期160-165,176,共7页 Computer Engineering and Applications
基金 国家自然科学基金(No.61100085) 陕西省自然科学基金(No.2015JM6328)
关键词 统计机器翻译 预调序模型 N-best句法树 调序规则 规则优化 statistical machine translation pre-reordering model N-best parsed tree reordering rules rule optimization
  • 相关文献

参考文献19

  • 1Och F J,Ney H.The alignment template approach tostatistical machine translation[J].Computational Linguistics,2004,30(4):417-449.
  • 2Tillman C.A unigram orientation model for statisticalmachine translation[C].Proceedings of the Human LanguageTechnology Conference of the North American Chapter of the Association for Computational Linguistics.Boston:ACL,2004:101-104.
  • 3Xiong D,Liu Q,Lin S.Maximum entropy based phrasereordering model for statistical machine translation[C].Proceedings of the 21st International Conference onComputational Linguistics and 44th Annual Meeting ofthe Association for Computational Linguistics.Sydney:ACL,2006:521-528.
  • 4Galley M,Manning C D.A simple and effective hierarchicalphrase reordering model[C].Proceedings of the2008 Conference on Empirical Methods in Natural LanguageProcessing.Hawaii:ACL,2008:848-856.
  • 5Green S,Galley M,Manning C D.Improved models ofdistortion cost for statistical machine translation[C].Proceedingsof Human Language Technologies:The 2010Annual Conference of the North American Chapter ofthe Association for Computational Linguistics.Los Angeles:ACL,2010:867-875.
  • 6Bisazza A,Federico M.Modified distortion matrices forphrase-based statistical machine translation[C].Proceedingsof the 50th Annual Meeting of the Associationfor Computational Linguistics.South Korea:ACL,2012:478-487.
  • 7Cherry C.Improved reordering for phrase-based translationusing sparse features[C].Proceedings of the 2013Conference of the North American Chapter of the Associationfor Computational Linguistics:Human LanguageTechnologies.Atlanta:ACL,2013:22-31.
  • 8Wang C,Collins M,Koehn P.Chinese syntactic reorderingfor statistical machine translation[C].Proceedings ofthe 2007 Joint Meeting of the Conference on EmpiricalMethods on Natural Language Processing(EMNLP)and on Computational Natural Language Learning(CoNLL).Prague:ACL,2007:737-745.
  • 9Patel R N,Gupta R,Pimpale P B,et al.Reordering rulesfor English-Hindi SMT[C].Proceedings of the SecondWorkshop on Hybrid Approaches to Translation of Associationfor Computational Linguistics(ACL).Sofia:ACL,2013:34-41.
  • 10吴秋锋,黄书剑,戴新宇,等.一种基于句法的用于汉英翻译的预调序方法[C].中国计算语言学研究前沿进展(CNCCL):北京,2011:50-56.

同被引文献5

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部