
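Research on a Hierarchical Phrase-Based Translation Method That Integrates Language Features into a Lexicalized Reordering Model (cited by: 3)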

A Lexicalized Reordering Model of Integrating with Language Features for Hierarchical Phrase-based Translation
Abstract: Targeting the linguistic characteristics of Vietnamese, this paper proposes a Chinese-Vietnamese statistical machine translation method that integrates language-difference features into a lexicalized reordering model. The method first analyzes the grammatical differences between Chinese and Vietnamese and extracts the word-order differences in attributive position, adverbial position, and modifier order. These difference rules are then formally defined and integrated into the lexicalized reordering model as features of a log-linear model. During training, the feature-augmented reordering model tunes the weights of rules that conform to these linguistic features, and during decoding those weights guide the selection of candidate translations. Experiments show that the resulting Chinese-Vietnamese hierarchical phrase-based translation model improves on the baseline hierarchical phrase-based system by 0.6 to 2.1 BLEU points.
Source: Computer & Digital Engineering (《计算机与数字工程》), 2017, No. 12, pp. 2389-2392, 2427 (5 pages in total)
Keywords: statistical machine translation; lexicalized reordering model; Chinese; Vietnamese; language features
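The log-linear integration described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature names (`tm`, `lm`, `reorder`, `lang_order`), the candidate scores, and the weights are all invented for illustration, and the weight tuning itself (for example by minimum error rate training) is not shown. The example uses the fact that Vietnamese places an adjective after the noun it modifies ("sách mới", literally "book new"), whereas Chinese places it before the noun.

```python
from typing import Dict, List

Features = Dict[str, float]


def loglinear_score(features: Features, weights: Dict[str, float]) -> float:
    """Log-linear model score: the weighted sum of feature values."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


def best_candidate(candidates: List[dict], weights: Dict[str, float]) -> dict:
    """Pick the candidate translation with the highest log-linear score."""
    return max(candidates, key=lambda c: loglinear_score(c["features"], weights))


# Hypothetical candidates for a Chinese "adjective + noun" phrase.
# "lang_order" is an assumed binary feature: 1.0 when the derivation applies
# a rule matching a known Chinese-Vietnamese order difference (modifier
# placed after the noun), 0.0 otherwise. All numbers are made up.
CANDIDATES = [
    {"text": "mới sách",
     "features": {"tm": -2.0, "lm": -3.0, "reorder": -1.0, "lang_order": 0.0}},
    {"text": "sách mới",
     "features": {"tm": -2.1, "lm": -3.0, "reorder": -1.2, "lang_order": 1.0}},
]

# Baseline: the language-difference feature carries no weight.
WEIGHTS_BASELINE = {"tm": 1.0, "lm": 1.0, "reorder": 1.0, "lang_order": 0.0}
# Tuned: training has assigned a positive weight to the feature (assumed value).
WEIGHTS_TUNED = {"tm": 1.0, "lm": 1.0, "reorder": 1.0, "lang_order": 0.5}
```

With these invented numbers, `best_candidate(CANDIDATES, WEIGHTS_BASELINE)` selects the candidate that keeps the Chinese order, while `best_candidate(CANDIDATES, WEIGHTS_TUNED)` selects "sách mới": a positive weight on the language-difference feature is enough to outweigh the small deficit in the other model scores.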

