期刊文献+

一个基于JAVA的堆栈式自然语言翻译解码器 被引量:1

Studies on a Stack-based Decoder for Statistical Natural Language Translation by JAVA
下载PDF
导出
摘要 解码是统计学自然语言翻译系统的重要一步,解码器的任务是用从训练文本中学习到的语言/翻译模型的信息来确定源句子最可能的翻译句子,解码器的输入是翻译模型和语言模型,以及源语言句子,输出源语言句子最可能的对应目标句子/翻译。由于可能的目标句子很多,通常解码算法只能搜索一小部分可能的目标语言句子。该文介绍了一种基于堆栈算法的,用Java实现的解码器。Java平台提供了方便的跨平台的应用,高度安全、开放、健壮。解码器的实现重点在于解码算法和参数的选择。 Decoding is a critical part in statistical machine translation system;the decoder's job is to find the most likely translation according to previous learned information from training corpora.The decoder takes translation model and language model information as well as source sentence as input,use some searching algorithm to find the target sentence that has maximum probability of being translation of given source sentence,typically decoding algorithms can only promise to search a small scope of possible translations due to the large scope of possible target sentences.In this report,a stack-based decoder based on statistical natural language translation principles are described and implemented.The programming language is Java.Java platform provides freedom of choice through multi-platform compatibility.It is a highly secure,open,robust,viable and flexible platform for developing.The software development is concentrated on the decoding algorithm and parameter selection.
作者 聂进
机构地区 林雪平大学
出处 《计算机工程与应用》 CSCD 北大核心 2005年第4期105-108,共4页 Computer Engineering and Applications
关键词 解码 解码器 基于统计模型的机器翻译 堆栈搜索 JAVA decoding,decoder,Statistical Machine Translation(SMT),stack search,Java
  • 相关文献

参考文献7

  • 1Brown,S D Pietra,V D Pietra et al.The mathematics of statistical machine translation:Parameter estimation[J].Computational Linguistics,1993;19(2):263~311.
  • 2Brown J Cocke,S Della Pietra,V Della Pietra et al.A Statistical Approach to Machine Translation[J].Computational Linguistics,1990;16(2).
  • 3Germann U,Jahr M,Knight K et a1.Fast decoding and optimal decoding for machine translation[C].In:Proc of the 39th Annual Meeting of the Association for Computational Linguistics(ACL),Toulouse,France,2001:228-235.
  • 4Wang Ye-Yi.Alex Waibel.Decoding Algorithm in Statistical Machine Translation[C].In:Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics,Madrid,Spain,1997.
  • 5Merkel,Magus.The Linkfiping English-Swedish Translation Corpus. Linkopings Sweden,1999.
  • 6Clarkson.Philip.Statistical Language Modeling Using The CMU_Cambridge Toolkit.http://svr-www.eng.cam.ac.uk/-prc14/eurospeech97.ps.
  • 7Och.Franz Josef.GIZA++readme.http://www-i6.informatik.rwthaachen.de/Colleagues/och/software/GIZA++.Html.

同被引文献5

  • 1OCH F J ,NEY H. Discriminative training and maximum entropy models for statistical machine translation[ C]. Proceed- ings of the40th Annual Meeting of the Association for Computational Linguistics (ACL). Philadelphia, PA: [ s. n ], 2002 : 295-302.
  • 2http ://www. iccs. inf. ed. ac. uk/- pkoehn/publications/phrase2003, pdf.
  • 3Wang Ye -yi, Warble. Decoding algorithm in statistical machine translation [ C ]. Proceedings of the 35th Annual Meet- ing of the Association for Computational Linguistics, Madrid, Spain, 1997.
  • 4KOEHN P. Pharaoh:A beam search decoder for phrase--based statistical machine translation models [ C ]. Proceedings of the Association of Machine Translation in the Americas ( AMTA - 2004).
  • 5张亚军,吐尔根·依布拉音,田生伟.汉语-维吾尔语句子级对齐系统分析及其实现[J].中国科技纵横,2010(6):126-126. 被引量:3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部