期刊文献+

汉语语言模型的规模对统计机器翻译系统的影响 被引量:1

The Scale of Chinese Language Models for Statistical Machine Translation System
下载PDF
导出
摘要 本文专门研究了汉语语言模型的规模大小,语法元数在英汉统计机器翻译系统中的影响。实验表明,对于同样的语言模型,基于层次短语的翻译系统明显比基于短语的翻译系统性能要好。对于不同的语言模型,它的元数和规模对翻译的结果有较大的影响,但不一定元数或者规模越大,所得到结果就越好。 This paper presents the effects of Chinese language models’ scale and n-gram’s dimension in English-Chinese machine translation systems. Experiments shows that for the same language models, hierarchical phrase -based MT system is better than phrase-based MT system, but for the same MT system, Language models’ scale and dimension effects the BLEU value obviously. It is not sure that a larger scale and higher dimension language model has a better result.
作者 王韦华 徐波
出处 《微计算机信息》 2010年第27期108-109,共2页 Control & Automation
关键词 N元语法 语言模型 基于短语的统计机器翻译系统 层次短语 N-gram Chinese language model Phrase-Based MT system Hierarchical Phrase
  • 相关文献

参考文献6

  • 1Phitipp Koehn, Franz Josef Och and Daniel Marcu. Statistical phrase-based translation. In: Proceedings of the Human Language Technology Conference and the North American Association for Computational Linguistics (HLT-NAACL). Edmonton, Canada. 2003. 127-133.
  • 2David Chiang. A hierarchical phrase-based model for statistical machine translation. In: Proceedings of the 43rd Annual Meeting of the ACL. 2005. 263-270.
  • 3董广宇,吕学强,王涛,施水才.基于N-gram语言模型的汉字识别后处理研究[J].微计算机信息,2009,25(10):276-278. 被引量:5
  • 4Andreas Stolcke. SRILM-An Extensible Language Modeling Toolkit. ICSLP. 2002.
  • 5Wei Wei, Wei Pang, Zhendong Yang, et al. CASIA SMT System for TC-STAR Evaluation Campaign. In: TC-STAR workshop. 2006.
  • 6Philipp Koehn, Amittai Axelrod, Alexandra Birch Mayne, et al. Edinburgh System Description for the 2005 IWSLT Speech Translation Evaluation. International Workshop on Spoken Language Translation. 2005.

二级参考文献7

  • 1夏莹,马少平,常新功,朱小燕,金奕江.基于统计的汉字识别文本自动后处理方法[J].模式识别与人工智能,1996,9(2):172-178. 被引量:14
  • 2Wong P K, Chan C. Post-processing statistical language models for a handwritten Chinese character recognizer [J]. IEEE Trans on Svstem, Man and Cybemetics, 1999, 29(2):286-291.
  • 3Witten, Ian H, and Timothy C. Bell. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transactions on Information Theory. 1991,37:1085-1094
  • 4Lee H J et al. A markov language model in handwritten Chinese text recognition. Proceedings of 2nd ICDAR Japan, 1993.
  • 5Christopher D. Manning , Hinrich Schatze.统计自然语言处理基础[M].北京:电子工业出版社,2005.1.
  • 6Daniel Jurafsky,James H.Martin.自然语言处理综论[M].北京:电子工业出版社,2005.
  • 7李元祥,丁晓青,刘长松.基于HMM的汉语文本识别后处理研究[J].中文信息学报,1999,13(4):29-34. 被引量:14

共引文献4

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部