期刊文献+

基于分层语块分析的统计翻译研究 被引量:7

Statistical Machine Translation Model Based on Hierarchical Chunking Phrase
下载PDF
导出
摘要 本文描述了一个基于分层语块分析的统计翻译模型。该模型在形式上不仅符合同步上下文无关文法,而且融合了基于条件随机场的英文语块分析知识,因此基于分层语块分析的统计翻译模型做到了将句法翻译模型和短语翻译模型有效地结合。该系统的解码算法改进了线图分析的CKY算法,融入了线性的N-gram语言模型。目前,本文主要针对中文-英文的口语翻译进行了一系列实验,并以国际口语评测IWSLT(International Workshopon Spoken Language Translation)为标准,在2005年的评测测试集上,BLEU和NIST得分均比统计短语翻译系统有所提高。 This paper describes a Hierarchical chunking-phrase based (HCPB) statistical translation model. The model not only comply with formal synchronous context-free grammar but also learned partial parsing knowledge using CRF (Conditional Random Fields) . Therefore it can be taken as combination of fundamental ideas from both syntax-based translation and phrase-based translation. The decoder for HCPB MT system is based on Chart-CKY algorithm, and integrates N-gram language model effectively. In our benchmark evaluation focusing on Chinese-English spoken language translation. The method achieves higher accuracy in measure of Bleu and NIST score in IWSLT2005.
出处 《中文信息学报》 CSCD 北大核心 2007年第5期87-90,117,共5页 Journal of Chinese Information Processing
基金 国家863计划资助项目(2006AA01Z194) 富士通合作项目(K0604040)
关键词 人工智能 机器翻译 基于分层语块分析的统计翻译模型 条件随机场 CKY算法 artificial intelligence machine translation hierarchical chunking-phrase based SMT conditional random fields chart-based CKY algorithm
  • 相关文献

参考文献14

  • 1Peter F.Brown,Stephen A.Della Pietra,Vincent J.Della Pietra,and Pobert L.Mercer.The Mathematics of Statistical Machine Translation:Parameter Estimation[J].Computational Linguistics,1993,19(2):263-311.
  • 2Philipp Koehn,Franz Josef Och,and Daniel Marcu.Statistical phrase-based translation[A].In:Proc.of NAACL[C].Edmonton,Canada:2003.48-54.
  • 3Richard Zens and Hermann Ney.A comparative study on reordering constraints in statistical machine translation[A].In:Proc.of ACL 2003[C].144-151.
  • 4Christoph Tillman.A unigram orientation model for statistical machine translation[A].In:HLT-NAACL Short Papers[C].Boston,Massachusetts,USA:2004.May 2-May 7,101-104.
  • 5David Chiang.A hierarchical phrase-based model for statistical machine translation[A].In:Proc.of ACL 2005[C].Ann Arbor,Michigan:June,263-270.
  • 6Alfred V.Aho and Jeffrey D.Ullman.Syntax directed translations and the pushdown assembler[J].J.Comput.Syst.Sci.,1969,3(1):37-56.
  • 7J.Lafferty A.McCallum and F.Pereira.Conditional random Fields:probabilistic models for segmenting and labeling sequence data[A].Harry Q.Bovik.Proceedings of ICML[C].Massachusetts,USA:2001.282-289.
  • 8Fei Sha and Fernando Pereira.Shallow Parsing with Conditional Random Fields[A].Eduard Hovy.Proceedings of HLT-NAACL[C].Edmonton,Alberta:2003.134-141.
  • 9F.J.Och and H.Ney.Discriminative training and maximum entropy models for statistical machine translation[A].In:Proceedings of the 40th Annual Meeting of the Association for Computational Linguistic[C].2002.295-302.
  • 10Fang Xu,Chengqing Zong,and Jun Zhao.A Hybrid Approach to Chinese Base Noun Phrase Chunking[A].In:Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing[C].Sydney:July 22-23.2006.87-93.

共引文献1

同被引文献76

引证文献7

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部