期刊文献+

引入基于主题复述知识的统计机器翻译模型 被引量:1

Improved statistical machine translation model with topic-based paraphrase
下载PDF
导出
摘要 针对传统的基于双语平行语料的复述获取方法在复述获取和应用的过程中忽视文档上下文的缺点,引入基于主题模型的上下文信息来改善复述获取—主要致力于如何计算上下文无关的复述生成概率和上下文相关的复述生成概率.研究如何将上述2种概率融入统计机器翻译建模,以提高翻译系统的性能.多个测试集上的实验结果证明了该方法的有效性. Abstract. To deal with the defect of the conventional parallel corpus based paraphrase extraction method which neglects document-level context, the paraphrase extraction and its application in statistical machine translation were improved by introducing the context based on topic model. The problem that how to better learn two kinds of paraphrase probabilities, topic-insensitive and topic-sensitive ones, was mainly analyzed. Both of the two probabilities can be incorporated into the modeling of statistical machine translation by using different methods. The experimental results on various test sets demonstrated the effectiveness of the approach.
出处 《浙江大学学报(工学版)》 EI CAS CSCD 北大核心 2014年第10期1843-1849,共7页 Journal of Zhejiang University:Engineering Science
基金 国家"十二五"科技支撑计划资助项目(2012BAH14F03) 国家自然科学基金资助项目(61005052 61303082) 高等学校博士学科点专项科研基金资助项目(2012012120046) 福建省自然科学基金资助项目(2011J01360) 厦门市科技计划资助项目(3502Z20103001) 深圳市高性能数据挖掘重点实验室资助项目(CXB201005250021A)
关键词 统计机器翻译 复述 主题模型 statistical machine translation paraphrase topic model
  • 相关文献

参考文献28

  • 1KOEHN P, OCH F J, MARCU D. Statistical phrase- based translation[C]// Proceedings of HLT-NAACL. Edmonton, Canada: ACL, 2003: 48- 54.
  • 2CHIANG D. Hierarchical phrase-based translation [J]. Computational Linguistics, 2007, 33(2): 201-288.
  • 3GALLEY M, GRAEHL J, KNIGHT K, et al. Sealable inference and training of context-rich syntactic transla- tion models [C] // Proceedings of ACL. Sydney: ACL, 2006:961 - 968.
  • 4LIU Yang, LIU Qun, LIN Shou-xun. 2006. Tree-to- string alignment template for statistical machine transla- tion [C] // Proceedings of ACL. Sydney: ACL, 2006: 609 - 616.
  • 5WU De-kai. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora[J]. Computa- tional Linguistics, 1997, 23(3): 377- 404.
  • 6XIONG De-yi, LIU Qun, LIN Shou-xun. Maximum entro- py based on phrase reordering model for statistical machine translation [-C] // Proceedings of ACL. Sydney: ACL, 2006:521 - 528.
  • 7刘群,王海峰,王惠临,等.机器翻译技术的进展与展望[EB/OL].2011-12-04.http:∥nlp.1et.ac.cn/li-uqun/index_zh.htm.
  • 8MITAMURA T, NYBERG E. Automatic rewriting for controlled language translation [C]//Proceedings of NL- PRS. Tokyo: ACL, 2001: 1 - 12.
  • 9YAMAMOTO K. Machine translation by interaction be- tween paraphraser and transfer[C]//Proeeedings of COLING. Taipei: ACM, 2002: 1107-1113.
  • 10ZHANG Y J, YAMAMOTO K. Paraphrasing of Chinese utterances [C] // Proceedings of COLING. Taipei : ACM, 2002: 1163- 1169.

二级参考文献1

共引文献13

同被引文献15

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部