期刊文献+

语料对齐工具的性能比较与选择 被引量:3

原文传递
导出
摘要 本文利用实验研究的方法,以文学、财经和科技三种文体为样本,对6款常见的语料对齐工具进行了比较研究。研究发现:(1)除Deja VuX3之外,相同文本使用docx和txt格式对对齐结果没有影响;(2) Transmate、ABBYY Aligner 2.0和memoQ 2015的对齐准确率位居前列,表现稳定;(3)使用不同体裁的文本,对齐质量也会不同。科技文本的对齐效果最佳,其次是财经和文学;(4)对齐准确率是评测对齐质量的主要指标,但不是唯一指标;(5)距离完美对齐的距离、句段长短、标签数量也影响对齐质量。本文还提出了对齐准确率的概念和计算公式。本研究对对齐工具的选择和改进具有一定参考作用。
作者 蔡辉
机构地区 中央财经大学
出处 《中国翻译》 CSSCI 北大核心 2019年第3期150-155,共6页 Chinese Translators Journal
  • 相关文献

参考文献3

二级参考文献34

  • 1姜柄圭,张秦龙,谌贻荣,常宝宝.面向机器辅助翻译的汉语语块自动抽取研究[J].中文信息学报,2007,21(1):9-16. 被引量:12
  • 2Huang Fei, Vogel S, Waibel A. Automatic extraction of named entity translingual equivalence based on multi-feature cost minimization//Proceedings of the 2003 Annual Confer- ence of the ACL, Workshop on Multilingual and Mixed-lan- guage Named Entity Recognition. Sapporo, Japan, 2003: 184-192.
  • 3Al-Onaizan Y, Knight K. Translating named entities using monolingual and bilingual resources//Proceedings of the 40th Annual Meeting of the Association for Computational Lin- guistics (ACL). Philadelphia, PA, USA, 2002:400 -408.
  • 4Feng Donghui, Lv Yajuan, Zhou Ming. A new approach for English Chinese named entity alignment//Proceedings of the Conference on Empirical Methods in Natural Language Pro cessing (EMNLP 2004). Barcelona, 2004 : 372-379.
  • 5Lee Chun-Jen, Chang Jason S, Jang Jyh-Shing R. Alignment of bilingual named entities in parallel corpora using statistical models and multiple knowledge sources. ACM Transactions on Asian Language Information Processing (TAMP), 2006, 5(2) : 121-145.
  • 6Moore R C. Learning translations of named-entity phrases from parallel corpora//Proceedings of lOth Conference of the European Chapter of ACL. Budapest, Hungary, 2003: 456- 464.
  • 7Krishman Vijay, Manning Christopher D. An effective two- stage model for exploiting non-local dependencies in named entity recognition//Proceedings of the 44th Annual Meeting of ACL. Sydney, 2006:1121-1128.
  • 8Ji Heng, Grishman Ralph. Collaborative entity extraction and translation//Proceedings of the International Conference on Recent Advances in Natural Language Processing. Borovets, Bulgaria, 2007:281-238.
  • 9Chen Hsin-His, Yang Changhua, Lin Ying. Learning formu- lation and transformation rules for multilingual named enti- ties//Proceedings of the ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition. Sapporo, Japan, 2003:1-8.
  • 10Berger Adam L, Della Pietra Stephen A, Della Pietra Vin- cent J. A maximum entropy approach to natural language processing. Computational Linguistics, 1996, 22(1) : 39- 72.

共引文献29

同被引文献99

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部