摘要
多语言信息存取(MLIA:Multilingual Information Access)是许多数字图书馆希望提供的一种服务。本文从MLIA的概念入手,探讨了机器翻译(MT)在实现数字图书馆的多语言信息存取服务中的作用。为了评测目前MT系统的性能以确定它们能否应用于数字图书馆的多语言信息存取,我们开发了一个多语种的元数据机器翻译综合评价平台HeMT(Human Evaluation of Machine Translation.http://txcdk-v10.unt.edu/HeMT/)。利用HEMT平台6大功能模块,评价了3个在线机器翻译系统所译的2000条元数据记录。这些记录来自北德克萨斯大学(UNT)图书馆和德克萨斯州历史门户(The Portal to Texas History),在报告评价结果的基础上,对所发现的机器翻译的错误进行了分析,文章最后提出了使用多引擎机器翻译(Multi-engine Machine Translation)技术以提高元数据记录翻译性能的设想。
This paper describes the role of machine translation (MT) for multilingual information access, a service that is desired by digital libraries that wish to provide cross-cultural accessto their collections. To understand the performance of current MT systems, we have developed an integrated multilingual evaluation platform HeMT(http://txcdk-v10.unt.edu/HeMT/). HeMT consists six functional modules to facilitate human evaluation of machine translation. We have performed evaluation of three online MT services on 2000 metadata records that were extracted from two digital collections. This paper will report the evaluations on Chinese translation. Additionally, we present the analysis of the translation errors and propose Multi-engine MT strategies to improve translation performance.
出处
《公共图书馆》
2012年第3期51-58,共8页
Public Library Journal
基金
美国博物馆及图书馆服务机构(IMLS)的资助
资助号LG-06-10-0162-10