摘要
对利用汉英双语语料进行翻译等价词对抽取进行了探索 .对不同的词对关联度量方法进行了对比 ,考察了词性信息、多词翻译单元识别对翻译等价词对抽取结果所可能产生的改进以及负面影响 .
This paper examines how the translation equivalent pairs could be extracted from parallel corpus. Four different word association measurements are compared with each other. And this paper also proposes to use part of speech information for performance improvement and integrate a multi-word unit identification component to alleviate the problem of partial correct result. Some negative aspects of these proposals are also examined in this paper.
出处
《计算机学报》
EI
CSCD
北大核心
2003年第5期616-621,共6页
Chinese Journal of Computers
基金
国家"九七三"重点基础研究发展规划项目 (G19980 3 0 5 0 7 4)资助