摘要
计算机辅助翻译的核心——翻译记忆的积累造就了语料库。语料库的大小决定了计算机辅助翻译软件的效能。因此,语料库建设是CAT用户必做的一门功课。在语料库的建设中,会遇到句库切分、重复、杂质等问题。对此大规模处理的最快捷的手段莫过于软件。文章提出的一种软件处理方法,能较好地解决并处理语料库建设中遇到的绝大部分问题。
The core of Computer Aided Translation- the accumulation of translation memory makes the corpus. The size of corpus determines the efficiency of computer aided translation software. Therefore, the construction of the corpus is a subject that CAT users must do. In the construction of corpus, sentence segmentation, will encounter problems such as repetition, impurity. The most efficient way to deal with this large scale is software. A software processing method is proposed in this paper,which can solve and deal with the problems encountered in the construction of the corpus.
出处
《科教导刊》
2016年第8X期52-53,共2页
The Guide Of Science & Education
关键词
句库
拆分
重复
杂质
删除
explosive
split
repeat
impurity
delete