摘要
从网络技术的角度,阐述了与大学英语语料库标注相关的文本预处理、标注工具、标注格式、标注格式转换、标注准确率、标注校对等问题。
As a pedagogic corpus, the Colen Corpus has an annotation format based on the HTML technology, which is quite different from other corpus. This article outlines the philosophies related to the annotation under discussion, covering text-preparation, annotation process, annotation format exchange, and the annotation accuracy.
出处
《海南大学学报(人文社会科学版)》
CSSCI
2006年第2期281-284,共4页
Journal of Hainan University (Humanities & Social Sciences)