摘要
维语语料库是汉维机器翻译系统的核心基础,同时也是对维语中的词语进行统计的一个完整和完善的基础模型。在此我们要介绍管道与过滤器模式在维语语料库的构造和维语词频统计系统中的应用。第一步介绍管道和过滤器概念,其次说明建立维语语料库过程和选择语料库原则,最后陈述维语词频统计结果和进行分析。
As the core of Uyghur - Chinese artificial translation system, Uyghur Corpus is synchronously the full and perfect basic method of Uyghur vocabulary statistics. Hence we introduce the applications of pipeline and filtrate methods in Uyghur phraseology frequency statistic system and the construction of Corpus. Firstly, we introduce the concepts of pipeline and filtrate. In the second place, we illustrate the process that the Corpus is constructed and the principles of how the Corpus is selected. Lastly, we analyze the result of Uyghur Phraseology Frequency Statistics.
出处
《新疆师范大学学报(自然科学版)》
2006年第2期16-20,共5页
Journal of Xinjiang Normal University(Natural Sciences Edition)
基金
国家自然科学基金:项目编号:(60163002
60463005)
新疆维吾尔自治区科技厅少数民族特培人员启动资金项目资助