Abstract
Starting from the specific requirements that a large computer-based news retrieval system places on a Chinese retrieval language, this paper analyzes the structure of Chinese thesauri and, using abstract algebra as the tool for derivation, gives a formal description of thesaurus structure. It proposes that the intrinsic structural relations of a thesaurus can be used to partition a large thesaurus (term set) into a number of mutually independent small term sets. A multi-valued relation matrix algorithm is proposed for checking by computer whether a thesaurus is constructed correctly. The paper also examines Chinese-language processing, thesaurus construction, correctness checking, thesaurus maintenance, and document retrieval for Chinese thesauri.
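The abstract does not give the details of the matrix algorithm or of the partitioning rule, so the following is only a minimal sketch of what such checks could look like, not the paper's method. It assumes a thesaurus is stored as a square matrix whose entries are relation codes, tests that every recorded relation has its inverse recorded in the opposite direction, and groups terms into sets that no relation crosses. The relation codes (`BT`, `NT`, `RT`, `USE`, `UF`), the function names, and the sample terms are all hypothetical.

```python
# Hypothetical relation codes for the entries of a multi-valued relation matrix.
NONE, BT, NT, RT, USE, UF = 0, 1, 2, 3, 4, 5
# Assumed reciprocity rules: broader/narrower and use/used-for are inverse pairs,
# and "related term" is symmetric.
INVERSE = {BT: NT, NT: BT, RT: RT, USE: UF, UF: USE}


def build_matrix(terms, relations):
    """Build an n x n matrix where m[i][j] is the relation code from term i to term j."""
    index = {t: i for i, t in enumerate(terms)}
    n = len(terms)
    m = [[NONE] * n for _ in range(n)]
    for a, rel, b in relations:
        m[index[a]][index[b]] = rel
    return m


def check_reciprocity(terms, m):
    """Return (a, b) pairs whose recorded relation lacks the matching inverse entry."""
    errors = []
    n = len(terms)
    for i in range(n):
        for j in range(n):
            rel = m[i][j]
            if rel != NONE and m[j][i] != INVERSE[rel]:
                errors.append((terms[i], terms[j]))
    return errors


def partition(terms, m):
    """Split terms into groups such that no relation links two different groups."""
    n = len(terms)
    seen = [False] * n
    groups = []
    for start in range(n):
        if seen[start]:
            continue
        seen[start] = True
        stack, group = [start], []
        while stack:
            i = stack.pop()
            group.append(terms[i])
            for j in range(n):
                # Treat a relation in either direction as a link between i and j.
                if not seen[j] and (m[i][j] != NONE or m[j][i] != NONE):
                    seen[j] = True
                    stack.append(j)
        groups.append(sorted(group))
    return groups


# Made-up sample data; the inverse entry ("vehicle", NT, "truck") is deliberately missing.
terms = ["vehicle", "car", "truck", "press", "newspaper"]
relations = [
    ("car", BT, "vehicle"), ("vehicle", NT, "car"),
    ("truck", BT, "vehicle"),
    ("press", NT, "newspaper"), ("newspaper", BT, "press"),
]
matrix = build_matrix(terms, relations)
print(check_reciprocity(terms, matrix))
print(partition(terms, matrix))
```

In this sketch the reciprocity check flags the incomplete truck/vehicle pair, and the partition step separates the vehicle terms from the press terms, so each group could be checked and maintained on its own.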
Source
《计算机学报》
EI
CSCD
PKU Core (Peking University core journal list)
1990, No. 1, pp. 48-56 (9 pages)
Chinese Journal of Computers