摘要
用于汉语文献自动标引的词典组织结构对自动标引的效率有很大影响,自动标引中运用的词典查找算法有其自身的特点,符合这种特点的词典结构能提高自动标引过程中分词的速度。本文在分析了几种常用的词典结构的空间效率和时间效率之后,提出了一种通用而高效的词典组织方法。采用这种方法的词典,其体积可以减小到原来的0.4倍,分词速度提高到原来的2.5倍。
The dictionary structure used for automated indexing system of Chine-se document can produce a great impact on the indexing efficiency,The searching algorithm used in automated indexing process has a style of its own,so,the dict-ionary structure making full use of the style can increase the speed of the separ-ating word.After analysing the space and time efficiency of some dictionary str-uctures,the article puts forward a general and efficient method,Using the met-hod,the volume of dictionary can. be cut down to 0.4 times of it,and the speed of separating word will be increased to 2.5 times.
出处
《情报学报》
CSSCI
北大核心
1995年第1期9-15,共7页
Journal of the China Society for Scientific and Technical Information