摘要
作者从实际应用的角度对汉语文献自动标引的两种算法进行了改进。提出将非用字后缀表法改进为,考察相邻三字之间的联系关系,实现一次扫描完成分词;
This paper improves on the two algorithms of the Chinese literature auto indexing,the method of Stop Word Suffix List and the method of Single Chinese Character.To Stop Word Suffix List,the author proposes once extracting indexing words from text by analysing the relations of every together three Chinese characters,and then,creates the method of matching directly first Chinese character to improve the Single Chinese Character indexing technique.
出处
《情报学报》
CSSCI
北大核心
1996年第6期426-430,共5页
Journal of the China Society for Scientific and Technical Information