摘要
中文信息的标引是国内信息导航系统实现的基础,汉语分词和语义提取是目前尚未解决的难题。本文比较了信息检索系统中目前主要使用的标引方法,根据国内信息导航系统处理对象的“中文”特征,提出了关键词标引与全文标引相结合的混合标引方法,并给出了具体的实现方法,较好地解决了查全、查准和标引空间的增长问题。文中最后也给出了中文信息标引处理后入库的数据的检索方法。
We are all aware that a Chinese information indexing subsystem is the foundation of the internal information navigating system. As far as we know, how to distinguish between noise words and significant words and how to pick out the meaning in words from the text are hard problems which remain unsolved. We have put forward a new Chinese information manipulation method, which combines the keyword indexing method with the whole - length indexing method, compares with the indexing methods commonly used at present, and takes the features of Chinese in the internal information navigating system.Its implementation is also provided.This new method preferably provides a full and precise retrieval and solves the difficulty in the increase of the indexing space. At length, the data which is manipulated by the Chinese information indexing subsystem are also provided.
出处
《计算机应用与软件》
CSCD
北大核心
2002年第5期37-40,共4页
Computer Applications and Software