摘要
该文运用在西文信息检索中非常成功的向量空间模型来解决中文信息检索的问题,在中文文档的特征项抽取,加权、相似度计算,模型的建立等方面做了一些探讨,并建立系统原型,在小范围内进行了测试。
The full text seaching in the CJK(Chinese-Japanese-Korean)platform is a classical problem in the infor-mation searching field.This paper introduces the classical Vector Space Model(VSM)in the IR field and adoptes it into the Chinese full text seaching.The paper analyzes the problem laid in the word segmentation and approaches on the symbol-based searching in the Chinese environment.The paper makes a prototype to test the approach.The results are also presented in the paper.
出处
《计算机工程与应用》
CSCD
北大核心
2003年第15期109-111,共3页
Computer Engineering and Applications