摘要
在海量信息中检索时,与用户查询相关的信息常常被漏掉,而与查询无关的信息——信息垃圾,却大量地出现在检索结果中。改进文本信息检索系统的质量,提高检索效能,已成为亟待解决的问题。本文针对能够影响检索效力的一个易被忽略的因素——修饰语,研究其在文本信息检索中的作用。为此,构建了修正的向量空间模型(Modified Vector Space Model,MVSM),并以英文文本进行试验,进而说明修饰语的作用。
It happens more often than not that when people retrieve documents among a mass of information, the exact information relevant to the user's query can't be obtained,on the contrary too much information trash,which is not relative to the user's, is cover a large proportion. Therefore, improving the quality and effectiveness of the information retrieval (IR) system has become a desired issue. The objective of this paper is to research into the importance of modifiers, which is a factor often ignored but can influence the effectiveness of IR system, to document information retrieval. According to this, a modified vector space model (MVSM) is built. Experiments using English documents are also done to show the importance of modifiers.
出处
《情报学报》
CSSCI
北大核心
2006年第3期306-311,共6页
Journal of the China Society for Scientific and Technical Information
基金
国际合作项目:日本佳思腾株式会社资助.