摘要
以属性论为理论依据,分析了文本属性与属性重心剖分模型的关系,建立了文本属性重心剖分模型,并在属性坐标系中表示文本向量与查询式向量,确定向量之间的匹配基准,计算匹配距离,从而建立一个文本与查询式之间的匹配相似度计算公式.该模型有效地描述文本属性和查询式属性之间的关系.
Generally, in the process of information retrieval(IR), the users first put forward their key words to the system that they want to search. Then the key words are analyzed to the special format, they are matched with the document database whose results are considered as the results that are related to the users' interests. There are several IR models, such as reverse document model, vector space model, generalized vector space model and latent semantic model and so on. According to attribute theory, this paper analyses the relationship between textual attributes and the attribute barycenter coordinate model, and establishes the text attribute barycenter coordinate model. Within the coordinate, a text vector and a query vector can be represented. After deciding the criterion and computing the distance between the vectors, a formula that computes the similarity between the texts and the queries is shown.
出处
《计算机学报》
EI
CSCD
北大核心
1999年第6期651-655,共5页
Chinese Journal of Computers
基金
国家八六三高技术研究发展计划
关键词
信息检索
人工智能
属性论
文本相似度
计算
Information retrieval, artificial intelligence, attribute theory.