摘要
本文介绍一种基于多元统计分析的电子文本自动分类方法,可对从Internet 的搜索引擎上获取的检索结果进行分类,既可过滤掉非相关类别的文挡,又可将相关文挡按照相关的紧密程度从高到低排序,方便用户查询,有助于在保证信息检索召回率的前提下提高信息检索的精度。
In this paper,the algorithm of aut 0matic categorizing based on multivariate statistical analysis is presented.It categorizes texts from search engine and filtrates irrelevant texts.And it is not only able t 0 arrange relevant texts in the relativity order for convenience of reference,but also t 0 improve the precision of retrieving information on premise of the guaranteed rate of recalling information.
关键词
检索
分类
多元统计
Search,Categorize,Multivariate Statistical Analysis