Abstract
Industry-specific topical search places very different demands on topic coverage than general-purpose search engines do. To address the problem of industry information search, this paper studies and improves a vector-space-based computation of relevance to the chemical industry topic, as well as the classic PageRank page-ranking algorithm, and builds a vertical search engine for chemical industry information resources on top of the Nutch search engine architecture. Compared with a general-purpose search engine, the system filters out unnecessary search results, runs faster, and improves the accuracy of industry information search.
Source
Journal of Sichuan University of Science & Engineering (Natural Science Edition)
CAS
2011, No. 1, pp. 71-73 (3 pages)
Funding
Talent Introduction Research Start-up Project of Sichuan University of Science & Engineering (07ZR41)