摘要
随着互联网的高速发展,海量信息越来越多,搜索引擎技术发展很快,但是搜索引擎的搜索结果仍然不能满足人们的搜索要求,引入k-means聚类算法对Web文档进行聚类,为了提高聚类性能,引入蛙跳算法进行k值的选取。目的是提高搜索结果的准确性,增加搜索引擎返回结果与查询主题的相关性。
With the rapid development of Internet,the amount information will become more and more abundantly,The technology of Search Engine develop rapidly,but the result of research don't meet people' requirement of search,The article introduce K-means cluster algorithm for the cluster of web document.Selecting k value with shuffled frog-leaping algorithm in order to improve performance of cluster,The purpose of which improve accuracy of result of search,enhance relativity between the return' result of search engine and bring people better performance.
出处
《电脑开发与应用》
2011年第5期35-37,共3页
Computer Development & Applications
关键词
K-MEANS
聚类算法
蛙跳算法
文本聚类
k-means
cluster algorithm
shuffled frog-leaping algorithm
text cluster