Research on Web Page Classification Method Based on Query Log
Authors: 叶飞跃, 马祎星. Journal of Shanghai Jiaotong University (Science), EI-indexed, 2018, Issue 3, pp. 404-410 (7 pages)
Web page classification is an important application in many areas of Internet information retrieval, such as directory classification and vertical search. Query-log-based methods are a lightweight form of Web page classification: they avoid crawling Web content and are therefore relatively efficient, but the sparsity of user click data makes it difficult to build a classifier from that data directly. To solve this problem, we explore the semantic relations among different queries through word embedding and propose three improved graph-structure classification algorithms. First, to reflect the semantic relevance between queries, we map each user query into a low-dimensional space according to its query vector. Then, we calculate the uniform resource locator (URL) vector from the relationship between queries and URLs. Finally, we use the improved label propagation algorithm (LPA) and the bipartite graph expansion algorithm to classify the unlabeled Web pages. Experiments show that our methods improve the F1-value by about 20% over other Web page classification methods based on query logs.
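The three steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the improved LPA or the bipartite graph expansion algorithm, so plain label propagation with clamped seed labels stands in for them. The click-weighted averaging in `url_vectors`, the cosine affinity graph, the function names, and the toy data are all assumptions made for illustration, and the query vectors are assumed to be precomputed (for example, from word embeddings).

```python
import numpy as np

def url_vectors(query_vecs, clicks):
    """Assumed step 2: each URL vector is the click-weighted mean of the
    vectors of the queries that led to it.
    query_vecs: (n_queries, dim); clicks: (n_queries, n_urls) click counts."""
    totals = np.maximum(clicks.sum(axis=0, keepdims=True), 1)
    weights = clicks / totals                # normalise each URL's column
    return weights.T @ query_vecs            # (n_urls, dim)

def cosine_affinity(X):
    """Non-negative cosine similarity between rows, with no self-loops."""
    norms = np.maximum(np.linalg.norm(X, axis=1, keepdims=True), 1e-12)
    Xn = X / norms
    W = np.clip(Xn @ Xn.T, 0.0, None)
    np.fill_diagonal(W, 0.0)
    return W

def propagate_labels(W, labels, n_classes, n_iter=50):
    """Plain label propagation (a stand-in for the paper's improved LPA).
    labels: integer class per node, -1 for unlabeled nodes."""
    labeled = labels >= 0
    Y = np.zeros((len(labels), n_classes))
    Y[labeled, labels[labeled]] = 1.0
    P = W / np.maximum(W.sum(axis=1, keepdims=True), 1e-12)  # row-stochastic
    F = Y.copy()
    for _ in range(n_iter):
        F = P @ F                # spread label mass along graph edges
        F[labeled] = Y[labeled]  # clamp the known labels
    return F.argmax(axis=1)

# Toy data (hypothetical): 4 queries in a 2-d embedding space, 4 URLs.
query_vecs = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
clicks = np.array([[5, 1, 0, 0],
                   [0, 3, 0, 0],
                   [0, 0, 4, 0],
                   [0, 0, 0, 2]])
labels = np.array([0, -1, 1, -1])            # URLs 1 and 3 are unlabeled

U = url_vectors(query_vecs, clicks)
pred = propagate_labels(cosine_affinity(U), labels, n_classes=2)
print(pred)
```

Because URLs inherit dense vectors from their queries, two URLs that share no clicked query can still be linked in the graph, which is how an embedding-based approach can sidestep sparse click data.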
Keywords: Web page classification; word embedding; query log