期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
A new focused crawler using an improved tabu search algorithm incorporating ontology and host information
1
作者 Jingfa LIU Zhen WANG +1 位作者 Guo ZHONG Zhihe YANG 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2023年第6期859-875,共17页
To solve the problems of incomplete topic description and repetitive crawling of visited hyperlinks in traditional focused crawling methods,in this paper,we propose a novel focused crawler using an improved tabu searc... To solve the problems of incomplete topic description and repetitive crawling of visited hyperlinks in traditional focused crawling methods,in this paper,we propose a novel focused crawler using an improved tabu search algorithm with domain ontology and host information(FCITS_OH),where a domain ontology is constructed by formal concept analysis to describe topics at the semantic and knowledge levels.To avoid crawling visited hyperlinks and expand the search range,we present an improved tabu search(ITS)algorithm and the strategy of host information memory.In addition,a comprehensive priority evaluation method based on Web text and link structure is designed to improve the assessment of topic relevance for unvisited hyperlinks.Experimental results on both tourism and rainstorm disaster domains show that the proposed focused crawlers overmatch the traditional focused crawlers for different performance metrics. 展开更多
关键词 Focused crawler Tabu search algorithm ONTOLOGY Host information Priority evaluation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部