Abstract
Drawing on the current state of research in focused crawling and ontology learning, this paper designs and implements an ontology-based focused crawling system with active learning. By better planning the crawling workflow and dividing the system into functionally independent modules, it improves both the crawling efficiency of the whole system and the accuracy with which relevant web pages are retrieved.
Source
《长春工程学院学报(自然科学版)》
2011, No. 1, pp. 128-130 (3 pages)
Journal of Changchun Institute of Technology: Natural Sciences Edition
Keywords
focused crawling
ontology learning
relevance calculation
ontology