
Research and Implementation of Ontology-Based Active-Learning Focused Crawling
Abstract: Building on the current state of research in focused crawling and ontology learning, this paper designs and implements an ontology-based active-learning focused crawling system. By planning the crawler's workflow more carefully and dividing the system into functionally independent modules, it improves both the overall crawling efficiency and the accuracy with which relevant web pages are retrieved.
Authors: 任斌 (Ren Bin), 毛应爽 (Mao Yingshuang)
Source: Journal of Changchun Institute of Technology (Natural Sciences Edition) 《长春工程学院学报(自然科学版)》, 2011, No. 1, pp. 128-130 (3 pages)
Keywords: focused crawling; ontology learning; correlation calculation; ontology


