期刊文献+

基于语义的主题爬行策略 被引量:12

Semantic-Based Focused Crawling Approach
下载PDF
导出
摘要 为使主题爬行能够充分利用资源的语义信息,提出基于语义的主题爬行策略.该策略利用领域本体刻画爬行主题,将本体语义映射到关键词表.通过定义断言集一致性扩展和域值关联推理任务,推演关键词间语义关系.在定义网页主题概念的基础上,结合本体推理方案提出主题概念的语义叠加效应模型.最后,利用主题概念的语义包含关系判定URLs抓取顺序.实验结果表明,该语义主题爬行策略在抓取收获率和爬行效率上优于现有同类方法,该方案有效、可行. An approach of semantic-based focused crawling is proposed in order to use semantic resource efficiently. In this paper, a domain-ontology is used to describe the topic of Web crawling. Lexicon of the keywords list are mapped to ontology, and semantic of words are obtained through mapping. Inference services about assertion set expanding and domain-range relation are defined. The semantic relation among keywords can be inferred by inference services. At the same time, the definition of concept about Web page is given. A semantic computational model is proposed by combining inference services mentioned above. In the end, the order of URLs corresponding to their Web page is decided according to the subsumption of topic concepts. The result show that this approach is advanced in harvest-rate and crawling efficiency and is better than some classical algorithms.
出处 《软件学报》 EI CSCD 北大核心 2011年第9期2075-2088,共14页 Journal of Software
基金 国家自然科学基金重大项目(60496320 60496321) 国家自然科学基金(60873148 60973089) 吉林省科技发展计划(20080107) 欧盟合作项目(155776-EM-1-2009-1-IT-ERAMUNDUS-ECW-L12) 符号计算与知识工程教育部重点实验室开放基金(450060326019)
关键词 本体 语义WEB 主题爬行 Tableau演算 ontology semantic Web focused crawling Tableau calculus
  • 相关文献

参考文献3

二级参考文献47

  • 1苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:383
  • 2Borst W N. Construction of Engineering Ontologies for Knowledge Sharing and Reuse: [Ph D Thesis][D]. University of Twente, 1997:11-12.
  • 3Baader F, Horrocks I, Sattler U. Description logics as Ontology Languages for the Semantic Web[C]// Festschrift in Honor of Jorg Siekmann, 2003 : 129-135.
  • 4Horrocks I, Patel-sehneider P F. Reducing OWL Entailment to Description Logic Satisfiability[C] //Proc of the 2003 Int'l Semantic Web Conf, 2003: 17-29.
  • 5Lutz C, Sattler U, Tendera L. The Complexity of Finite Model Reasoning in Description Logics[J]. Information and Computation, 2005,199(1-2) : 132-171.
  • 6Schmidt S, Smolka. Atrributive Concept Descriptions with Complements[J]. Artificial Intelligence, 1991,48(1) :1-26.
  • 7Kazakov Y, Sattler U, Zolin Z. How Many Legs Do I Have? [C]//Non-Simple Roles in Number Restrictions Revisited. LPAR'07, 2007:15-19.
  • 8Horroeks I, Sattler U, Tobies S. A PSPACE-Algorithm for Deciding ALCNIR+ Satisfiability[R]. Technical Report 98- 08, LuFg Theoretical Computer Science, RWTH Aachen, Germany, 1998: 62-73.
  • 9Ding Y, Haarslev V. Tableau Caching for Description Logics with Inverse and Transitive Roles[C]//Proc of Int' 1 Workshop on Description Logics, 2006 : 143-149.
  • 10Horroeks I, Sattler U. A Description Logic with Transitive and Inverse Roles and Role Hierarehies[J]. Journal of Logic and Computation, 1999,9(3) :385-410.

共引文献11

同被引文献73

引证文献12

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部