期刊文献+

针对主题挖掘的通用设计与性能评估框架的研究

The Research Of A Generic Design And Evaluation Framework Aiming At Topics Crawling
下载PDF
导出
摘要 本文不是设计一种新的主题挖掘机,而是借助分层思想,为每层赋予特定的任务,采用容器管理机制,提出了一种针对主题挖掘的通用设计框架,该设计框架旨在规范和指导主题挖掘机的研发设计步骤;同时提供了相应的评估框架,其中包括几个重要的评估参数,根据这些参数的实验值可以估量挖掘机的性能,以此推动高效挖掘机的研发设计。 This paper proposes a generic design framework aiming at topics crawling by means of delamination idea, which assigns each level one special task, and container management mechanism, instead of working on another project about topics distillation. This generic design framework helps us regulate and instruct the devisal of focused crawlers. In addition to the part before, we at the same time provide the corresponding evaluation framework which consists of a couple of important evaluation parameters, according to whose results we can advance and instruct the crawlers design.
出处 《微计算机信息》 北大核心 2006年第05X期172-174,共3页 Control & Automation
基金 国家自然科学基金(No.60273072) 国家高技术研究发展计划(863)(No.2002AA423450)资助
关键词 主题挖掘 框架 模型 性能评估 Topics Crawling framework model performance evaluation
  • 相关文献

参考文献6

  • 1"Information Retrieval in Distributed Hypertexts", P. De Bra,G. Houben, Y. Kornatzky and R. Post. In Proceedings of the 4th RIAO Conference, 481 - 491, New York, 1994.
  • 2"Efficient Crawling Through URL Ordering", J. Cho, H. Garcia-Molina, L Page. In Proceedings of the 7th International WWW Conference, Brisbane, Australia,April 1998.
  • 3"Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery", S. Chakrabarti, M. van den Berg and B.Dom. In Proceedings of the 8th International WWW Conference,Toronto, Canada, May 1999.
  • 4Diligenti M., Coetzee F., Lawrence S., Giles C.L, GoriM.: "Focused Crawling Using Context Graphs", Proceedings VLDB 2000.
  • 5Chakrabarti S., Punera K., Subramanyarn M.:"Accelerated Focused Crawling through Online Relevance Feedback", Proceedings WWW 2002.
  • 6龙银香.移动计算环境下的数据挖掘研究[J].微计算机信息,2005,21(07X):35-38. 被引量:17

二级参考文献4

共引文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部