Abstract
Rather than designing yet another focused crawler, this paper proposes a generic design framework for topic crawling. The framework applies a layered approach, assigning each layer a specific task, together with a container management mechanism, and is intended to standardize and guide the steps of designing and developing focused crawlers. The paper also provides a corresponding evaluation framework comprising several key evaluation parameters. Experimental values of these parameters can be used to gauge a crawler's performance and thereby encourage the design of more efficient crawlers.
Source
《微计算机信息》
Peking University Core Journal (北大核心)
2006, Issue 05X, pp. 172-174 (3 pages)
Control & Automation
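Funding
National Natural Science Foundation of China (No. 60273072)
National High Technology Research and Development Program of China (863 Program) (No. 2002AA423450)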
Keywords
topic crawling
framework
model
performance evaluation