期刊文献+

基于规则集的Deep Web信息检索

Rules-based Deep Web Information Retrieval
下载PDF
导出
摘要 提出一种基于规则集的新型Deep Web信息检索模型。该模型包含4个层次,主要处理环节如任务分派、信息提取、数据清洗等引入了Deep Web特有的结构规则、逻辑规则和应用规则协助工作。把该模型应用于科技文献检索、电子机票定购和工作简历搜索3个领域,实验结果证明该模型灵活、可信,有效信息查全率达到96%以上。 This paper proposes a novel rules-based model to extract data from Deep Web pages. The model comprises four layers, main processing parts as task allocation, information extraction, data cleaning which work based on the rules of structure, logic and application. It applies the new model to three intelligent system, scientific paper retrieval, electronic ticket ordering and resume searching. Experimental results show that the proposed method is robust and feasible.
出处 《计算机工程》 CAS CSCD 北大核心 2008年第13期51-53,共3页 Computer Engineering
基金 天津市自然科学基金资助项目(05YFJMJC01500)
关键词 信息检索 深层网络 规则集 数据提取 information retrieval Deep Web rules set data extraction
  • 相关文献

参考文献6

  • 1Chang K C C, He Bin, Li Changkai, et al. Structured Databases on the Web: Observations and Implications[J]. SIGMOD Record, 2004, 33(3): 61-70.
  • 2Cope J, Craswell N, Hawking D. Automated Discovery of Search Interfaces on the Web[C]//Proceedings of the 14th Australasian Database Conference. Adelaide, Australia: [s. n.], 2003:181-189.
  • 3Arasu A, Garcia-Molina H. Extracting Structured Data from Web Pages[C]//Proc. of the 22nd International Conf. on Management of Data. San Diego, California, USA: [s. n.], 2003: 337-348.
  • 4Liu Ling, Pu Calton, Han Wei. XWRAP: An XML-enabled Wrapper Construction System for Web Information Sources[C]//Proceedings of the 16th International Conference on Data Engineering. San Diego, CA, USA: [s. n.], 2000: 611-621.
  • 5Crescenzi V, Mecca G; Merialdo E Road Runner: Towards Automatic Data Extraction from Large Web Sites[C]//Proceedings of the 27th International Conference on Very Large Data Bases. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2001: 109-118.
  • 6Song Hui, Gift S, Ma Fanyuan. Data Extraction and Annotation for Dynamic Web Pages[C]//Proceedings of the 2004 IEEE International Conference on E-technology, E-commerce and E-service. Taipei, Taiwan, China: [s. n.], 2004: 499-502.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部