Research on Web-Based Information Extraction Methods

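Abstract: This paper combines a DOM-based approach for constructing the model with XML for building precise document information, addressing the difficulty of extracting dynamic information from semi-structured web pages. It proposes a new sample-based information retrieval method that integrates the extracted information into a new data model, improving the efficiency and accuracy of web information extraction.
Author: 王毅
Source: 《科技与生活》, 2010, No. 13, p. 11 (1 page)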
