期刊文献+

基于DOM和元数据的Web信息提取 被引量:5

DOM-based and Metadata-based Information Extraction for Web Sources
下载PDF
导出
摘要 以W3C的文档对象模型DOM和元数据为基础,把要提取的信息以DOM层次结构中的路径表达式来表示,通过归纳学习来获得所需信息的路径表达式,从而获得提取信息;元数据在信息提取过程中起到关键作用,它以XML的DTD表示,可以由信息服务商提供,也可以由开发人员给出,适应了信息源不断变化的特点。 Based on DOM and metadata,retrieved information is organized by path expression that complies with DOM.Path expression is gained by inductive learning.Metadata expressed by DTD is a key during the information retrieval.It is provided by information suppliers or developers and adapts to everincreasing scale and diversity of information and application on Internet.
作者 刘政怡
出处 《计算机与现代化》 2003年第10期81-82,94,共3页 Computer and Modernization
关键词 互联网 WEB 信息提取 DOM 元数据 归纳学习 文档对象模型 wrapper DOM metadata information extraction inductive learning
  • 相关文献

参考文献4

  • 1李效东,顾毓清.基于DOM的Web信息提取[J].计算机学报,2002,25(5):526-533. 被引量:101
  • 2朱明,王军,王俊普.基于多层模式的多记录网页信息抽取方法[J].计算机工程,2001,27(9):40-42. 被引量:5
  • 3Yue-Shan Chang, Min-Huang Ho, Wen-Chen Sun, Shyan-Ming Yuan. Supporting unified interface to wrapper generator in integrated information retrieval[J]. Computer Standards & Interfaces,2002,2.
  • 4Yue-Shan Chang, Min-Huang Ho, Shyan-Ming Yuan. A unified interface for integrating information retrieval [ J ]. Computer Standards & Interfaces,2001,6.

二级参考文献20

  • 1黄豫请,软件学报,2000年,11卷,2期,73页
  • 2Embley D W,Data and Knowledge Engineering,1999年
  • 3Florescu D, Levy A Y, Mendelzon A. Database techniques for the World-Wide Web: A Survery. In: ACM The SIGMOD Record, 1998.59-74
  • 4Atzeni P, Mecca G, Merialdo P. To weave the Web. In: Proc the 23rd International Conference on Very Large Data Bases. Athens, Greece, 1997. 206-215
  • 5Pemberton S et al. XHTML 1.0: The extensible hyperText markup language. In: http://www.w3.org/MarkUp/
  • 6Cattell R G G. The Object Database Standard ODMG-93. San Mateo,California: Morgan Kaufmann Publishers,1994
  • 7Mitchell T. Machine Learning. New York: McGraw Hill, 1997
  • 8Wall L et al. Programming Perl(3rd Edition). O'Reilly & Associates,2000
  • 9Birbeck M et al. Professional XML. Wrox Press Inc, 2000
  • 10Liu L, Pu C, Han W. XWRAP: An XML-enabled wrapper construction system for web information sources. In: Proc International Conference on Data Engineering (ICDE), San diego, California, 2000. 611-621

共引文献103

同被引文献45

引证文献5

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部