期刊文献+

基于结果模式的Deep Web数据标注方法 被引量:2

Deep Web data annotation method based on result schema
下载PDF
导出
摘要 全面准确地标注Deep Web查询结果是Deep Web数据集成的关键问题,但现有的Web数据库标注方法还不能较好地解决该问题,为此提出一种基于结果模式的Deep Web数据标注方法。首先通过结果页面解析和抽取结构化数据来完成数据预处理的工作,并在集成结果模式和待标注数据之间建立正确的语义映射,进而确定DeepWeb数据的标注信息。通过对4个领域Web数据库进行实验测试,结果表明所提方法能有效地标注Deep Web查询结果数据。 Comprehensive and accurate annotation of Deep Web data is the key technology to Deep Web data integration,but the existing methods of Deep Web data annotation are unavailable to effectively solve the problem.Therefore,an approach of Deep Web data annotation based on result schema was proposed.The paper,through analyzing Deep Web result pages and extracting structured data,completed data pretreatment work,then though establishing the correct semantic mapping relation between integrated result schema and staying annotation data,achieved correct annotation of Deep Web data.The experimental results over four real areas show that the proposed method can efficiently annotate Deep Web data.
作者 李明 李秀兰
出处 《计算机应用》 CSCD 北大核心 2011年第7期1733-1736,共4页 journal of Computer Applications
基金 甘肃省自然科学基金资助项目(0809RJZA018)
关键词 DEEP WEB 结果模式 数据标注 数据抽取 Deep Web result schema data annotation data extraction
  • 相关文献

参考文献10

二级参考文献99

  • 1孔令波,唐世渭,杨冬青,王腾蛟,高军.XML数据的查询技术[J].软件学报,2007,18(6):1400-1418. 被引量:72
  • 2CHANG K C , HE B , LI C , et al . Structured databases on the Web: Observations and implications[ J]. ACM SIGMOD Record, 2004, 33 (3):61 -70.
  • 3HE HAI, MENG W Y, LU Y Y, et al. Towards deeper understanding of the search interfaces of the deep Web[ J]. World Wide Web, 2007, 10(2) : 133 - 155.
  • 4CRESCENZI V, MECCA G, MERIALDO P. Roadrunner: Towards automatic data extraction from large Web sites[ EB/OL]. [ 2008 - 05 -05]. http://www, dia. uniroma3, it/- vldbproc/015_109, pdf.
  • 5WANG J, LOCHOVSKY F H. Data extraction and label assignment for Web databases[ C]//Proceedings of the 12th international conference on World Wide Web. New York: ACM Press, 2003:187 - 196.
  • 6ZHAO H, MENG W Y, WU Z, et al. Fully automatic wrapper generation for search engines [ EB/OL]. [ 2008 - 05 - 05 ]. http:// www. www2005, org/edrom/docs/p66, pdf.
  • 7ARLOTTA L, CRESCENZI V, MECCA G, et al. Automatic annotation of data extracted from large Web sites[ EB/OL]. [2008 -05 - 05]. http://www, cse. ogi. edu/webdb03/papers/02, pdf.
  • 8LU Y Y, HE H, ZHAO H K, et al. Annotating structured data of the deep Web [ C]//ICDE 2007: IEEE 23rd International Conference on Data Engineering. [ S.l. ] : IEEE Press, 2007: 376?385.
  • 9WU W, DOAN A, YU C T. WeblQ: Learning from the Web to match Deep-Web query interfaces[ EB/OL]. [2008 -05 -05]. http://www, dit. unitn, it/- p2p/RelatedWork/Matching/icde06- webiq, pdf.
  • 10ARASU A, GARCIA-MOLINA H. Extracting structured data from Web pages[ C]. ICDE '03: 19th International Conference on Data Engineering. [ S. l ] : IEEE Press, 2003:337 -348.

共引文献174

同被引文献10

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部