期刊文献+

基于节点相对路径的XML模式抽取算法

A Method for XML Schema Extraction Based on Node Relative Path
下载PDF
导出
摘要 结合XML文档树结构提出了一种基于节点相对路径的模式抽取算法,通过使用SAX解析器对XML文档进行一遍扫描,提取出XML文档节点及其相对路径来实现XML文档模式的抽取.该算法有效地解决了XML文档中存在的环路及缺边问题,计算结果模式的代价较低,效率较高. Schema extracting for XML is broadly used in the field of data storing, query optimization and heterogeneous data integration. This paper presents a method based on node relative path for extracting XML Schema. The new approach finishes extracting schema by scanning XML document once- over and extracting nodes and relative paths, which can help to overcome the defects of the schema extracting including circle and deficit. This approach requires lower cost and more effectiveness for the final schema.
作者 孙霞 程宏斌
出处 《湖州师范学院学报》 2009年第1期76-80,共5页 Journal of Huzhou University
关键词 模式抽取 XML SAX 相对路径 schema extracting XML SAX relative path
  • 相关文献

参考文献5

  • 1CHANG C H,LUI S C,WU Y C. Applying pattern mining to Web information extraction[A]. In Proceedings of the Fifth Pacific Asia Conference on Knowledge Discovery and Data Mining [C]. Hong Kong,2001:3.
  • 2HAN J W,PEI J, YIN Y W. Mining frequent patterns without candidate generation [C]. The 2000 ACM - SIGMOD International Conference Management of Data ( SIGMOD ' 00). Dallas, TX, 2000 (5) : 1 - 12.
  • 3SODERLAND S. Learning Information Extraction Rules for Semi - Structured and Free Text [J].Machine Learning, 1999,34:133.
  • 4MIN J K,AHN J Y,CHUNG C W. Efficient Extraction of Schemas for XML Documents [J].Information Processing Letters, 2003,85(1) :7.
  • 5HEGEWALD J,NAUMANN F,WEIS M. XStruct:Efficient Schema Extraction from Multiple and Large XML Documents[C]. Proceedings of the 22nd International Conference on Data Engineering Workshops. Atlanta, GA, USA:[s. n. ] ,2006 :81.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部