期刊文献+

基于特征路径的XML文档变化检测算法 被引量:3

A Key-Path-Based Change Detection Algorithm for XML Documents
下载PDF
导出
摘要 由于在线信息变化频繁 ,XML文档变化快速检测成为Internet查询系统、搜索引擎以及连续查询系统的关键技术 目前国际上的研究主要集中于有序模式的XML文档比较 ,针对有序模式最好的算法复杂度为O(nlogn) ,其中n为文档的长度 ,而针对无序模式为多项式时间复杂度 为提高处理效率 ,提出一种基于特征路径的变化检测算法 ,将传统标号树匹配问题转换为基于特征路径的无重复路径标号树的匹配问题 ,同时适于有序和无序两种模式 ,复杂度为O(n) ,其中n为文档结点的个数 实验证明KF Diff Since online information changes frequently, being able to quickly detect changes in XML documents is important to Internet query systems, search engines, and continuous query systems Most previous work in change detection on XML or other hierarchically structured data uses the ordered tree model, with the best complexity of O(n log n ), where n is the size of the document The best algorithm for unordered mode achieves polynomial time in complexity In this paper, a Key path based change detection algorithm named KF Diff+is represented, which transforms the traditional tree to tree correction into the comparing of the Key trees which are substantially label trees without duplicate paths, with the complexity of O(n) , where n is the total number of nodes in the documents In addition, KF Diff+is tailored to both ordered trees and unordered trees, which is also a significant difference compared with the previous work Experiments show that KF Diff+can handle XML documents at extreme speed
出处 《计算机研究与发展》 EI CSCD 北大核心 2003年第9期1375-1381,共7页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目 ( 2 0 0 2AA1160 40 )
关键词 XML DELTA KEY 算法 XML delta Key algorithm
  • 相关文献

参考文献9

  • 1World Wide Consortium. Extensible markup language (xml) 1.0.2000. htrp://www, w3. org/TR/REC-xml.
  • 2S Chawathe, A Rajarmmn, H Garcia-Molina. Change detection in hierarchically structured information. The ACM Int'l Conf on Management of Data (SIGMOD), Montreal, Quebec, Canada,1996.
  • 3E Berk. HtmlDiff: A differencing tool for HTML documents. Princeton University, New Jersey, 1996. http://www. htmldiff.com.
  • 4F P Curbera, D A Epstein. Fast difference and update of XML documents XTech'99, San Jose, 1999. http://www, beust.com/cedric/xtech99, html.
  • 5H Maruyama, K Tamura, R Uramoto. Digest values for DOM (DOMHash) proposal. IBM Tokyo Research Laboratory, Tokyo,1998. http://www. trl. ibm. co. jp/projects/xml/domhash. htm.
  • 6K Zhang, D Shasha. Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal of Computing, 1989, 18(6): 1245-1262.
  • 7G Cobéna, S Abiteboul, A Marian. Detecting changes in XML documents. In: The Int'l Conf on Data Engineering ( ICDE). San Jose, California: IEEE Computer Society, 2002.
  • 8Y Wang, D J DeWitt, J Y Cal. X-Diff: A fast change detection algorithm for XML documents. University of Wisconsin,Wisconsin.2002.http://www. drirkme. com/library/uci/ics215/xml/xdiff. pdf.
  • 9W Fan, P Sehwenzer, K Wu. Keys with upward wildcards for XML. In: Database and Expert Systems Applications. Lecture Notes in Computer Science 2113. Munich, Germany: Springer,2001. 657~667.

同被引文献3

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部