期刊文献+

XM L文档结构相似测度研究 被引量:4

Research on Evaluating Structural Similarity between XML Documents
下载PDF
导出
摘要 为了满足基于W eb的XML数据信息的近似搜索、信息分类以及数据交换的需求,提出一种新的有效地鉴定XML文档间结构相似度的标准。该标准包含了XML文档的结构信息和节点嵌套的语义信息,可以有效地给出XML文档间的结构相似测度。通过实验证明该标准具有高度的准确性和有效性。 For sake of increasing requirement about approximate search, data cluster and data exchange from XML documents in the Web, a new effective metric for evaluating structural similarity between XML documents is brought forward. It' s accurateness and effectiveness are testified by the experiments.
作者 闫利国 贺飞
出处 《计算机应用研究》 CSCD 北大核心 2006年第3期44-46,共3页 Application Research of Computers
基金 国家"863"计划资助项目(2003AA4Z3210 2003AA413031)
关键词 可扩展标记语言 结构相似测度 编辑距离 XML Structural Similarity Edit Distance
  • 相关文献

参考文献8

  • 1M N Garofalakis,A Gionis,R Rastogi,et al.XTRACT:A System for Extracting Document Type Descriptors from XML Documents[C].Dallas,Texas:Proceedings of ACM SIGMOD Conference on Management of Data,2000.165-176.
  • 2S Flesca,G Manco,E Masciari,et al.Detecting Structural Similarities between XML Documents[C].Proceedings of the 5th Internatio-nal Workshop on the Web and Databases,WebDB,2002.
  • 3Wang Lian,David Wai-Lok Cheung,Nikos Mamoulis,et al.An Efficient and Scalable Algorithm for Clustering XML Documents by Structure[J].IEEE Transactions on Knowledge and Data Engineering,2004,16(1):82-96.
  • 4K Zhang,R Stgatman,D Shasha.Simple Fast Algorithm for the Editing Distance Between Trees and Related Problems[J].SIAM Journal on Computing,1989,18(6):1245-1262.
  • 5A Nierman,H V Jagadish.Evaluating Structural Similarity in XML Documents[C].Madison,Wisconsin,USA:Proceedings of the 5th International Workshop on the Web and Databases,WebDB,2002.
  • 6.[EB/OL].http://wwws.sun.com/software/xml[EB/OL],2004-09-20.
  • 7Alfred V Aho,Ravi Sethi,Jeffrey D Ullman.Compilers:Principles,Techniques,and Tools[M].Publisher:Addison-Wesley,Hardco-ver,1986.796.
  • 8郑仕辉,周傲英,张龙.XML文档的相似测度和结构索引研究[J].计算机学报,2003,26(9):1116-1122. 被引量:28

二级参考文献15

  • 1XQuery: A query language for XML. W3C Working Draft 15February 2001, available: http://www. w3. org/TR/xquery/.
  • 2Tarjan. Three partition refinement algorithms. SIAM Journalon Computing, 1987, 16(6): 973-989.
  • 3Henzinger M R, Henzinger T A, Kopke P W. Computing sim-ulations on finite and infinite graphs. In: Proceedings of the36th Annual IEEE Symposium on Foundations of ComputerScience, Milwaukee, Wisconsin, 1995. 453-462.
  • 4Marian A, Abiteboul S, Cobena G, Mignet L. Change-centricmanagement of versions in an XML warehouse. In: Proceed-ings of the 27th International Conference on Very Large DataBases, Roma, Italy,2001. 581-590.
  • 5Goldman R, Widom J. Summarizing and searching sequential semistructured sources. Stanford University: Technical ReportTR20000312, 2000.
  • 6Zheng Shi-Hui, Zhou Ao-Ying et al. Structure-based approximate searching in XML data. Fudan University: Technical Report TR20010203,2001.
  • 7Wang J T-L, Shasha D etal. Structural matching and discovery in document databases. Sigmod Record, 1997, 26(2): 560-564.
  • 8Zhang K. A constrained editing distance between unordered labeled trees. Journal of Algorithmica, 1996, 15(3): 205-222.
  • 9Zhang K, Shasha D. On the editing distance between unordered labeled trees. Information Processing Letters, 1992, 42(3): 133-139.
  • 10Wang J T-L, Zhang K etal. Exact and approximate algorithmsfor unordered tree matching. IEEE Transactions on Systems,Man and Cybernetics, 1994, 24(4): 668-678.

共引文献27

同被引文献20

  • 1周凯波,魏莹,冯珊.基于案例推理的金融危机预警支持系统[J].计算机工程与应用,2001,37(14):18-21. 被引量:23
  • 2路云,吴应宇,达庆利.基于案例推理技术的企业经营决策支持模型设计[J].中国管理科学,2005,13(2):81-87. 被引量:17
  • 3薛为民,陆玉昌.文本挖掘技术研究[J].北京联合大学学报,2005,19(4):59-63. 被引量:63
  • 4潘有能.XML文档自动聚类研究[J].情报学报,2006,25(2):215-220. 被引量:16
  • 5SHIU S C K, PAL S K. Case-based reasoning: concepts, features and soft computing [J]. Applied Intelligence, 2004, 21(3) : 233-238.
  • 6LIAN W, CHEUNG D W L, MAMOULIS N, et al. An efficient and scalable algorithm for clustering XML documents by structure [J]. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(1) :82-96.
  • 7Yan X, Han J. gSpan: Graph-based substructure pattem mining [C].IEEE ICDM,2002:45-49.
  • 8Elisa B,Giovanna G, Macro M.A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications [J]. Information Systems, 2004,29: 23-46.
  • 9Lee M, Yang L, Hsu W, et al.XClust:clustering XML schemas for effective integration[C].CIKM'02,2002:292-299.
  • 10Jongik Kim, Hyoung-Joo kim. A partition index for XML and semi-structured data.[J].Data & Knowledge Engineering, 2004, 51:349-368.

引证文献4

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部