期刊文献+

一种快速的XML文档验证算法 被引量:3

A Fast Algorithm of XML Document Verification
下载PDF
导出
摘要 在用XML(e Xtensible Markup Language)Schema校验来判断XML文档合法性的过程中,目前LIBXML2所采用的逐层遍历校验法对3层及以下嵌套效率低下,对多于3层嵌套的校验法失效。针对这一问题,文中提出一种新算法,可有效避免逐层遍历法验证的缺陷。先计算XML文档中某待验证元素出现的次数,然后把该次数分解为XSD(XML Schema Definition)文档中所定义的该元素允许出现次数区间内整数的线性组合,计算出所有线性组合中系数和的最小值和最大值。若计算出的系数和范围与XSD文档定义的范围有交集,则验证通过,否则不通过。利用所提算法,对多嵌套和max Occurs较大的XML文档,验证效率可提高两个数量级。 When using XML ( eXtensible Markup Language) Schema to verify the validation of XML document, the present traversal loop method used by LIBXML2 has low efficiency for no more than 3-loop,and will be invalid for more than 3-loop nested document. To solve the problem,propose a new verification algorithm and it can overcome the defect of traversal loop method. First,the times of element that will be validated in the XML document is calculated, then this times is divided into the linear combination of the permitted times section that defined in XSD document,next is to calculate the maximum and minimum of summation of the coefficient in all the linear combination. If the range of the coefficient summation can overlap with the range that defined in XSD document,then the validation passes, otherwise falls. Using this method,the validation efficiency can increase two orders for the XML document with multi-loop nested and bigger maxOccurs.
作者 张苗 惠小强
出处 《计算机技术与发展》 2015年第8期123-127,共5页 Computer Technology and Development
基金 国家自然科学基金资助项目(11275099)
关键词 XML文档 复杂类型 LIBXML2 XML Schema校验 XML documents complex type LIBXML2 XML Schema validation
  • 相关文献

参考文献13

二级参考文献100

共引文献144

同被引文献25

  • 1李清泉,谢智颖,左小清,王冲.基于SVG的空间信息描述与可视化表达[J].测绘学报,2005,34(1):58-63. 被引量:32
  • 2孔令波,唐世渭,杨冬青,王腾蛟,高军.XML数据的查询技术[J].软件学报,2007,18(6):1400-1418. 被引量:72
  • 3Posonia A M, Jyothi V L. XML document retrieval by develo- ping an effective indexing technique[ C ]//Proc of sixth inter- national conference on advanced computing. [ s. l. ] : IEEE, 2014:120-123.
  • 4Rekha M, Rani N U. Efficient extraction of frequent elements from XML document using XML tree pattern matching [ J ]. In- ternational Journal of Advance Research in Computer Science and Management Studies ,2014,2( 3 ) :54-59.
  • 5Liao H, Li X, Chen J. An accurate identification of extended XML tree pattern for XQuery language [ J ]. International Jour- nal of Database Theory and Application, 2014,7 ( 5 ) : 211 - 226.
  • 6Piernik M, Brzezinski D, Morzy T. Clustering XML documents by patterns [ J ]. Knowledge and Information Systems, 2016,46 (1) :185-212.
  • 7Piemik M,Brzezinski D,Morzy T,et al. XML clustering: a re- view of structural approaches [ J ]. The Knowledge Engineering Review ,2015,30( 3 ) :297-323.
  • 8Mohan G B, Ravi T, Chandra J L E, et al. Duplicate detection in XML data using extended sub tree similarity function [ J ]. International Journal of Applied Engineering Research,2015, 10 ( 3 ) : 7325 -7334.
  • 9田芳,李亚楼,周孝信,吴中习,张星.电力系统全数字实时仿真装置[J].电网技术,2008,32(22):17-22. 被引量:87
  • 10陈红梅,陈静.XML文档解析技术[J].信息技术,2009,33(10):175-177. 被引量:2

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部