The Extensible Markup Language(XML)files,widely used for storing and exchanging information on the web require efficient parsing mechanisms to improve the performance of the applications.With the existing Document Obj...The Extensible Markup Language(XML)files,widely used for storing and exchanging information on the web require efficient parsing mechanisms to improve the performance of the applications.With the existing Document Object Model(DOM)based parsing,the performance degrades due to sequential processing and large memory requirements,thereby requiring an efficient XML parser to mitigate these issues.In this paper,we propose a Parallel XML Tree Generator(PXTG)algorithm for accelerating the parsing of XML files and a Regression-based XML Parsing Framework(RXPF)that analyzes and predicts performance through profiling,regression,and code generation for efficient parsing.The PXTG algorithm is based on dividing the XML file into n parts and producing n trees in parallel.The profiling phase of the RXPF framework produces a dataset by measuring the performance of various parsing models including StAX,SAX,DOM,JDOM,and PXTG on different cores by using multiple file sizes.The regression phase produces the prediction model,based on which the final code for efficient parsing of XML files is produced through the code generation phase.The RXPF framework has shown a significant improvement in performance varying from 9.54%to 32.34%over other existing models used for parsing XML files.展开更多
Excel操作题自动阅卷系统的设计往往采用RTF、VBA方式,但效果并不理想,针对这种情况提出了一种基于Office Open XML格式和Python语言,使用开源库Openpyxl、Element Tree及自编类解析Excel文件的操作题自动阅卷评分系统的方法,总结了Offi...Excel操作题自动阅卷系统的设计往往采用RTF、VBA方式,但效果并不理想,针对这种情况提出了一种基于Office Open XML格式和Python语言,使用开源库Openpyxl、Element Tree及自编类解析Excel文件的操作题自动阅卷评分系统的方法,总结了Office Open XML格式的Spread sheet ML标记语言的常用标记及作用,其次设计了自动阅卷流程,细化为格式解析、格式保存和格式对比量化评分三个阶段,并将格式解析分解为Openpyx l解析、Element Tree解析两个步骤,解决了解析中容错的自定义问题。经过测试,使用该方法能可靠地实现Excel 2010版本操作题的自动阅卷。展开更多
可追溯体系作为控制农产品质量安全的有效方法日益受到世界各国重视,供应链各环节与溯源中心间的异构数据交换是建立以政府监管为中心的农产品溯源系统急需解决的问题。XML(eXtensible Markup Language,可扩展标记语言)是先进的网络语...可追溯体系作为控制农产品质量安全的有效方法日益受到世界各国重视,供应链各环节与溯源中心间的异构数据交换是建立以政府监管为中心的农产品溯源系统急需解决的问题。XML(eXtensible Markup Language,可扩展标记语言)是先进的网络语言标准和网络信息共享的重要工具,采用可扩展标记可描述各种数据结构。该文在分析供应链数据流基础上,总结蔬菜溯源数据模型;以实现数据无缝交换为目标,基于XML构建了VTML(Markup Language for VegetableTraceability,蔬菜溯源信息描述语言),并设计了VTML Schema;应用VTML于蔬菜溯源系统数据交换。结果表明,采用ADO.NET开发数据交换接口,以VTML语言为基础的数据交换模式,通过VTML文档和关系型数据库数据间的映射,屏蔽了数据源间的差异。应用VTML是实现蔬菜溯源数据交换的有效手段。展开更多
XML 灵活的数据模式和开放的数据访问可为产品信息集成提供强大支持 .文中在提出基于 XML 进行产品信息集成的系统结构的基础上 ,针对其中的关键技术—— XML 的数据模式与数据库的数据模式的映射 ,结合轿车产品对象建立了结构映射和完...XML 灵活的数据模式和开放的数据访问可为产品信息集成提供强大支持 .文中在提出基于 XML 进行产品信息集成的系统结构的基础上 ,针对其中的关键技术—— XML 的数据模式与数据库的数据模式的映射 ,结合轿车产品对象建立了结构映射和完整性约束映射 .展开更多
XML is a promising technology developed in recent years. Due to its superiority in extensibility and flexibility,XML has become the language over the internet. With more and more XML documents produced,the problem exi...XML is a promising technology developed in recent years. Due to its superiority in extensibility and flexibility,XML has become the language over the internet. With more and more XML documents produced,the problem exists to transform them to other documents of various structures. This paper discusses and compares four methods to transform XML documents,then introduces XSLT,a W3C recommendation, by giving examples and outlining a project in brief.展开更多
文摘The Extensible Markup Language(XML)files,widely used for storing and exchanging information on the web require efficient parsing mechanisms to improve the performance of the applications.With the existing Document Object Model(DOM)based parsing,the performance degrades due to sequential processing and large memory requirements,thereby requiring an efficient XML parser to mitigate these issues.In this paper,we propose a Parallel XML Tree Generator(PXTG)algorithm for accelerating the parsing of XML files and a Regression-based XML Parsing Framework(RXPF)that analyzes and predicts performance through profiling,regression,and code generation for efficient parsing.The PXTG algorithm is based on dividing the XML file into n parts and producing n trees in parallel.The profiling phase of the RXPF framework produces a dataset by measuring the performance of various parsing models including StAX,SAX,DOM,JDOM,and PXTG on different cores by using multiple file sizes.The regression phase produces the prediction model,based on which the final code for efficient parsing of XML files is produced through the code generation phase.The RXPF framework has shown a significant improvement in performance varying from 9.54%to 32.34%over other existing models used for parsing XML files.
文摘可追溯体系作为控制农产品质量安全的有效方法日益受到世界各国重视,供应链各环节与溯源中心间的异构数据交换是建立以政府监管为中心的农产品溯源系统急需解决的问题。XML(eXtensible Markup Language,可扩展标记语言)是先进的网络语言标准和网络信息共享的重要工具,采用可扩展标记可描述各种数据结构。该文在分析供应链数据流基础上,总结蔬菜溯源数据模型;以实现数据无缝交换为目标,基于XML构建了VTML(Markup Language for VegetableTraceability,蔬菜溯源信息描述语言),并设计了VTML Schema;应用VTML于蔬菜溯源系统数据交换。结果表明,采用ADO.NET开发数据交换接口,以VTML语言为基础的数据交换模式,通过VTML文档和关系型数据库数据间的映射,屏蔽了数据源间的差异。应用VTML是实现蔬菜溯源数据交换的有效手段。
文摘XML is a promising technology developed in recent years. Due to its superiority in extensibility and flexibility,XML has become the language over the internet. With more and more XML documents produced,the problem exists to transform them to other documents of various structures. This paper discusses and compares four methods to transform XML documents,then introduces XSLT,a W3C recommendation, by giving examples and outlining a project in brief.