Storing and querying XML (eXtensible Markup Language) data in relational form can exploit various services offered by modern relational database management systems (RDBMSs). Due to structural complexity of XML, there ...Storing and querying XML (eXtensible Markup Language) data in relational form can exploit various services offered by modern relational database management systems (RDBMSs). Due to structural complexity of XML, there are many equivalent relational mapping schemes for the same XML data and queries. In this paper, we propose the adaptive XML to relational mapping (AX2RM) system, which considers finding optimal XML to relational (X2R) mapping as four separate but correlated procedures: logical database design, data scale estimation, workload transformation, and physical database design. We view the whole process as an autonomic computing problem and formalize the adaptive X2R mapping problem. Search spaces for each procedure are investigated individually, and five approaches for finding the optimal mapping are studied. We propose an integrated approach with greedy pruning (IT-GP), which views the mapping procedures as a whole and exploits heuristic rules in each procedure to prune impossible mappings as early as possible. Evaluation of these approaches shows the validity and high efficiency of IT-GP.展开更多
Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challeng...Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challenging task. A multidatabase common data model is firstly introduced based on XML, named XML-based Integration Data Model (XIDM), which is suitable for integrating different types of schemas. Then an approach of schema mappings based on XIDM in multidatabase systems has been presented. The mappings include global mappings, dealing with horizontal and vertical partitioning between global schemas and export schemas, and local mappings, processing the transformation between export schemas and local schemas. Finally, the illustration and implementation of schema mappings in a multidatabase prototype - Panorama system are also discussed. The implementation results demonstrate that the XIDM is an efficient model for managing multiple heterogeneous data sources and the approaches of schema mapping based on XIDM behave very well when integrating relational, object-oriented database systems and other file systems.展开更多
XML已成为互联网事实上的数据表示标准,在数据交换和数据仓库中广泛应用。但XML文件特征不能保障数据的安全性和并发访问,而RDBMS严谨的关系理论和技术的成熟性可以弥补XML技术的不足。结合XML与RDBMS的优点,提出一个基于RDBMS的XML数...XML已成为互联网事实上的数据表示标准,在数据交换和数据仓库中广泛应用。但XML文件特征不能保障数据的安全性和并发访问,而RDBMS严谨的关系理论和技术的成熟性可以弥补XML技术的不足。结合XML与RDBMS的优点,提出一个基于RDBMS的XML数据存取方案,用于简化XML数据的管理和数据仓库的构建。利用Oracle XML DB技术实现XML在关系数据库中存储、更新和检索操作,使用户能透明地通过RDBMS来管理XML数据,相对于映射策略的数据转储方式,明显提高了XML数据存储效率。展开更多
基金the National Natural Science Foundation of China (No. 60603044)the China Postdoctoral Science Foundation (No. 20070411179)the Program for Changjiang Scholars and Innovative Research Team in University of China (No. IRT0652)
文摘Storing and querying XML (eXtensible Markup Language) data in relational form can exploit various services offered by modern relational database management systems (RDBMSs). Due to structural complexity of XML, there are many equivalent relational mapping schemes for the same XML data and queries. In this paper, we propose the adaptive XML to relational mapping (AX2RM) system, which considers finding optimal XML to relational (X2R) mapping as four separate but correlated procedures: logical database design, data scale estimation, workload transformation, and physical database design. We view the whole process as an autonomic computing problem and formalize the adaptive X2R mapping problem. Search spaces for each procedure are investigated individually, and five approaches for finding the optimal mapping are studied. We propose an integrated approach with greedy pruning (IT-GP), which views the mapping procedures as a whole and exploits heuristic rules in each procedure to prune impossible mappings as early as possible. Evaluation of these approaches shows the validity and high efficiency of IT-GP.
文摘Multidatabase systems are designed to achieve schema integration and data interoperation among distributed and heterogeneous database systems. But data model heterogeneity and schema heterogeneity make this a challenging task. A multidatabase common data model is firstly introduced based on XML, named XML-based Integration Data Model (XIDM), which is suitable for integrating different types of schemas. Then an approach of schema mappings based on XIDM in multidatabase systems has been presented. The mappings include global mappings, dealing with horizontal and vertical partitioning between global schemas and export schemas, and local mappings, processing the transformation between export schemas and local schemas. Finally, the illustration and implementation of schema mappings in a multidatabase prototype - Panorama system are also discussed. The implementation results demonstrate that the XIDM is an efficient model for managing multiple heterogeneous data sources and the approaches of schema mapping based on XIDM behave very well when integrating relational, object-oriented database systems and other file systems.
文摘XML已成为互联网事实上的数据表示标准,在数据交换和数据仓库中广泛应用。但XML文件特征不能保障数据的安全性和并发访问,而RDBMS严谨的关系理论和技术的成熟性可以弥补XML技术的不足。结合XML与RDBMS的优点,提出一个基于RDBMS的XML数据存取方案,用于简化XML数据的管理和数据仓库的构建。利用Oracle XML DB技术实现XML在关系数据库中存储、更新和检索操作,使用户能透明地通过RDBMS来管理XML数据,相对于映射策略的数据转储方式,明显提高了XML数据存储效率。