We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format...We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the Web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach—IWDS, and illustrate its use through case examples. Key words integration - heterogeneity - Web data source - XML namespace CLC number TP 311.13 Foundation item: Supported by the National Key Technologies R&D Program of China(2002BA103A04)Biography: WU Wei (1975-), male, Ph.D candidate, research direction: information integration, distribute computing展开更多
Deep web data integration needs to do schema matching on web query interfaces and obtain the mapping table.By introducing semantic conflicts into web query interface integration and discussing the origins and categori...Deep web data integration needs to do schema matching on web query interfaces and obtain the mapping table.By introducing semantic conflicts into web query interface integration and discussing the origins and categories of the semantic conflicts,an ontology-based schema matching method is proposed.The process of the method is explained in detail using the example of web query interface integration in house domain.Conflicts can be detected automatically by checking semantic relevance degree,then the categories of the conflicts are identified and messages are sent to the conflict solver,which eliminates the conflicts and obtains the mapping table using conflict solving rules.The proposed method is simple,easy to implement and can be flexibly reused by extending the ontology to different domains.展开更多
为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数...为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数据源和 XML 文档数据模型之间建立映射,实现了一种虚拟化的数据集成方法。这种数据集成方法简化了 Web 数据集成的实现。最后通过一个实例方案验证了方法的可行性和有效性。展开更多
In order to solve the semantic irreconcilable problems caused by contextual differences during the process of ontology integration, a context-driven reconciliation mechanism is proposed. The mechanism is based on the ...In order to solve the semantic irreconcilable problems caused by contextual differences during the process of ontology integration, a context-driven reconciliation mechanism is proposed. The mechanism is based on the previous work about a context-based formalism-Context-SHOIQ (D + ) DL, which is used for explicitly representing context of ontology by adopting the description logic and the category theory. The formalism is extended by adding four migration rules (InclusionRule, SelectionRule, PreferenceRule, and MappingRule), that are used to specify what should be imported into the IntegrativeContext, and three related contextual integration operations of increasing interoperability (import, partial reconciliation, and full reconciliation). While not exhaustive, the mechanism is sufficient for solving the five types of semantic irreconcilable problems that are discussed, and favors integration of ontologies from one context to another.展开更多
文摘We propose a three-step technique to achieve this purpose. First, we utilize a collection of XML namespaces organized into hierarchical structure as a medium for expressing data semantics. Second, we define the format of resource descriptor for the information source discovery scheme so that we can dynamically register and/or deregister the Web data sources on the fly. Third, we employ an inverted-index mechanism to identify the subset of information sources that are relevant to a particular user query. We describe the design, architecture, and implementation of our approach—IWDS, and illustrate its use through case examples. Key words integration - heterogeneity - Web data source - XML namespace CLC number TP 311.13 Foundation item: Supported by the National Key Technologies R&D Program of China(2002BA103A04)Biography: WU Wei (1975-), male, Ph.D candidate, research direction: information integration, distribute computing
基金The National Natural Science Foundation of China(No.60673130)the Natural Science Foundation of Shandong Province(No.Y2006G29,Y2007G24,Y2007G38)the Encouragement Fund for Young Scholars of Shandong Province(No.2005BS01002)
文摘Deep web data integration needs to do schema matching on web query interfaces and obtain the mapping table.By introducing semantic conflicts into web query interface integration and discussing the origins and categories of the semantic conflicts,an ontology-based schema matching method is proposed.The process of the method is explained in detail using the example of web query interface integration in house domain.Conflicts can be detected automatically by checking semantic relevance degree,then the categories of the conflicts are identified and messages are sent to the conflict solver,which eliminates the conflicts and obtains the mapping table using conflict solving rules.The proposed method is simple,easy to implement and can be flexibly reused by extending the ontology to different domains.
文摘为了实现 Web 内部分布、异构数据之间的互操作和全局操作,必须对不同数据源进行集成。在分析了各集成模式的优缺点之后,提出了一种基于 XML 的虚拟化的 Web 数据集成方法。该方法采用 XML 作为集成数据的公共数据格式,通过在不同的数据源和 XML 文档数据模型之间建立映射,实现了一种虚拟化的数据集成方法。这种数据集成方法简化了 Web 数据集成的实现。最后通过一个实例方案验证了方法的可行性和有效性。
文摘In order to solve the semantic irreconcilable problems caused by contextual differences during the process of ontology integration, a context-driven reconciliation mechanism is proposed. The mechanism is based on the previous work about a context-based formalism-Context-SHOIQ (D + ) DL, which is used for explicitly representing context of ontology by adopting the description logic and the category theory. The formalism is extended by adding four migration rules (InclusionRule, SelectionRule, PreferenceRule, and MappingRule), that are used to specify what should be imported into the IntegrativeContext, and three related contextual integration operations of increasing interoperability (import, partial reconciliation, and full reconciliation). While not exhaustive, the mechanism is sufficient for solving the five types of semantic irreconcilable problems that are discussed, and favors integration of ontologies from one context to another.