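With the successive emergence of ontology construction methods and numerous experimental ontologies, research on ontology alignment, which aims at cross-ontology communication and cross-ontology collaboration, has attracted wide attention from the international academic community in recent years. To reuse existing ontologies as far as possible and to address cross-ontology mapping, the core problem of ontology alignment, this paper reviews and summarizes current methods for computing concept similarity in ontology mapping and, on that basis, proposes the "object-attribute similarity based on concept lattice (OASBCL)" method for computing the similarity of concepts in cross-ontology mapping. An example applying the method to cross-ontology mapping demonstrates its effectiveness. The method is then discussed from three aspects: the complementarity between concept lattices and ontologies, the factor indicators of similarity, and the properties of the mapping. The aim is to explore a formal concept-similarity computation method that can support cross-ontology mapping between heterogeneous ontologies.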
To extract structured data from a web page according to customized requirements, a user labels some DOM elements on the page with attribute names. The common features of the labeled elements are used both to guide the user through the labeling process, minimizing user effort, and to retrieve attribute values. To turn the attribute values into a structured result, the attribute pattern must be induced. For this purpose, a space-optimized suffix tree called an attribute tree is built to transform the document object model (DOM) tree into a simpler form while preserving useful properties such as attribute sequence order. The pattern is induced bottom-up on the attribute tree and is then used to build the structured result. Experiments show that the approach performs well in terms of precision, recall, and structural correctness.
Funding: Supported by the National High Technology Research and Development Programme of China (No. 2009AA01 Z141), the National Natural Science Foundation of China (No. 60573117), and the Beijing Natural Science Foundation (No. 4131001).
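The step from labeled attribute values to a structured result, as described in the web data extraction abstract above, can be illustrated with a deliberately simplified sketch. It does not implement the paper's attribute tree or its bottom-up induction. Instead it assumes the labeled values have already been retrieved as a flat, document-ordered list of (attribute name, value) pairs, and it treats the "pattern" as the shortest repeating unit of attribute names. The function names `induce_pattern` and `build_records` are hypothetical and chosen only for this illustration.

```python
from typing import Dict, List, Tuple


def induce_pattern(attrs: List[str]) -> List[str]:
    """Return the shortest prefix whose repetition reproduces attrs.

    Simplifying assumption: records are flat and repeat exactly.
    The attribute tree described in the abstract is not modeled here.
    """
    n = len(attrs)
    for size in range(1, n + 1):
        if n % size == 0 and attrs[:size] * (n // size) == attrs:
            return attrs[:size]
    return attrs


def build_records(pairs: List[Tuple[str, str]]) -> List[Dict[str, str]]:
    """Group (attribute, value) pairs into records using the induced pattern."""
    if not pairs:
        return []
    pattern = induce_pattern([name for name, _ in pairs])
    size = len(pattern)
    # Each consecutive slice of `size` pairs becomes one record.
    return [dict(pairs[i:i + size]) for i in range(0, len(pairs), size)]


if __name__ == "__main__":
    labeled = [
        ("title", "A"), ("price", "1"),
        ("title", "B"), ("price", "2"),
    ]
    print(build_records(labeled))
```

A sequence with no exact repetition falls back to a single record containing every pair, which is one reason a real system would need a richer structure, such as the tree the abstract describes, to handle optional or nested attributes.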