期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Extracting Result Schema Based on Query Instances in the Deep Web 被引量:1
1
作者 NIE Tiezheng YU Ge SHEN Derong KOU Yue LIU Wei 《Wuhan University Journal of Natural Sciences》 CAS 2007年第5期835-839,共5页
Deep Web sources contain a large of high-quality and query-related structured date. One of the challenges in the Deep Web is extracting result schemas of Deep Web sources. To address this challenge, this paper describ... Deep Web sources contain a large of high-quality and query-related structured date. One of the challenges in the Deep Web is extracting result schemas of Deep Web sources. To address this challenge, this paper describes a novel approach that extracts both result data and the result schema of a Web database. The approach first models the query interface of a Deep Web source and fills in it with a specifically query instance. Then the result pages of the Deep Web sources are formatted in the tree structure to retrieve subtrees that contain elements of the query instance, Next, result schema of the Deep Web source is extracted by matching the subtree' nodes with the query instance, in which, a two-phase schema extraction method is adopted for obtaining more accurate result schema. Finally, experiments on real Deep Web sources show the utility of our approach, which provides a high precision and recall. 展开更多
关键词 Deep Web schema extraction result schema query instance
下载PDF
Extracting Local Schema from Semistructured Data Based on Graph-Oriented Semantic Model
2
作者 王腾蛟 唐世渭 +2 位作者 杨冬青 刘云峰 林斌 《Journal of Computer Science & Technology》 SCIE EI CSCD 2001年第6期560-566,共7页
Many modern applications (e-commerce, digital library, etc.) require inte- grated access to various information sources (from traditional RDBMS to semistructured Web repositories). Extracting schema from semistructure... Many modern applications (e-commerce, digital library, etc.) require inte- grated access to various information sources (from traditional RDBMS to semistructured Web repositories). Extracting schema from semistructured data is a prerequisite to integrate hetero- geneous information sources. The traditional method that extracts global schema may require time (and space) to increase exponentially with the number of objects and edges in the source. A new method is presented in this paper, which is about extracting local schema. In this method, the algorithm controls the scale of extracting schema within the 'schema diameter' by examining the semantic distance of the target set and using the Hash class and its path distance operation. This method is very efficient for restraining schema from expanding. The prototype validates the new approach. 展开更多
关键词 information integration data model semistructured data extracting schema
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部