Abstract
With the rapid development of Internet technology, a large amount of structured, high-quality information lies buried in the Web, where traditional search engines cannot retrieve it, which makes it difficult to mine and exploit. To address this problem, the paper proposes an information query system based on the Deep Web (invisible Web). It designs a Deep Web query method and combines it with data mining techniques to acquire and mine Deep Web information resources. The system overcomes the drawbacks of collecting form information by hand, shortens manual query time, reduces cost, and is easy to maintain, providing a platform for automating the extraction of Deep Web information.
Source
《图书馆学研究》
CSSCI
PKU Core Journal (北大核心)
2009, No. 3, pp. 61-63, 23 (4 pages in total)
Research on Library Science