期刊文献+

基于返回结果的Deep Web查询接口识别 被引量:1

Recognized Query Interface of Deep Web Based on Response Pages
下载PDF
导出
摘要 互联网上存在许多有价值的信息,搜索引擎只能索引静态页面,无法索引Deep Web数据,而Deep Web通常以表单形式存在,只有提交表单查询才能获得其数据,如何发现和识别Deep Web查询接口成为人们关注的问题。在分析表单表现形式与功能内在的联系的基础上,提出一个表单的抽象模型,依此过滤非Deep Web查询接口的表单。通过对返回结果页面分析方法,实现Deep Web查询接口的识别,实验结果证明了该方法的有效性。 There are many valuable resources on the Intemet, traditional search engines work well for finding the pages which are static and linked to other pages, and ignore the deep web which only produce results dynamically in response to a direct request of the form, so many people pay attention to find and recognize the query interface of deep web. On the deep web,many sources are structured by provid- ing structured query interfaces and results, extract the features of query forms based on the features available on the search interlaces ,and introduce a generic operational model which can support advanced query and analysis, then delete the interfaces which is not the interface of deep web on the model. Through analysing the structure of response pages, recognize the query interface of deep web. The results of experiments validate the feasibility of the approach.
出处 《计算机技术与发展》 2009年第7期117-119,123,共4页 Computer Technology and Development
基金 安徽省自然科学基金项目(070412051)
关键词 DEEP Web 查询接口 post—query 表单 deep web query interface post - query form
  • 相关文献

参考文献8

  • 1Bergman M K. The Deep Web: Surfadng hidden value[EB/ OL]. 2001 - 09 - 24. http://www. brightplanet. com.
  • 2马军,宋玲,韩晓晖,闫泼.基于网页上下文的Deep Web数据库分类[J].软件学报,2008,19(2):267-274. 被引量:31
  • 3Cope J, Nick C, Davia H. Automated Discovery of Search Interfaces on the Web[C]//Conferences in Research and Practice in Information Technology. Australia: Australian Computer Society,2003.
  • 4Lin Peiguang, Xu Ruzhi, Hong Zhimin, et al. Fingding the WDB's Query, Interface in Deep Web Automatically[C]//2008 International Conference on Intemet Gomputing in Science and Engineering. Washington, DC, USA: IEEE Computer Society,2008:195 - 200.
  • 5袁柳,李战怀,陈世亮.基于本体的Deep Web数据标注[J].软件学报,2008,19(2):237-245. 被引量:28
  • 6苏志华,杨冬青,唐世渭,王腾蛟.基于结构分析和实体识别的信息集成[J].计算机研究与发展,2004,41(10):1823-1828. 被引量:5
  • 7Raghavan S, Garcia - Molina H. Crawling the Hidden Web [C]//the International Conference on Very Large Data Bases (VLDB). Rome: Morgan Kaufmann Publishers, 2001:129 - 138.
  • 8郑冬冬,崔志明.Deep Web查询接口选择[J].计算机应用,2006,26(9):2024-2027. 被引量:6

二级参考文献56

  • 1M E Califf, R J Mooney. Relational learning of pattern-match rules for information extraction. In: Proc of the 16th National Conf on Artificial Intelligence and the 11th Conf on Innovative Applications of Artificial Intelligence. Menlo Park, California:AAAI Press/The MIT Press, 1999. 328~334
  • 2D Freitag. Machine learning for information extraction in informal domains. Machine Learning, 2000, 39(2-3): 169~202
  • 3S SoderLan. Learning information extraction rules for semistructured and free text. Machine Learning, 1999, 34(1-3): 233~272
  • 4A Sahuguet, F Azavant. Building intelligent Web applications using lightweight wrappers. Data and Knowledge Engineering,2001, 36(3): 283~316
  • 5Liu L, Pu C, Han W. XWRAP: An XML-enabled wrapper construction system for Web information sources. In: Proc of the 16th Int'l Conf on Data Engineering. Los Alamitos, California:IEEE Computer Society, 2000. 611~621
  • 6R Baumgartner, S Flesca, G Gottlob. Visual Web information extraction with Lixto. In: Proc of the 27th Int'l Conf on Very Large Data Bases. San Francisco: Morgan Kaufmann, 2001. 119~ 128
  • 7V Crescenzi, G Mecca. Grammars have exceptions. Information Systems, 1998, 23(9): 539~565
  • 8B Adelberg. NoDoSE-A tool for semi-automatically extracting structured and semi-structured data from text documents. In: Proc of the 1998 ACM SIGMOD Int'l Conf on Management of Data.New York: ACM Press, 1998. 283~294
  • 9D Bikel, R Schwarta, R Weisehedel. An algorithm that learns what's in a name. Machine Learning, 1997, 34(1-3): 211~231
  • 10D Freitag, A L McCallum. Information extraction using HMMs and shrinkage. In: Proc of the 16th National Conf on Artificial Intelligence. Menlo Park, California: AAAI Press, 1993. 31~36

共引文献65

同被引文献3

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部