期刊文献+

Deep Web查询接口及其识别算法

Deep Web Query Interface and Identification Algorithms
下载PDF
导出
摘要 查询接口是Deep Web的唯一入口,在对后台数据库展开研究时,查询接口的识别凸显重要。该文首先分析了查询接口的结构特点,并总结出一系列可用于进行查询接口识别的启发式规则,并通过概率计算对规则的使用顺序进行了优化,实验证明,具有较好的使用价值。 Query interface is the only entrance of Deep Web, so a study of the background database, the query interface is important for recognition. This paper analyzes the structural characteristics of query interfaces, and summarizes a series of query interface can he used for identification Heuristic rules, and rules through the use of probability, sequence is optimized experimental results show that the use of good value.
作者 王彩霞 高明 WANG Cai-xia, GAO Ming (1.The School of Software Engineering, Tongji University, Shanghai 201804, China; 2.Electronic and Information Engineering Institute, Tong;ji University, Shanghai 201804, China)
出处 《电脑知识与技术》 2011年第8期5422-5424,共3页 Computer Knowledge and Technology
关键词 DEEP WEB 查询接口 表单 正则表达式 Deep Web interface form regular expression
  • 相关文献

参考文献9

  • 1刘伟,孟小峰,孟卫一.Deep Web数据集成研究综述[J].计算机学报,2007,30(9):1475-1489. 被引量:136
  • 2Raghavan S,Garcia-Molina H.Crawling the Hidden Web[C].Proceedings of the 27th International Conference on Very Large DataBases. Roma.2001:129-138.
  • 3Cope J,craswell N.Automated Discovery of Search Interfaces on the Web [C].Proc.of the 4th Australasian Database Conference (ADC2003).2003.
  • 4王辉,刘艳威,左万利.使用分类器自动发现特定领域的深度网入口(英文)[J].软件学报,2008,19(2):246-256. 被引量:14
  • 5Barbosa L,Freire J.Searching for Hidden Web Databases[C].Proc.of the 8th International Workshop on the Web and Databases (WebDB 2005).
  • 6Barbosa L, Freire J.An Adaptive Crawler for Locating Hidden Web Entry Points [C].Proceedings of the 16th international Conference on World Wide Web.New York:ACM,2007:441-450.
  • 7Pang Bo,Lee Lillian,Vaithyananthan S.Thumbs up Sentiment classification using machine Learning techniques [C].Proceeings of the 2002 Conference on Emprical Methods in Natural Language Processing.NJ,UAS:Association for Computational Liguistics Morristown 2002:79-86.
  • 8Julian Palmieri Lage,Ahigran S da Silva,Paulo B Golgher, et,al.Automatic Generation of Agents for Collecting Hidden Web Pages for Data Extraction[J].Data and Knowledge Engineering,2004(2):177-196.
  • 9Jared Cope,Nick Craswell,David Hawking.Automated Discovery of Search Interfaces on the Web [C].Proceedings of the 14th Aus- tralasian Database Conference (ADC2003):Adelaide Australia,2003.

二级参考文献86

  • 1.[EB/OL].http://www.cogsci.Princeton.edu,.
  • 2Fetterly D,Manasse M,Najork M,Wiener J L.A largescale study of the evolution of Web pages//Proceedings of the 12th International World Wide Web Conference.Budapest,2003:669-678
  • 3Chang K C,He B,Li C,Patel M,Zhang Z.Structured databases on the Web:Observations and Implications.SIGMOD Record,2004,33(3):61-70
  • 4Cope J,Craswell N,Hawking D.Automated discovery of search interfaces on the Web//Proceedings of the 14th Australasian Database Conference(ADC 2003).Adelaide,2003:181-189
  • 5Zhang Z,He B,Chang K C.Understanding Web query interfaces:Best-effort parsing with hidden syntax//Proceedings of the 23rd ACM SIGMOD International Conference on Management of Data.Paris,2004:107-118
  • 6Arasu A,Garcia-Molina H.Extracting structured data from Web pages//Proceedings of the 22nd ACM SIGMOD International Conference on Management of Data.San Diego,2003:337-348
  • 7Crescenzi V,Mecca G,Merialdo P.RoadRunner:Towards automatic data extraction from large Web sites//Proceedings of the 27th International Conference on Very Large Data Bases.Italy,2001:109-118
  • 8Wittenburg K,Weitzman L.Visual grammars and incremental parsing for interface languages//Proceedings of the IEEE Symposium on Visual Languages (VL).Skokie,1990:111-118
  • 9He H,Meng W,Yu C T,Wu Z.WISE-integrator:An automatic integrator of Web search interfaces for e-commerce//Proceedings of the 29th International Conference on Very Large Data Bases.Berlin,2003:357-368
  • 10Peng Q,Meng W,He H,Yu C T.WISE-cluster:Clustering e-commerce search engines automatically//Proceedings of the 6th ACM International Workshop on Web Information and Data Management.Washington,2004:104-111

共引文献141

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部