期刊文献+

Deep Web信息按需集成研究综述 被引量:2

Research Survey on the Requirement-Oriented Integration of Deep Web Information
原文传递
导出
摘要 Deep Web信息按需集成研究的是如何根据用户的需求从海量的Web数据库中获取所需的信息.本文在对近10年来该领域研究进行综述的基础上,提出Deep Web信息按需集成框架,它包括Deep Web搜索引擎、接口集成、数据源描述、查询执行等4个方面内容.同时,围绕用户的个性化、查询结果的用户满意度和查询效率等评价指标,对Deep Web按需集成的未来发展方向进行展望. Requirement-oriented integration of Deep Web information refers to acquiring meaningful information from tremendous Web databases according to users; needs. On the basis of the research survey in the past 10 years, a new framework for the requirement-oriented integration of Deep Web information is proposed, which is composed of four components: search engine for Deep Web, interface integration, source description and query execution. According to this framework, the research work in the last decades classified and discussed in-depth. The key evaluation factors of the requirement oriented integration for Deep Web information are concluded, namely, personalization, user's satisfaction and query efficiency. At last, the suggestions for future research directions are pointed out.
出处 《武汉大学学报(理学版)》 CAS CSCD 北大核心 2009年第4期465-472,共8页 Journal of Wuhan University:Natural Science Edition
基金 国家重点基础研究发展规划(973)项目(2007CB310806) 国家自然科技资源平台项目(2005DKA21208-11)
关键词 搜索引擎 接口集成 个性化 用户满意度 search engine interface integration personalization user' s satisfaction
  • 相关文献

参考文献50

  • 1BrightPlanet. com. The Deep Web: Surfacing Hidden Value[ EB/OL]. [2008-05-28]. http://brightplanet. com/resources/details/deepweb, html.
  • 2Chang K C,He B,I.i C,et al. Structured Databases on the Web: Observations and Implications [ C/OL][2008-05-28]. http://eagle, cs. uiuc. edu/ pubs/2004/ dwsurvey-siKmodrecord-chl pz-aug04, pd f .
  • 3Ortega-Binderberger M. Integrating Similarity Based Retrieval and Query Refinement in Databases [D]. Urbana-Champaign : UIUC, 2002.
  • 4Motto A. Vague: A User Interface to Relational Databases That'Permits Vague Queries[J]. ACM Transactions on Office Information Systems, 1998,6(3): 187-214.
  • 5Nambiar U, Kambhampati S. Answering Imprecise Queries over Web Databases[C]//Proceedings of the 31st VLDBConference. New York.. ACM Press, 2005 : 1350-1353.
  • 6Nambiar U, Kambhampati S. Answering Imprecise Queries over Autonomous Web Databases [C/OL]. [2008- 05-28]. http://rakaposhi, eas. asu. edu/ICDEO6-cmrdy. pdf .
  • 7He B, Patel M, Zhang Z, el al. Accessing the Deep Web: A Survey [J]. Communications of the ACM. ( CACM), 2007,50(5) : 94-101.
  • 8Raghavan S, Molina H G. Crawling the Hidden Web [C/OL]. [2008-05-28]. http://www, vldb. org/conf/ 2001/P129. pdf.
  • 9Cope J ,Craswell N, Hawking D. Automated Discovery of Search Interfaces on the Web[C/OL]. [2008-05- 28]. http://crpit, com/confpapers/CRPITV17Cope. pdf.
  • 10Barbosa I.,Freire J. Searching for Hidden-Web Databases[C/OL]. [2008-05-28]. http ://webdb2005. uhas- selt. be/webdb05_eproceedings, pdf .

二级参考文献63

  • 1Gravano L, Garcia-Molina H, Tomasic A. Gloss: Textsource discovery over the Intemet. ACM Trans. on Database Systems, 1999, 24(2):229-246.
  • 2Yi L, Liu B. Web page cleaning for Web mining through feature weighting. In: Cohn AG, ed. Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence (IJCAI 2003). Acapulco: Kluwier Academic Publisher, 2003.64-75.
  • 3Bergholz A, Chidlovskii B. Crawling for domain-specific hidden Web resources. In: Spaccapietra S, ed. Proc. of the 4th Int'l Conf. on Web Information Systems Engineering. Rome: IEEE Computer Society, 2003. 125-133.
  • 4Barbosa L, Freire J, Silva A. Organizing hidden-Web databases by clustering visible Web documents. In: Doqac A, ed. Proc. of IEEE the 23rd Int'l Conf. on Data Engineering. Istanbul: IEEE Computer Society, 2007. 326-335.
  • 5Gravano L, Ipeirotis PG, Sahami M. QProber: A system for automatic classification of hidden-Web databases. ACM TOIS, 2003, 21(1):1-41.
  • 6He B, Tao T, Chang KCC. Organizing structured Web sources by query schemas: A clustering approach. In: Oravano L, ed. Proc. of ACM the 13th Conf. on Information and Knowlege Management. Washington: ACM Press, 2004.22-31.
  • 7Baeza-Yates R, Ribeiro-Neto B. Modem Information Retrieval. Boston: Addison Wesley, 1999. 27-30.
  • 8The UIUC Web integration repository. 2007. http://metaquerier.cs.uiuc.edu/repository/datasets/tel-8/index.html
  • 9Thomopolos S, Buche P, Haemmerle O. Fuzzy sets defined on a hierarchical domain. IEEE Trans. on Knowledge and Data Engineering, 2006,16(10): 1395-1409.
  • 10Wang J, Loehovsky F. Data-Rich section extraction from HTML pages. In: Cham TS, ed. Proc. of the 3rd Int'l Conf. on Web Information Systems Engineering. Singapore: IEEE Computer Society Press, 2002. 1-10.

共引文献70

同被引文献25

  • 1马翠嫦.国外数字图书馆可用性评价研究综述[J].现代图书情报技术,2007(2):1-6. 被引量:31
  • 2李虹.面向用户的数字图书馆信息服务模式研究[J].情报杂志,2007,26(8):134-136. 被引量:22
  • 3Madhavan J,Jeffery S,Cohen S,et al.Web-scale dataintegration:You can only afford to pay as you go[C]//Proceedings of the 5th Biennial Conference on Inno-vative Data Systems Research(CIDR).Los Alami-tos:IEEE Computer Society Press,2007:342-350.
  • 4Elmagarmid A K,Ipeirotis P G,Verykios V S.Dupli-cate record detection:A survey[J].IEEE Transac-tions on Knowledge and Data Engineering,2007,19(1):1-16.
  • 5Winkler W E.Methods for record linkage and Bayes-ian networks[C/OL].[2010-12-20].http://www.amstat.org/Sections/Srms/Proceedings/y2002/Files/JSM2002-000648.pdf.
  • 6Verykios V S,Moustakides G V,Elfeky M G.ABayesian decision model for cost optimal record matc-hing[J].The VLDB Journal,2003,12(1):28-40.
  • 7Verykios V S,Moustakides G V.A generalized costoptimal decision model for record matching[C]//Pro-ceedings of the 2004 International Workshop on In-formation Quality in Information Systems.NewYork:ACM Press,2004:20-26.
  • 8Cochinwala M,Kurien V,Lalk G,et al.Efficient datareconciliation[J].Information Sciences,2001,137(1-4):1-15.
  • 9Christen P.Automatic record linkage using seedednearest neighbour and support vector machine classifi-cation[C]//Proceeding of the 14th ACM SIGKDDInternational Conference on Knowledge Discoveryand Data Mining.New York:ACM Press,2008:151-159.
  • 10Bilenko M,Mooney R,Cohen W,et al.Adaptive namematching in information integration[J].IntelligentSystems IEEE,2005,18(5):16-23.

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部