期刊文献+

基于演化版本的Deep Web查询接口维护方法 被引量:1

Deep Web search interface maintenance method based on evolution version
下载PDF
导出
摘要 针对现有Deep Web信息集成系统没有考虑查询接口动态性的特点,造成本地接口与网络接口查询能力不对等的问题,提出一种基于演化版本的Deep Web查询接口维护方法。该方法通过构建本地接口的版本化模型来刻画接口的增量变化,识别变动比较活跃的属性集合;然后采取试探性查询来构建最优查询语句,获取网络接口数据源的变动信息,演化出本地接口的下一个版本,实现对本地查询接口数据源的信息维护的迭代过程。实验结果表明,该方法降低了深网环境变化对Deep Web信息集成带来的影响,确保了Deep Web查询接口的准确率和查全率的稳定性。 In order to solve the problems existed in the traditional Deep Web information integration system that without con- sidering the dynamic feature of search interface, causing local interface and network interface query ability is not equal. Therefore,this paper proposed a Deep Web search interface maintenance method based on evolution version. In this method, constructing the version models of local search interface was to express the incremental change of it ,and to extract the active attribute set. Next, generating the best query string with the set and probing query was to extract the change content and get the next version of local interface. Finally,it could realize the iterative maintenance of local search interface data source. The experimental results show that this method is able to decrease the impact caused by deep Web network changing, and keep the recall and precision of Deep Web search interface in a stable state.
出处 《计算机应用研究》 CSCD 北大核心 2015年第11期3345-3348,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(71271117) 博士研究生创新基金资助项目(CXZZ13_0689)
关键词 DEEP WEB 查询接口 演化版本 接口维护 Deep Web search interface evolution version interface maintenance
  • 相关文献

参考文献16

  • 1Bergman M K. The Deep Web:surfacing hidden value [ J ]. The Jour- nal of Electronic Publishing ,200l ,7( 1 ) :3-21.
  • 2Fetterly D, Manasse M, Najork M,et el. A large-scale study of the' cw~- lution of Web pages[ C ]//Pro of the 12th International Conlerence on World Wide Web. 2003:669-678.
  • 3Madhavan J,Ko D,Kot L,et el. Google' s Deep Web trawl[ C ~//Pr,~ of VLDB Endowment. 2008 : 1241-1252.
  • 4Chang K C C, He B,Zhang Z. Mining semantics fnr large scale inl~'- gration on the Web:evidences,insights and challenges [ J ~. SIGKDD Explorations, 2004,6 ( 2 ) : 67 -76.
  • 5王英,左祥麟,左万利,王鑫.基于本体的Deep Web查询接口集成[J].计算机研究与发展,2012,49(11):2383-2394. 被引量:3
  • 6Wang Tiantian, Li Guo, Duan Qingling, et el. Deep Web integrated query interface construction method based on Apriofi algorithm [ J ]. Journal of Information and Computational Science, 2013,10 ( 15 ) :5063-5075.
  • 7Li Aiping,Miao Jiajia, Jia Yan. Research on broken mappings delec- ting method based on fuzzy aggregation operators in Deep Web integra- tion environment [ C]//Proc of International Conference on E-Busi- ness and E-Government. 2010 : 125-128.
  • 8Florescu D, Koller D,Levy A Y. Using probabiIistie information in da- ta integration[ C ]//Proc of the 23rd International Cont~rence on ~ cry Large Data Bases. 1997:216-225.
  • 9Sanna A D, Dong X, Hatevy A. Bootstrapping pay-as-you-go data inte- gration systems[ C ]//Proc of ACM SIGMOD International Conference on Management of Data. 2008 : 861 - 874.
  • 10The UIUC Web integration repository[ EB/OL]. (2007). http ://rec- ta querier, cs. uiuc. edu/repository/datasets/tel-8/index, html.

二级参考文献47

  • 1缪嘉嘉,李爱平,贾焰,吴泉源.Deep Web集成中数据模式映射失效检测方法研究[J].计算机研究与发展,2008,45(z1):222-227. 被引量:2
  • 2Chang KCC, He B, Li CK, Patel M, Zhang Z. Structured databases on the Web: Observations and implications. SIGMOD Record, 2004,33(3):61-70.
  • 3BrightPlanet.com. The deep Web: Surfacing hidden value. 2000. http://brightplanet.com
  • 4He H, Meng WY, Yu C, Wu ZH. WISE-Integrator: An automatic integrator of Web search interfaces for e-commerce. In: Proc. of the 29th Int'l Conf. on Very Large Data Bases. San Fransisco: Morgan Kaufmann Publishers, 2003.357-368.
  • 5Wu WS, Yu C, Doan AH, Meng WY. An interactive clustering-based approach to integrating source query interfaces on the deep Web. In: Proc. of the 24th ACM SIGMOD Int'l Conf. on Management of Data. Paris: ACM Press, 2004. 95-106.
  • 6Peng Q, Meng WY, He H, Yu C. WISE-Cluster: Clustering e-commerce search engines automatically. In: Proc. of the 6th ACM Int'l Workshop on Web Information and Data Management. Washington: ACM Press, 2004. 104-111.
  • 7He B, Tao T, Chang KCC. Clustering structured Web sources: A schema-based, model-differentiation approach. In: Proc. of the 9th Int'l Conf. on Extending Database Technology. Heraklion: Springer-Verlag, 2004. 536-546.
  • 8Zhao HK, Meng WY, Wu ZH, Raghavan V, Yu C. Fully automatic wrapper generation for search engines. In: Proc. of the 14th Int'l World Wide Web Conf. Chiba: ACM Press, 2005.66-75.
  • 9Zhai YH, Liu B. Web data extraction based on partial tree alignment. In: Proc. of the 14th Int'l World Wide Web Conf. Chiba: ACM Press, 2005.76-85.
  • 10Chang KCC, He B, Zhang Z, Toward large scale integration: Building a MetaQuerier over databases on the Web. In: Proc, of the 2rid Int'l Conf. on Innovative Data Systems Research. Asilomar, 2005, 44-55.

共引文献30

同被引文献3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部