期刊文献+

一种基于数据挖掘的Deep Web模式匹配方法 被引量:1

A METHOD OF DEEP WEB SCHEMA MATCHING BASED ON DATA MINING
下载PDF
导出
摘要 模式匹配是DeepWeb异构信息集成中的关键问题.介绍了一种整体性匹配方法,即同时发现大量模式,并一次性进行匹配。主要通过分析和比较两种已经存在的大规模模式匹配原型系统:MGS和DCM,结合它们核心算法的优点,提出一种新的基于数据挖掘技术的算法(Correlated-clustering)。该算法先利用积极相关发现组匹配,再通过概念相似度的计算聚类同义属性,最后进行匹配选择。实验结果表明,本算法全面、效率高,充分体现了整体性方法的思想。 Schema matching is a critical problem in Deep Web heterogeneous information integration. In this paper it introduces a holistic matching approach, which finds many schemas simultaneously and one-off matches them. We mainly analyzed and compared two existing large scale schema matching archetypal system:MGS and DCM, and proposed a new algorithm based on data mining, named as Correlated-clustering,which combines the advantages of the two existing systems. This algorithm first mines group attributes by positively correlated attributes, and then clusters the synonymous attributes by calculating the similarity of each two concepts, finally makes matching selection from above results. The experiment result shows the effectiveness and completeness of our algorithm, which demonstrates the conception of holistic schema matching.
作者 钟昕 伏玉琛
出处 《计算机应用与软件》 CSCD 2009年第5期46-49,共4页 Computer Applications and Software
基金 国家自然科学基金项目(60673092) 江苏省高校自然科学基金项目(07KJD520187)
关键词 DEEP WEB 模式匹配 整体性方法 数据挖掘 Deep Web Schema matching Holistic approach Data mining
  • 相关文献

参考文献18

  • 1Bergman M K. The deep web:Surfacing hidden value [ J ]. Tech. rep. , BrightPlanet LLC. Dec. 2000.
  • 2Chang K C -C. ,He B,Li C,et al. The UIUC web integration repository. Computer Science Department, University of Illinois at Urbana- Champaign. http ://metaquerier. cs. uiuc. edu/repository [ OL ].
  • 3Chang K C -C ,He B,Zhang Z. Toward large scale integration:Building a metaquerier over databases on the web [ C ]. In CIDR 2005 Conference.
  • 4He B,Chang K C -C. Statistical schema matching across web query interfaces[ C ]. In SIGMOD 2003 Conference.
  • 5He H, Meng W, Yu C, et al. Wise-integrator: An automatic integrator of web search interfaces for e-commerce[ C ]. In VLDB 2003 Conference.
  • 6Rahm E, Bemstein P A. A survey of approaches to automatic schema matching[J]. VLDB Journal,2001,10(4) :334-350.
  • 7Wang J,Wen J-R, Lochovsky F,et al. Instance-based schema matching for web databases by domain-specific query probing [ C ]. In VLDB 2004 Conference.
  • 8Wu W, Yu C T, Doan A, et al. An. interactive clustering-based approach to integrating source query interfaces on the deep web [ C ]. In SIGMOD 2004 Conference.
  • 9He B, Chang K C -C, Han J. Automatic complex schema matching across web query interfaces:A correlation mining approach[ C ]. Technical Report UIUCDCS-R-2003-2388, Dept. of Computer Science, UIUC, Dec. 2003.
  • 10Madhavan J, Bernstein P A, Doan A, et al. Corpus-based schema matching[ C ]. In ICDE Conference,2005.

同被引文献5

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部