一种基于数据挖掘的Deep Web模式匹配方法被引量：1

A METHOD OF DEEP WEB SCHEMA MATCHING BASED ON DATA MINING

下载PDF

导出

摘要模式匹配是DeepWeb异构信息集成中的关键问题.介绍了一种整体性匹配方法,即同时发现大量模式,并一次性进行匹配。主要通过分析和比较两种已经存在的大规模模式匹配原型系统:MGS和DCM,结合它们核心算法的优点,提出一种新的基于数据挖掘技术的算法(Correlated-clustering)。该算法先利用积极相关发现组匹配,再通过概念相似度的计算聚类同义属性,最后进行匹配选择。实验结果表明,本算法全面、效率高,充分体现了整体性方法的思想。 Schema matching is a critical problem in Deep Web heterogeneous information integration. In this paper it introduces a holistic matching approach, which finds many schemas simultaneously and one-off matches them. We mainly analyzed and compared two existing large scale schema matching archetypal system：MGS and DCM, and proposed a new algorithm based on data mining, named as Correlated-clustering,which combines the advantages of the two existing systems. This algorithm first mines group attributes by positively correlated attributes, and then clusters the synonymous attributes by calculating the similarity of each two concepts, finally makes matching selection from above results. The experiment result shows the effectiveness and completeness of our algorithm, which demonstrates the conception of holistic schema matching.

作者钟昕伏玉琛

机构地区苏州大学计算机科学与技术学院

出处《计算机应用与软件》 CSCD 2009年第5期46-49,共4页 Computer Applications and Software

基金国家自然科学基金项目(60673092) 江苏省高校自然科学基金项目(07KJD520187)

关键词 DEEP WEB 模式匹配整体性方法数据挖掘 Deep Web Schema matching Holistic approach Data mining

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论] TN919.81 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献18

1Bergman M K. The deep web:Surfacing hidden value [ J ]. Tech. rep. , BrightPlanet LLC. Dec. 2000.
2Chang K C -C. ,He B,Li C,et al. The UIUC web integration repository. Computer Science Department, University of Illinois at Urbana- Champaign. http ://metaquerier. cs. uiuc. edu/repository [ OL ].
3Chang K C -C ,He B,Zhang Z. Toward large scale integration:Building a metaquerier over databases on the web [ C ]. In CIDR 2005 Conference.
4He B,Chang K C -C. Statistical schema matching across web query interfaces[ C ]. In SIGMOD 2003 Conference.
5He H, Meng W, Yu C, et al. Wise-integrator: An automatic integrator of web search interfaces for e-commerce[ C ]. In VLDB 2003 Conference.
6Rahm E, Bemstein P A. A survey of approaches to automatic schema matching[J]. VLDB Journal,2001,10(4) :334-350.
7Wang J,Wen J-R, Lochovsky F,et al. Instance-based schema matching for web databases by domain-specific query probing [ C ]. In VLDB 2004 Conference.
8Wu W, Yu C T, Doan A, et al. An. interactive clustering-based approach to integrating source query interfaces on the deep web [ C ]. In SIGMOD 2004 Conference.
9He B, Chang K C -C, Han J. Automatic complex schema matching across web query interfaces:A correlation mining approach[ C ]. Technical Report UIUCDCS-R-2003-2388, Dept. of Computer Science, UIUC, Dec. 2003.
10Madhavan J, Bernstein P A, Doan A, et al. Corpus-based schema matching[ C ]. In ICDE Conference,2005.

同被引文献5

1王涛,李舟军,胡小华,颜跃进,陈火旺.一种高效的数据流挖掘增量模糊决策树分类算法[J].计算机学报,2007,30(8):1244-1250. 被引量：18
2李昕,吕义,王丽艳.基于C/S模式学分制教务管理系统[J].辽宁工学院学报,2000,20(1):54-55. 被引量：10
3徐琳,吕磊,洪志全.基于B/S结构的高校教务办公自动化系统的设计与实施[J].电脑与信息技术,2001,9(3):27-29. 被引量：11
4朱煜,赵谨,高敦岳.基于C/S体系结构的一卡通局域网管理系统[J].计算机工程,2002,28(2):214-215. 被引量：7
5刘涛,李振星,唐卫清,唐荣锡.基于J2EE和XML的网站自动构建平台[J].计算机辅助设计与图形学学报,2003,15(6):760-765. 被引量：5

引证文献1

1吕宏,杨光.一种基于Java及数据挖掘技术的学生管理系统[J].价值工程,2012,31(8):123-124. 被引量：2

二级引证文献2

1申斌,李利民.基于MVC模式S2SH框架的库存管理系统[J].实验室研究与探索,2014,33(11):113-117. 被引量：10
2周游,张国华.基于SSM框架智慧养老系统设计[J].软件,2021,42(10):47-49. 被引量：3

1杨利萍,邹琪.基于先验形状信息的水平集图像分割[J].计算机科学,2012,39(8):288-291. 被引量：4
2孙晓霞,刘晓霞,谢倩茹.模糊C-均值(FCM)聚类算法的实现[J].计算机应用与软件,2008,25(3):48-50. 被引量：34
3仇明,朱小琳.基于Webservice校园信息系统集成的研究[J].宁波职业技术学院学报,2011,15(5):49-52. 被引量：4
4罗华雯,赵敬中,黄建春,祝翠琴,宋瀚涛.利用JAVA实现基于Web的异构数据库的联合使用[J].计算机应用研究,2000,17(7):104-106. 被引量：8
5毕蓉蓉,刘渊,翟学敏.异构信息集成系统中查询处理的优化研究[J].微电子学与计算机,2008,25(6):171-174.
6孙成柱,陈威.基于本体的企业异构信息集成系统[J].福建电脑,2015,31(9):4-5.
7高会贤,郑晓势,赵彦玲.说话人识别技术探讨[J].电声技术,2008,32(1):52-55.
8蒋盛益,阮幼林,李庆华.面向混合属性的高效聚类算法研究[J].计算机工程,2006,32(12):47-49.
9王静.基于网络日志的用户查询推荐[J].河南科技,2016,35(7):50-51. 被引量：1
10蒋盛益,李庆华.一种基于引力的聚类方法[J].计算机应用,2005,25(2):286-288. 被引量：9

计算机应用与软件

2009年第5期

浏览历史

内容加载中请稍等...

一种基于数据挖掘的Deep Web模式匹配方法被引量：1

参考文献18

同被引文献5

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种基于数据挖掘的Deep Web模式匹配方法 被引量：1

参考文献18

同被引文献5

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种基于数据挖掘的Deep Web模式匹配方法被引量：1