期刊文献+

A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas 被引量:1

A Clustering Algorithm for Planning the Integration Process of a Large Number of Conceptual Schemas
原文传递
导出
摘要 When tens and even hundreds of schemas are involved in the integration process, criteria are needed for choosing clusters of schemas to be integrated, so as to deal with the integration problem through an efficient iterative process. Schemas in clusters should be chosen according to cohesion and coupling criteria that are based on similarities and dissimilarities among schemas. In this paper, we propose an algorithm for a novel variant of the correlation clustering approach that addresses the problem of assisting a designer in integrating a large number of conceptual schemas. The novel variant introduces upper and lower bounds to the number of schemas in each cluster, in order to avoid too complex and too simple integration contexts respectively. We give a heuristic for solving the problem, being an NP hard combinatorial problem. An experimental activity demonstrates an appreciable increment in the effectiveness of the schema integration process when clusters are computed by means of the proposed algorithm w.r.t, the ones manually defined by an expert. When tens and even hundreds of schemas are involved in the integration process, criteria are needed for choosing clusters of schemas to be integrated, so as to deal with the integration problem through an efficient iterative process. Schemas in clusters should be chosen according to cohesion and coupling criteria that are based on similarities and dissimilarities among schemas. In this paper, we propose an algorithm for a novel variant of the correlation clustering approach that addresses the problem of assisting a designer in integrating a large number of conceptual schemas. The novel variant introduces upper and lower bounds to the number of schemas in each cluster, in order to avoid too complex and too simple integration contexts respectively. We give a heuristic for solving the problem, being an NP hard combinatorial problem. An experimental activity demonstrates an appreciable increment in the effectiveness of the schema integration process when clusters are computed by means of the proposed algorithm w.r.t, the ones manually defined by an expert.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第1期214-224,共11页 计算机科学技术学报(英文版)
关键词 conceptual schema schema integration CLUSTERING conceptual schema, schema integration, clustering
  • 相关文献

参考文献1

二级参考文献23

  • 1Domingos P, Richardson M. Mining the network value of cus- tomers. In Proc. the 7th ACM SIGKDD, Aug. 2001, pp.57- 66.
  • 2Richardson M, Domingos P. Mining knowledge-sharing sites for viral marketing. In Proc. the 8th ACM SIGKDD, July 2002, pp.61-70.
  • 3Kempe D, Kleinberg J M, Tardos E. Maximizing the spread of in?uenee through a social network. In Proc. the 9th ACM SIGKDD, Aug. 2003, pp.137-146.
  • 4Newman M E J, Girvan M. Finding and evaluating commu- nity structure in networks. Physical Review E, 2004, 69(2): 026113.
  • 5Newman M E J, Fast algorithm for detecting community structure in networks. Physical Review E, 2004, 69(6): 066133.
  • 6Papadopoulos S, Kompatsiaxis Y, Vakali A, Spyridonos P. Community detection in social media. Data Mining and Knowledge Discovery, 2012, 24(3): 515-554.
  • 7Shi J, Malik . Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intel- ligence, 2000, 22(8): 888-905.
  • 8Luxburg U. A tutorial on spectral clustering. Statistics and Computing, 2007, 17(4): 395-416.
  • 9Rahn E, Bernstein P A. A survey of approaches to automatic schema matching. J. Very Large Data Base, 2001, 10(4): 334-350.
  • 10Yang X, Procopiuc C M, Srivastava D. Summarizing rela- tional databases. PVLDB, 2009, 2(1): 634-645.

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部