
Implementation and Optimization of Inconsistency Detection in Distributed Data

Published English title: Inconsistency Detection in Distributed Data: Implement, Meliorate
Abstract: Inconsistency detection is a central topic in data cleaning. For centralized databases, inconsistencies can be detected effectively with SQL-based techniques, but detection is far more challenging when the data is distributed. Existing work based on conditional functional dependencies (CFDs) formulates distributed inconsistency detection as an optimization problem and proposes several feasible algorithms for it. This paper introduces the CFD-based inconsistency detection problem for distributed data, characterizes the conditional functional dependencies, the fragmentation of the dataset, the optimization problem, and the related detection algorithms, and implements the optimization-based distributed detection algorithms. Finally, several experiments are conducted to verify and improve these algorithms.
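The abstract does not give the paper's algorithms, so the following is only a minimal sketch of what a CFD violation is and why fragmentation complicates detection. It assumes the standard CFD semantics (an embedded functional dependency plus a pattern tuple in which `_` is a wildcard). The relation, the attribute names (`CC`, `zip`, `street`), and the function `cfd_violations` are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from collections import defaultdict

WILDCARD = "_"  # assumed notation for "any value" in a CFD pattern tuple


def matches(value, pattern_value):
    """A value matches a pattern entry if the entry is a wildcard or equal."""
    return pattern_value == WILDCARD or value == pattern_value


def cfd_violations(rows, lhs, rhs, pattern):
    """Return sorted indices of rows that violate the CFD (lhs -> rhs, pattern).

    Hypothetical helper: a centralized check over a list of dict rows.
    """
    violating = set()
    groups = defaultdict(list)
    for i, row in enumerate(rows):
        # The CFD only constrains rows whose LHS values match the pattern.
        if not all(matches(row[a], pattern[a]) for a in lhs):
            continue
        # Single-tuple violation: the RHS pattern is a constant and differs.
        if pattern[rhs] != WILDCARD and row[rhs] != pattern[rhs]:
            violating.add(i)
        groups[tuple(row[a] for a in lhs)].append(i)
    # Pairwise violation: same LHS values but different RHS values.
    for members in groups.values():
        if len({rows[i][rhs] for i in members}) > 1:
            violating.update(members)
    return sorted(violating)


# Illustrative data: for country code 44, zip is assumed to determine street.
rows = [
    {"CC": "44", "zip": "EH4 8LE", "street": "Mayfield"},
    {"CC": "44", "zip": "EH4 8LE", "street": "Crichton"},
    {"CC": "01", "zip": "07974", "street": "Tree Ave"},
    {"CC": "01", "zip": "07974", "street": "Elm St"},
]
lhs, rhs = ["CC", "zip"], "street"
uk_zip_pattern = {"CC": "44", "zip": WILDCARD, "street": WILDCARD}

# Centralized check over the whole relation.
central = cfd_violations(rows, lhs, rhs, uk_zip_pattern)

# A horizontal fragmentation that separates the two conflicting rows.
fragment_a = [rows[0], rows[2]]
fragment_b = [rows[1], rows[3]]
local_a = cfd_violations(fragment_a, lhs, rhs, uk_zip_pattern)
local_b = cfd_violations(fragment_b, lhs, rhs, uk_zip_pattern)
```

Here `central` flags the two rows with country code 44, while `local_a` and `local_b` are both empty: neither site can see the conflict on its own. Some tuples therefore have to be shipped between sites, and choosing what to ship at low cost is the kind of question an optimization formulation addresses.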
Source: Intelligent Computer and Applications (《智能计算机与应用》), 2015, No. 3, pp. 57-60, 64 (5 pages in total)
Funding: National Natural Science Foundation of China (61173022)
Keywords: distributed data; inconsistency; conditional functional dependency; optimization


