期刊文献+

基于操作历史图的分布式Key-Value数据库一致性检测算法 被引量:9

Consistency Checking Algorithm for Distributed Key-Value Database Based on Operation History Graph
下载PDF
导出
摘要 分布式数据库系统的副本机制在提高系统可靠性及性能的同时,导致了多副本数据管理的一致性问题;数据一致性的实现需要一致性协议模型来进行预防,也需要一致性检测算法对非一致数据进行检测。首先,对读写操作记录之间的时序关系、安全一致性及并行一致性原则等概念进行定义;其次,根据操作记录集合中读写操作之间的并行与时序关系,提取出操作记录集合向操作记录图转化的规则,并在此基础上设计了操作记录向历史记录图的转化算法;然后,以历史记录图为输入,设计了违反一致性查找算法,查找并返回图中所有违反安全与并行一致性读操作的集合;最后,基于Cassandra进行实验并将读写一致性设置为ONE,通过YCSB产生并行读写压力测试,与同类算法的对比实验验证了所提算法在功能与效率两方面的优越性。 The replica mechanism of distributed database system not only improves reliability and performance of the overall system,but also leads to the consistency problem of multi-replica data management mechanism.To keep the consistency of data,a consistency protocol model is needed to avoid data’s inconsistency events.Moreover,consistency checking algorithms are also needed to detect inconsistent data.Firstly,the concepts of temporal relations,security consistency,and concurrent consistency between read and write operations are defined.Secondly,according to the parallel and temporal relationship between read and write operations that recorded in the set of operations,the rules of transforming operation record set to operation record graph are extracted,and then the algorithm of transforming operation records into operation record graph is also designed.Then,taking the set of operation record graph as input,a violation operation search algorithm is designed to find the set of inconsistent read operations which have violated security and parallel consistency.Finally,experiments are conducted based on Cassandra and the read-write consistency is set to ONE.YCSB generates parallel read-write stress tests.The comparative experiments with similar algorithms verify the advantages of the proposed algorithm in both function and efficiency.
作者 廖彬 张陶 李敏 于炯 国冰磊 刘炎 LIAO Bin;ZHANG Tao;LI Min;YU Jiong;GUO Bing-lei;LIU Yan(College of Statistics and Data Science,Xinjiang University of Finance and Economics,Urumqi 830012,China;School of Information Science and Engineering,Xinjiang University,Urumqi 830046,China;Department of Medical Engineering and Technology,Xinjiang Medical University,Urumqi 830011,China;School of Software,Tsinghua University,Beijing 100084,China)
出处 《计算机科学》 CSCD 北大核心 2019年第12期213-219,共7页 Computer Science
基金 国家自然科学基金项目(61562078,61462079,61862060) 新疆天山青年计划项目(2018Q073)资助
关键词 分布式数据库 Key-Value数据库 一致性原则 一致性检测 DAG图 Distributed database Key-Value database Consistency principle Consistency check DAG diagram
  • 相关文献

参考文献4

二级参考文献37

  • 1陈勇旭,陈梦杰,刘雪冰,宋杰.基于MapReduce的连接聚集查询算法研究[J].计算机研究与发展,2013,50(S1):306-311. 被引量:7
  • 2Han J,Kamber M.数据挖掘:概念与技术[M].北京:机械工业出版社,2007.
  • 3田国忠.多DAG共享资源调度的若干问题研究[D].北京:北京工业大学,2014.
  • 4Rahm E, Do H H. Data cleaning: problems and current approaches[J]. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13.
  • 5Ponniah P. Data warehousing fundamentals: a comprehensive guide for IT professionals[M]. Hoboken, NJ, USA: John Wiley & Sons, 2004.
  • 6Batini C, Scannapieco M. Data quality: concepts, methodologies and techniques[M]. New York, USA: Springer, 2006.
  • 7Benge J, Jordan G M W, Smith P, et a1. Global data management survey: the new economy is the data economy[R]. Coopers, Price Waterhouse, 2001.
  • 8Eckerson W W. Data quality and the bottom line[R/OL]. The Data Warehouse Institute (2002)[2014-09-10]. http:// www.tdwi.org/researchidisp1ay.aspx?ID=6064.
  • 9Andritsos P, Fuxman A, Miller R J. Clean answers over dirty databases: a probabilistic approach[C]//Proceedings of the 22nd International Conference on Data Engineering, Atlanta, USA, Apr 3-7,2006. Piscataway, NJ, USA: IEEE, 2006: 30.
  • 10Silbers C A, Korth H F. Database system concepts[M]. New York, USA: McGraw-Hill, 1986.

共引文献24

同被引文献99

引证文献9

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部