期刊文献+

分布式数据库聚合计算性能优化 被引量:10

Performance optimization of distributed database aggregation computing
下载PDF
导出
摘要 针对分布式数据库在分析应用方面的聚合计算性能较低的问题,以MongoDB数据库为研究实例,提出了一种基于片键和索引的数据库性能提升方法。首先,通过分析业务特征指导选择的片键字段,该字段需要保证数据在分片节点上的均匀布局;其次,通过研究分布式数据库的索引效率,利用删除查询字段索引的方法进一步提升计算性能,该方法能充分利用硬件资源提高聚合计算的性能。实验结果表明,采用高基数粒度的分片片键能够让数据在集群上均匀地分布在各个数据节点上,而舍弃索引使用全表查询能够有效提高聚合计算的速度,聚合计算优化方法能够有效提高聚合计算的性能。 Aiming at the problem of low computational performance of distributed database in analysis applications, taking MongoDB database as an example, a method was put forward to improve the performance of database based on chip and index. Firstly, the characteristics of the business was analyzed to guide the choice of shard key field, and the selected key field needed to ensure that the data is evenly distributed on the cluster nodes. Secondly, by studying the index efficiency of the distributed database, the method of deleting the query field index was used to further improve the computing performance, which could make full use of hardware resources to improve the performance of aggregation computing. The analysis and experimental results show that the shard key field with high cordinality can distribute data evenly on each data node in the cluster, and the use of full table query can effectively improve the convergence speed, thus the optimization method can effectively improve the performance of aggregation computing.
作者 肖子达 朱立谷 冯东煜 张迪 XIAO Zida ZHU Ligu FENG Dongyu ZHANG Di(School of Computer Science, Communication University of China, Beijing 100024, China Beijng Key Laboratory of Big Data in Security & Protection Industry, Beijing 100024, China)
出处 《计算机应用》 CSCD 北大核心 2017年第5期1251-1256,共6页 journal of Computer Applications
基金 国家自然科学基金资助项目(61730063)~~
关键词 NO SQL Mongo DB MAPREDUCE 聚合计算 性能优化 Not Only SQL (NoSQL) MongoDB MapReduee aggregation computing performance optimization
  • 相关文献

参考文献4

二级参考文献78

  • 1刘清,黄兆华.G-逻辑及其归结推理[J].计算机学报,2004,27(7):865-873. 被引量:28
  • 2黄兆华,邓毅雄.粒计算及其应用的研究[J].华东交通大学学报,2005,22(5):124-128. 被引量:7
  • 3赵君,张春海,李华.基于XML中间件的分布式数据库的数据分片策略[J].计算机工程与设计,2006,27(3):466-468. 被引量:11
  • 4GHANDEHARIZADEH S,DEWITT D.Hybrid-range partitioning strategy:A new declustering strategy for multiprocessor database machines[A].In:Proc.of the 16th VLDB Conf[C].Brisbance,Australia,1990.481-492.
  • 5NAVATHE S,CERI S,WIEDERHOLD G,et al.Vertical Partitioning Algorithms for Database Design[J].ACM Transactions on Database Systems,1984,9(4):680-710.
  • 6杨利,昌月楼.并行数据库技术[M].长沙:国防科技大学出版社,2001.
  • 7LIU Q,SUN H.Theoretical Study of Granular Computing[A].LNAI 4062,The Proceedings of RSKT2006[C].by Springer,China,July 2006.93-102.
  • 8SHOWRON A.Toward intelligent systems:Calculi of information granules[J].Bulletin of International Rough Set Society,2001,5(1/2):9-30.
  • 9SKOWRON A,STEPANIUK J,PETERS JF.Extracting patterns using information granules[J].Bulletin of International Rough Set Society,2001,5(1/2):135-142.
  • 10PETERS JF,SKOWRON A,SURAJ Z,et al.Measures of Inclusion and Closeness of Information Granules:A Rough Set Approach[A].Proceedings of the Third International Conference on RSCTC'2002[C].October 14-16,2002.300-307.

共引文献71

同被引文献70

引证文献10

二级引证文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部