期刊文献+

一种使用索引式备份的范围查询方法 被引量:2

Range Search Method Based on Indexed Backup in Distributed Data Store
下载PDF
导出
摘要 范围查询是数据库支持的重要功能之一.在分布式数据库中,范围查询具有结果不唯一且数据量大的特点,因而中间结果的传输成本较高;同时,在以唯一主键散列存储的数据上查询非主键列上的范围条件,得到的命中数据较为分散,造成传输开销大和计算复杂度的问题.针对以上问题和基于高可用系统中数据存在多个备份的观察,本文设计了一种使用索引式备份的范围查询方法,通过将备份数据依据索引列重构实现范围查询友好的索引式备份,优化基于非主键列的范围查询任务.使用索引式备份的范围查询具有数据聚集的特点,能够有效减少数据传输成本、提高计算效率.实验结果显示,在TPC-H基准测试中,本方法相对于传统B+树索引方法具有8.4至16.7倍的性能提升,且具有较好的可扩展性. Range searching is one of the essential functionality that database should provide. In distributed database, the transmission cost of intermediate data is relatively high because of the non-unique, high-volume results matched by range conditions. Besides, range- searching on non-primary key columns will get scattered results when data is indexed by primary key, leading low efficiency in trans- mission and computation. In this paper, we design a range search method based on indexed backup, based on observation that data has multiple backups. Data in backup is reorganized by indexed columns to answer the range queries more efficiently. In this way, data fil- tered by range conditions is stored together,which largely benefit transmission and computation. Experiments on TPC-H benchmark show that range-searching on indexed backup outperforms B + tree index methods by 8.4X-16.7X,and the design has good scalability.
作者 薛翔 沈斯杰 陈榕 XUE Xiang;SHEN Si-jie;CHEN Rong(Institute of Parallel and Distributed Systems,Software School,Shanghai Jiaotong University,Shanghai 200240,Chin)
出处 《小型微型计算机系统》 CSCD 北大核心 2018年第8期1781-1786,共6页 Journal of Chinese Computer Systems
基金 国家重点研发计划项目(2016YFB1000500)资助 国家自然科学基金青年项目(61003002)资助
关键词 分布式数据库 范围查询 二级索引 索引式备份 distributed database range query secondary index indexed backup
  • 相关文献

同被引文献14

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部