期刊文献+

一种基于Solr的HBase海量数据二级索引方案 被引量:15

A Secondary Index Scheme of Big Data in HBase Based on Solr
下载PDF
导出
摘要 针对HBase不提供二级索引和华为的hindex方案难以满足海量数据检索速度需求的问题,文章设计了基于Solr的HBase二级索引方案SIHBase(Solr Indexing HBase)。该方案使用HBase的Coprocessor(协处理器)为数据表的创建、修改、删除以及数据的插入、更新、删除和恢复等操作都实现了相应的回调函数,通过回调函数向Solr发送相关请求,以实现在Solr中自动为HBase建立和维护二级索引,保证数据与索引的一致性。该方案具有良好的通用性,可以同时为多张表的多列数据建立索引。该方案扩展了HBase的客户端功能,增加了直接查询Solr的接口,利用Solr提供的高效、灵活、多样的检索功能实现对HBase海量数据的快速检索。最后,与hindex进行了二级索引的查询性能对比实验,证明了该方案在查询速度上要远快于hindex。 For the problem that HBase cannot provide secondary indexes and Huawei hindex scheme is difficult to meet the project demand of retrieval speed, designed an HBase secondary index scheme named SIHBase(Solr Indexing HBase) based on Solr. The scheme uses HBase Coprocessor to achieve the related callback function for creating, changing, deleting operations of the data tables and inserting, updating, deleting recovering operations of data. Thus, it can create and manage secondary indexes in Solr for HBase automatically and ensure the consistency of the data and index. The scheme has favorable generality and can create index for multi-column data of multiple tables in the meantime. And then extended the client-side function of HBase, increased the direct query interface of Solr, using efficient, flexible and diversified retrieval functions which Solr provided to realize quick retrieval for the mass data of HBase. Finally, a contrast experiment about query performance with hindex show that SIHBase was turned out to be much faster than hindex in query speed.
出处 《信息网络安全》 CSCD 2017年第8期39-44,共6页 Netinfo Security
基金 国家科技支撑计划[2012BAH18B05] 国家自然科学基金[61272447] 四川省科技厅计划项目[16ZHSF0483]
关键词 HBASE 二级索引 SOLR 快速检索 HBase secondary index Solr quick retrieval
  • 相关文献

参考文献4

二级参考文献46

  • 1HBase :bigtable-like structured storage for hadoop hdfs [ EB/OL ]. http ://hadoop. apache, org/hbase/,2010.
  • 2Fan Chang, Jeffrey Dean, Sanjay Chemawat, et al. Bigtable: a dis- tributed storage system for structured data[ C ]. Proceedings of 7th USENIX Symposium on Operating Systems Design and Implemen- tation( OSDI'06 ), Seattle, WA, USA: USENIX Association, 2006 : 205-218.
  • 3Dhruba Borthakur. The hadoop distributed file system:Architecture and design [ EB/OL ]. http://hadoop, apache, org/hdfs ,2011.
  • 4Ramaswamy Hafiharaa,Bigit Hore,Chen Li,et al. Processing spatial- keyword (SK) queries in geographic information retrieval (GIR) sys- tems[ A]. Proceedings of the lgth International Conference on Scientif- ic and Statistical Database Managem (SSDBM '07) [ C ]. Washing- ton,DC,USA:IEEE Computer Society,2007:16-25.
  • 5Ian De Felipe, Vagelis Hristidis, Naphtali Rishe. Keyword search on spatial databases[ A] . Proceedings of the 2008 IEEE 24th Interna- tional Conference on Data Engineering (ICDE '08 ) [ C]. Washing- ton, DC, USA: IEEE Computer Society, 2008:656 -665.
  • 6Cong Gao ,Christian S Jensen,Wu Ding-ming. Efficient retrieval of the top-k most relevant spatial web objects [ J ]. Proceedings of VLDB Endowment,2009,2( 1 ) :337-348.
  • 7Jolo B Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, et al. Ef- ficient processing of Top-k spatial keyword queries [ A ]. Proceed- ings of the 12th International Conference on Advances in Spatial and Temporal Databases ( SSTD ' 11 ) [ C ]. Berlin, Heidelberg : Springer-Verla,2011:205-222.
  • 8Guo Wei, Guo Jing, Hu Zhi-yong. Spatial database indexing tech- nique [ M ]. Shanghai: Shanghai Jiao Tong University,Press,2006.
  • 9Ooi, Mcdonell K J, Sacks R Davis. Spatial kd-tree: an indexing mechanism for spatial database [ A ]. Proceedings of the 11 th Annu- M International Computer Software and Applications Conference ( COMPSAC '87 ) [ C ]. Washington, DC, USA: IEEE Computer Society, 1987:433-438.
  • 10DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters[ C]// OSDI'04: Proceedings of the Sixth Confer- ence on Symposium on Operating System Design and Implementa- tion. Berkeley: USENIX Association, 2004:137 - 150.

共引文献53

同被引文献97

引证文献15

二级引证文献73

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部