期刊文献+

基于Hadoop和HBase的分布式索引模型的研究 被引量:5

Research of distributed indexing model based on Hadoop and HBase
下载PDF
导出
摘要 随着互联网上信息量飞速增长,海量数据的索引出现了难题,现行的索引方案已经难以提供高效、可靠的服务,为此,设计并实现了一种针对海量数据进行索引的平台模型。该平台模型首先利用Solr分布式索引器生成索引文件,然后利用Hadoop分布式集群,以HDFS分布式文件系统、Map Reduce分布式并行计算模型、Zookeeper同步协同系统以及Hbase分布式数据库技术来处理、协调管理索引和存储海量数据,最后通过实验测试,该平台模型可以克服现行的海量数据索引时存在的效率低的问题,同时具有良好的扩展性和可靠性。 With the rapid growth of the amount of information on the Internet,massive data index appeared,the current index scheme has been difficult to provide efficient and reliable service,therefore,designing and implementing an index for massive data platform model. The platform model using Solr distributed index is generated index file,and then uses the Hadoop cluster,HDFS distributed file system,Map Reduce distributed computing model, Zookeeper synchronous collaborative system and Hbase distributed database technology to handle the coordination management,indexing and storage of mass data,finally,through the experimental test,the platform model can overcome the existing efficiency of mass data index of low current problems,and has good expansibility and reliability.
出处 《信息技术》 2015年第6期109-111,114,共4页 Information Technology
关键词 HADOOP集群 SOLR HBASE Hadoop cluster Solr Hbase
  • 相关文献

参考文献7

  • 1Tom White.Hadoop权威指南[M].周敏,等译.北京:清华大学出版社,2011.
  • 2Borthakur D.HDFS Architechture Guide[Z/OL].http:∥hadoop.apache.orgldocs/r1.0.4ihdfs_design.html,February 2013.
  • 3Borthakur D.The hadoop distributed file system:Architecture and design[Z/OL].http:∥hadoop.apache.org/common/docs.2007.
  • 4郑启龙,房明,汪胜,王向前,吴晓伟,王昊.基于MapReduce模型的并行科学计算[J].微电子学与计算机,2009,26(8):13-17. 被引量:39
  • 5Apache Zookeeper maintained by Apache[EB/OL].http:∥zookeeper.apache.org/,2013.
  • 6Apache HBase maintained by Apache[EB/OL].http:∥hbase.apache.org/,2013.
  • 7Indexed HBase maintained by Y.Kulbak[EB/OL].https:∥github.com/ykulbak/ihbase,2011.

二级参考文献1

共引文献38

同被引文献36

引证文献5

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部