
Research on storage transformation and fast retrieval of big data files in HBase based on row-key (cited by: 11)
Abstract: Traditional relational databases struggle to store and retrieve very large volumes of data quickly. To address this, the paper studies four mapping tables: a data-file field mapping table, a file-object field table, an HBase column mapping table, and a storage-transformation execution-plan mapping table. Together these handle the heterogeneity of file objects and make the storage transformation general-purpose. The paper proposes rules and a generation algorithm for a custom RowKey, and gives the workflow and algorithm for transforming and storing data based on the mapping tables and the row key: data in a data file is loaded into HBase according to the mapping between data-file fields and HBase table columns. Finally, fast data access and retrieval for different requirements are implemented through either RowKey prefix matching or keyword matching. The authors state that the approach is highly general and can be widely used in HBase big data storage applications.
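The abstract does not give the actual RowKey rules or mapping-table schemas, so the following is only a minimal sketch of the general idea under stated assumptions. The key layout (`station|timestamp|sequence`), the `FIELD_TO_COLUMN` mapping, and the `SortedTable` class are all hypothetical; `SortedTable` is an in-memory stand-in that keeps rows sorted by key, as HBase does, and is not the HBase client API.

```python
from bisect import bisect_left

# Hypothetical field-to-column mapping table:
# source file field -> (column family, column qualifier).
FIELD_TO_COLUMN = {
    "temp": ("d", "temperature"),
    "hum": ("d", "humidity"),
}


def make_row_key(station: str, timestamp: str, seq: int) -> str:
    """Assumed layout: fixed-width station id | yyyyMMddHHmmss | 4-digit sequence."""
    return f"{station}|{timestamp}|{seq:04d}"


def to_cells(record: dict) -> dict:
    """Rename file fields to 'family:qualifier' columns via the mapping table."""
    return {
        f"{family}:{qualifier}": record[field]
        for field, (family, qualifier) in FIELD_TO_COLUMN.items()
        if field in record
    }


class SortedTable:
    """In-memory stand-in for a table whose rows are ordered by row key."""

    def __init__(self):
        self._keys = []   # row keys, kept in lexicographic order
        self._rows = {}   # row key -> {column name: value}

    def put(self, row_key: str, cells: dict) -> None:
        if row_key not in self._rows:
            self._keys.insert(bisect_left(self._keys, row_key), row_key)
            self._rows[row_key] = {}
        self._rows[row_key].update(cells)

    def prefix_scan(self, prefix: str) -> list:
        """Seek to the first key >= prefix, then read until the prefix stops matching."""
        start = bisect_left(self._keys, prefix)
        result = []
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break
            result.append((key, self._rows[key]))
        return result

    def keyword_scan(self, keyword: str) -> list:
        """Substring match on the row key; this visits every key."""
        return [(k, self._rows[k]) for k in self._keys if keyword in k]


table = SortedTable()
records = [
    {"station": "S001", "time": "20190101120000", "temp": 3.5, "hum": 40},
    {"station": "S001", "time": "20190102120000", "temp": 4.1, "hum": 42},
    {"station": "S002", "time": "20190101120000", "temp": 7.0, "hum": 55},
]
for seq, rec in enumerate(records):
    table.put(make_row_key(rec["station"], rec["time"], seq), to_cells(rec))

one_station = table.prefix_scan("S001|")      # all rows for station S001
one_day = table.keyword_scan("|20190101")     # rows from any station on that day
```

In this sketch the prefix scan touches only a contiguous run of sorted keys, while the keyword scan checks every key, which is why the order of the components in the row key decides which queries are cheap.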
Authors: Sheng Wenshun (圣文顺); Xu Aiping (徐爱萍) (Pujiang Institute, Nanjing Tech University, Nanjing 211200, China; School of Computer, Wuhan University, Wuhan 430072, China)
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2019, No. 12, pp. 3806-3810 (5 pages)
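Funding: National Key R&D Program of China, Key Special Project (2017YFC0803700); Natural Science Research General Project of Jiangsu Higher Education Institutions (19KJD520005)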
Keywords: big data; file storage; row key; eigenvalue; rapid retrieval
