期刊文献+

基于MTR与Impala结合的数据查询优化研究 被引量:1

Research on Data Query Optimization Based on MTR and Impala
下载PDF
导出
摘要 以大数据的查询技术为中心,研究了当前一些主流的查询方法以及在此基础上的优化改进。MapReduce是一种编程模型,将存储在HDFS中的文件分块再整合以达到加速实现数据查询的目的,在此方法的基础上优化得出Map-Trim-Reduce编程模型,然后与Impala查询引擎相结合,利用M印-Trim-Reduce处理复杂数据的长处弥补Impala的短处,提前处理Impala的预处理数据,达到提高大数据查询效率的目的。 This paper takes the large data query technology as the center,and researches some main current query methods and the optimization based on them.MapReduce is a programming model,which integrates the file blocks stored in the HDFS to achieve the purpose of accelerating the realization of data query.Based on this method,an improved Map-Trim-Reduce programming model is obtained,and then it is combined with the Impala query engine.Use Map-Trim-Reduce to deal with the advantages of complex data to make up for the shortcomings of Impala,and deal with the Impala preprocessing data,so as to improve the efficiency of large data query.
机构地区 东北石油大学
出处 《微型电脑应用》 2016年第6期29-31,共3页 Microcomputer Applications
基金 中国石油科技创新基金研究项目(2013D-5006-0203) 黑龙江省科技攻关项目(GZ09A120) 黑龙江省教育厅科学技术研究项目(12521050)
关键词 大数据 Map-Trim-Reduce mpala Big Data Map-Trim-Reduce Impala
  • 相关文献

参考文献2

二级参考文献13

  • 1颜开. 新一代数据分析利器:Google Dremel原理分析[R].2012.
  • 2MELNIK S,GUBAREV A,LONG Jing-jing,et al. Dremel:interactive analysis of Web-scale datasets[J].Proceedings of the VLDB Endowment,2010,3(1):330-339.
  • 3Cloudera Company. CDH4和Impala文档[EB/OL].http://www. cloudera. com/content/support /en/documentation. html.
  • 4Cloudera Impala:Real-time queries in apache Hadoop,for real[EB/OL].(2012-10). http://blog. cloudera. com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/.
  • 5Apache Hadoop[EB/OL].http://hadoop. apache. org.
  • 6Apache Hive[EB/OL].http://hive. apache. org/.
  • 7DEAN J,GHEMAWAT S. MapReduce:simplified data processing on large clusters[C] //Proc of the 6th Symposium on Operating Systems Design and Implementation. 2004.
  • 8DITTRICH J,RICHTER S,SCHUH S. Efficient OR Hadoop:why not both?[J].Datenbank Spektrum,2013,13(1):17-22.
  • 9HDFS architecture guide[EB/OL].(2013-08-04). http:// hadoop. apache. org/docs/ r1. 2. 1/hdfs_de-sign. html.
  • 10Intel. Optimizing Hadoop deployments[EB/OL].(2010-05-23). http://communities. intel. com/ servlet/JiveServletdownloadBody/5645-102-1-8759.

共引文献11

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部