期刊文献+

基于Hadoop的地理国情监测高性能计算分析研究 被引量:1

Research on the High Performance Computing and Analysis of Geographical Conditions Monitoring Based on Hadoop
下载PDF
导出
摘要 随着常态化地理国情监测在我国的全面开展,地理国情时空数据规模不断增大,使得对地理国情数据综合分析和信息挖掘的要求越来越高。本文针对传统空间数据存储和分析平台存在的性能瓶颈问题,提出了一种基于Hadoop的地理国情高性能计算框架,论述了通过服务封装实现分布式并行计算能力与成熟平台地理空间分析能力的无缝聚合实现最优化地理国情分析计算的理念,并以地理国情地表覆盖面积统计为例,探讨了分布式计算技术在地理国情海量数据分析方面的适用性。结果表明,随着数据量的增加,分布式计算集群的性能优势逐渐凸显,比传统集中式空间数据库更适合于海量数据的分析计算,而集群的良好横向拓展性,也为大规模地理国情空间数据分析提供了可行解决方案,研究结果将为进一步构建开放性地理国情分析服务平台奠定基础。 With the normalization of the geographical conditions monitoring in China,the amount of geographical conditions spatio-temporal data is continuously increasing,which makes the demand for comprehensive analysis and information mining of geographical conditions data become higher and higher.To resolve the bottleneck problems of traditional data storage and analysis platform,a high performance computing and analysis framework of geographical conditions is proposed.Then,the idea of seamlessly combining distributed parallel computing technology and mature geographical spatial analysis technology with service encapsulates to achieve optimized calculation is discussed.Finally,a case study of geographical conditions surface area statistic is carried out to confirm the capability of distributed computing technology in huge amounts spatial data analysis.The result shows that with the increasing of data amount the performance advantage of distributed computing cluster is gradually highlighted.The cluster shows the good performance of horizontal sacaling up,which will provide a feasible solution for largescale spatial data analysis in geographical conditions monitoring.The research results confirme a good foundation for further construction of the geographical conditions analysis and service for open platform.
作者 高崟 GAO Yin(National Geomatics Center of China,Beijing 100830,China)
出处 《地理信息世界》 2018年第2期50-55,71,共7页 Geomatics World
关键词 地理国情监测 海量空间数据 高性能计算 HADOOP National Geographical Conditions Monitoring huge amounts of spatial data high performance computing Hadoop
  • 相关文献

参考文献7

二级参考文献62

  • 1孙娜,吴立增,苑津莎,王琨.电力设备数据仓库的设计开发[J].电力系统通信,2005,26(9):50-53. 被引量:3
  • 2林峰,胡牧,蒋元晨,倪斌.电力调度综合数据平台体系结构及相关技术[J].电力系统自动化,2007,31(1):61-64. 被引量:87
  • 3王继业.电力企业数据中心建立及其对策[J].中国电力,2007,40(4):69-73. 被引量:21
  • 4Apache HadoopOrg. Hadoop[EB/OL]. [2011-02-11].http://hadoop.apache.org.
  • 5AshishThusoo, JoydeepSenSarma, Namit Jain, et al. Hive—a petabyte scale data warehouse using Hadoop[C] //Data Engineering (ICDE), 2010 IEEE 26th InternationalConference on. 15, April 2010: 996-1005.
  • 6Jiang D,Tung A K H,Chen Gang. MAP-JOIN-REDUCEtoward scalable and efficient data analysis on largeclusters[J]. Knowledge and Data Engineering, 2011:1299-1311.
  • 7国家电网公司.输变电设备状态监测系统总体框架设计[S].北京:国家电网公司,2010.
  • 8Ostermann I A, Yigitbasi S, Fahringer M N,et al.Performance analysis of cloud computing services formany-tasks scientific computing[ J] .IEEE Trans onParallel and Distributed Systems, 2011: 931-945.
  • 9Baliga J,Ayre R W A, Hinton K,et al. Green cloudcomputing: balancing energy in processing, storage, andtransport[J]. Proceedings of the IEEE, 2011: 149-167.
  • 10Curtis A R, Wonho Kim.Mahout: low-overheaddatacenter traffic management using end-host-basedelephant detection[C] // 2011 Fourth IEEE InternationalConference on Utility and Cloud Computing, 2011:1629-1637.

共引文献242

同被引文献11

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部