期刊文献+

基于云计算的景区数据仓库应用研究 被引量:3

Research on Data Warehouse Application of Tourist Areas Data Based on Cloud Computing
下载PDF
导出
摘要 云计算、物联网、大数据等新兴信息技术的发展与应用在提高景区信息化服务水平的同时,也对景区海量信息资源的有效利用提出了严峻挑战。面对超大规模、非结构化的海量数据,传统的基于关系型数据库的数据仓库已很难有效支持景区的数据存储与分析工作。基于此文中提出了一种基于云计算技术的景区数据仓库,通过采用HDFS对数据进行分布式存储管理,利用MapReduce设计海量数据的分析模式,使用HiveQL语言实现数据仓库与前端表现层的交互,能够有效解决景区海量数据的数据管理问题。以黄山风景区为实际背景的实验结果表明了该数据仓库的正确性和有效性。 The emergence of new information technologies, such as cloud computing, internet of things, big data, etc, greatly enhances the level of area of information technology services. However, how to effectively utilize the scenic area of information resources is a great challenge. Faced large scale and unstructured mass data, the data warehouse based on the traditional relational database has been difficult to effectively support the data storage and analysis in scenic area. Based on this, propose a scenic area data warehouse based on cloud computing technology, adopting HDFS for distributed storage of data, using MapReduce to design massive data analysis model, with HiveQL language to implement the interaction between data warehouse and front-end presentation layer, which can solve the data management problem of massive data in scenic area. Taking Huangshan as example, the experimental results indicate the data warehouse is correct and feasible.
出处 《计算机技术与发展》 2014年第9期198-201,205,共5页 Computer Technology and Development
基金 国家自然科学基金重点项目(71331002) 智慧景区客流量预测系统项目(10120106011)
关键词 云计算 数据仓库 MAPREDUCE ETL cloud computing data warehouse MapReduce ETL
  • 相关文献

参考文献16

  • 1孟小峰,慈祥.大数据管理:概念、技术与挑战[J].计算机研究与发展,2013,50(1):146-169. 被引量:2371
  • 2王珊,王会举,覃雄派,周烜.架构大数据:挑战、现状与展望[J].计算机学报,2011,34(10):1741-1752. 被引量:614
  • 3White T. Hadoop : the definitive guide [ M ]. [ s. 1. ] : O'Reilly, 2012.
  • 4Iosup A, Ostermann S, Yigitbasi M N, et al. Performance anal- ysis of cloud computing services for many- tasks scientific computing[J]. IEEE Transactions on Parallel and DistributedSystems ,2011,22 (6) :931-945.
  • 5Baliga J, Ayre R W A, Hinton K, et al. Green cloud compu- ting: balancing energy in processing, storage, and transport [ J]. Proceedings of the IEEE,2011,99( 1 ) : 149-167.
  • 6Lee K H, Lee Y J,Choi H,et al. Parallel data processing with MapReduce : a survey [ J ]. ACM SIGMOD Record, 2012,40 (4) :11-20.
  • 7李伟卫,李梅,张阳,申爱丽.基于分布式数据仓库的分类分析研究[J].计算机应用研究,2013,30(10):2936-2939. 被引量:10
  • 8王德文,肖凯,肖磊.基于Hive的电力设备状态信息数据仓库[J].电力系统保护与控制,2013,41(9):125-130. 被引量:40
  • 9Vassiliadis P, Simitsis A, Skiadopoulos S. Conceptual modeling for ETL processes[ C ]//Proceedings of the 5th ACM interna- tional workshop on data warehousing and OLAP. [ s. 1. ] : ACM ,2002 : 14-21.
  • 10Wang T, Hu J,Zhou H. Design and implementation of an ETL approach in business intelligence project [ M ]//Practical ap-plications of intelligent systems. Berlin : Springer,2012.

二级参考文献272

共引文献2923

同被引文献14

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部