期刊文献+

层次维编码片段立方体生成算法应用研究 被引量:2

Application Research on Hierarchical Dimension Encoding Fragment Cube Algorithm
下载PDF
导出
摘要 数据量大、数据多维是水利普查数据的重要特征。根据水利普查决策分析的需要,在对数据立方体技术研究的基础上,基于部分物化策略,提出了建立层次维编码片段立方体(HDEFC)。利用维度属性的概念分层特性,在层次维片段中采用混合索引(B-tree和Bit Code)技术对每个层次维的层次属性进行二进制编码,再利用生成的维度编码代替原表中关键字,非层次维片段中采用倒排索引技术对每个片段子立方体进行物化,减少了多表连接操作,从而提高OLAP查询效率。实验结果表明,生成的HDEFC占用较小的存储空间,查询方法在面对高维的复杂查询时具有优势。通过建立水利普查数据分析系统,说明了该方法能够有效地解决因数据量庞大、维度多导致的数据计算和查询效率低下等问题,降低了物化水利普查成果数据立方体的时间和空间成本。 Large amount of and multidimensional data is an important feature of water census data.According to the need of water census decision analysis,on the basis of data cube technology and partial materialization strategy,the establishment of Hierarchical Dimension Encoding Fragment Cube( HDEFC) is put forward.By the concept hierarchy characteristics of the dimension attribute,hybrid index( B-tree and Bit Code) technology is used to execute binary coding for hierarchy properties of each dimension,and the generated dimension code is applied to replace the key in the original table.In addition,non hierarchical dimension fragment uses inverted index technology to materialize each sub cube,so as to reduce the multi table join operation and improve OLAP query efficiency. Experiments showthat the generated HDEFC occupies less storage space,and the query method has advantages in the face of high dimensional complex query.Through the establishment of water census data analysis system showthat the method can effectively solve the problem of lowefficiency of data calculation and query because of the huge amount of and multi-dimensional data,which reduces the cost of time and space of the material of water census results data cube.
出处 《计算机技术与发展》 2017年第2期134-138,共5页 Computer Technology and Development
基金 国家科技支撑计划课题(2015BAB07B01) 水利部公益性行业科研专项(201501022)
关键词 水利普查 数据多维 数据立方体 数据分析系统 层次维编码片段 water census multi-dimensional data data cube data analysis system hierarchical dimension encoding fragment
  • 相关文献

参考文献6

二级参考文献52

  • 1刘飞跃,徐新茹,徐萍.Oracle BIEE的概述与应用[J].中山大学学报(自然科学版),2009,48(S1):344-346. 被引量:10
  • 2张宁,李强娇.基于ERP的企业数据仓库设计[J].计算机工程与设计,2005,26(2):351-353. 被引量:10
  • 3郭涛,张浩,陆剑峰.数据挖掘系统开发及Oracle Data Mining应用[J].机械工程与自动化,2005(2):49-51. 被引量:2
  • 4许惠君,李彩林,刘晓安.数据挖掘技术在水库调度中的研究与应用[J].计算机与数字工程,2006,34(9):61-63. 被引量:7
  • 5胡孔法,陈崚,顾颀,蔡俊杰,董逸生.数据仓库系统中一种高效的多维层次聚集算法[J].计算机集成制造系统,2007,13(1):196-201. 被引量:4
  • 6[1]S Chaudhuri, U Dayal. An overview of data warehousing and OLAP technology. SIGMOD Record, 1997, 26(1): 65~74
  • 7[2]P E O'Neil, D Quass. Improved query performance with variant indexes. In: J Peckham ed. Proc of the ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1997. 38~49
  • 8[3]C Y Chan, Y E Ioannidis. Bitmap index design and evaluation. In: L M Haas, A Tiwary eds. Proc of the ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1998. 355~366
  • 9[4]M C Wu. Query optimization for selections using bitmaps. In: A Delis, C Faloutsos, S Ghandeharizadeh eds. Proc of the ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1999. 227~238
  • 10[5]K Wu, E J Otoo, A Shoshani. A performance comparison of bitmap indexes. In: H Paques, L Liu, D Grossman eds. Proc of the 10th Int'l Conf on Information and Knowledge Management. New York: ACM Press, 2001. 559~561

共引文献45

同被引文献13

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部