Design of Multi-Dimensional Information Network Datawarehouse Model for Online Graph Processing
Cited by: 5
Abstract: With the emergence of information networks, information has evolved from simple numerical data into more complex graph-structured networks. How to organize and store graph-based information network data well has become an urgent problem. This paper applies dimensional modeling to graph-based information network data and proposes a multi-dimensional information network data warehouse model (MINDM), which aims to provide the underlying data platform for online graph processing (OLGP). MINDM consists of an edge fact table, a node fact table, an informational-dimension link attribute table, and a topological-dimension node attribute table. Experiments show that MINDM clearly outperforms a universal relation table in redundancy elimination, query time, and storage space. On a real dataset of 12,500 ACM papers, query time stays stable at tens of milliseconds, roughly an order of magnitude lower than the universal relation table, which takes hundreds of milliseconds for the same queries. As the number of papers grows, the storage overhead of the model also increases much more slowly than that of the universal relation table.
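To make the four-table layout concrete, the following is a minimal sketch in Python using the standard `sqlite3` module. The abstract names the four tables but does not give their columns, so every table name, column name, and sample row here is a hypothetical illustration and not the authors' actual schema. The sketch only shows the general idea of keeping node and link attributes in their own dimension tables so that the edge fact table stores keys instead of repeating attribute values.

```python
import sqlite3


def build_demo_db():
    """Build an in-memory database with a hypothetical four-table layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Topological dimension: one row of attributes per node (assumed columns).
        CREATE TABLE topo_node_attr (
            node_id   INTEGER PRIMARY KEY,
            node_type TEXT NOT NULL,
            label     TEXT NOT NULL
        );
        -- Informational dimension: one row per kind of link (assumed columns).
        CREATE TABLE info_link_attr (
            link_attr_id INTEGER PRIMARY KEY,
            relation     TEXT NOT NULL,
            year         INTEGER
        );
        -- Node fact table: a per-node measure, here the node degree.
        CREATE TABLE node_fact (
            node_id INTEGER PRIMARY KEY REFERENCES topo_node_attr(node_id),
            degree  INTEGER NOT NULL
        );
        -- Edge fact table: keys only, attributes live in the dimension tables.
        CREATE TABLE edge_fact (
            src_id       INTEGER NOT NULL REFERENCES node_fact(node_id),
            dst_id       INTEGER NOT NULL REFERENCES node_fact(node_id),
            link_attr_id INTEGER NOT NULL REFERENCES info_link_attr(link_attr_id),
            PRIMARY KEY (src_id, dst_id, link_attr_id)
        );
    """)
    # Made-up sample data, for illustration only.
    nodes = [(1, "author", "Author A"), (2, "author", "Author B"),
             (3, "paper", "Paper X"), (4, "paper", "Paper Y")]
    links = [(1, "writes", 2013), (2, "cites", 2013)]
    edges = [(1, 3, 1), (2, 3, 1), (1, 4, 1), (4, 3, 2)]

    # Degree counts every edge endpoint that touches the node.
    degree = {node_id: 0 for node_id, _, _ in nodes}
    for src, dst, _ in edges:
        degree[src] += 1
        degree[dst] += 1

    conn.executemany("INSERT INTO topo_node_attr VALUES (?, ?, ?)", nodes)
    conn.executemany("INSERT INTO info_link_attr VALUES (?, ?, ?)", links)
    conn.executemany("INSERT INTO node_fact VALUES (?, ?)", list(degree.items()))
    conn.executemany("INSERT INTO edge_fact VALUES (?, ?, ?)", edges)
    conn.commit()
    return conn


def papers_by_author(conn, author_label):
    """Return the labels of papers linked to an author by a 'writes' edge."""
    rows = conn.execute(
        """
        SELECT p.label
        FROM edge_fact AS e
        JOIN topo_node_attr AS a ON a.node_id = e.src_id
        JOIN topo_node_attr AS p ON p.node_id = e.dst_id
        JOIN info_link_attr AS l ON l.link_attr_id = e.link_attr_id
        WHERE a.label = ? AND l.relation = 'writes'
        ORDER BY p.label
        """,
        (author_label,),
    ).fetchall()
    return [label for (label,) in rows]


if __name__ == "__main__":
    demo = build_demo_db()
    print(papers_by_author(demo, "Author A"))
```

In a universal relation table, each edge row would carry the author name, paper title, relation type, and year inline, so the same values would be stored once per edge. Under the assumed layout above, they are stored once per node or per link type, which is one plausible reading of the redundancy and storage savings the abstract reports.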
Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), CSCD, 2014, No. 1, pp. 51-60 (10 pages)
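Funding: National Natural Science Foundation of China, Nos. 61103043 and 61173099; Key Project of the National Science and Technology Support Program of the 12th Five-Year Plan, No. 2012BAG04B02; Fundamental Research Funds for the Central Universities, No. 2010SCU11053; Science and Technology Support Program of Sichuan Province, No. 2011GZ0223; Specialized Research Fund for the Doctoral Program of Higher Education, No. 20110181120062; Open Fund of the State Key Laboratory of Software Engineering, Wuhan University, No. SKLSE2012-09-26; Huawei Innovation Program, No. YJCB2011036RE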
Keywords: information network (InfoNetwork); informational dimension; topological dimension; online graph processing; multi-dimensional information network data warehouse model