Mining Top-k Dense Subgraphs from Uncertain Graphs

Cited by: 5
Abstract: This paper studies the problem of mining top-k dense subgraphs from uncertain graphs. Because uncertainty is inherent in graph data, the definitions of dense subgraphs and the mining algorithms developed for deterministic graphs do not apply to uncertain graphs. The paper therefore proposes the concept of expected density on uncertain graphs and gives a method for computing it in polynomial time. Based on this definition, it defines a partial order on the induced subgraphs of an uncertain graph and uses that order to organize the induced subgraphs into a search (enumeration) tree. The paper rigorously proves that every induced subgraph of the uncertain graph occurs in this tree exactly once. On this tree it presents DS, a branch-and-bound search algorithm that mines the exact top-k dense subgraphs. It also introduces the notion of disjoint top-k dense subgraphs and gives LS, a heuristic approximation algorithm based on beam search. Experiments on multiple datasets indicate that DS is efficient and scalable and can be used to process large graph data, and that LS quickly finds disjoint top-k dense subgraphs with good approximation quality.
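The abstract does not state the formula for expected density, so the following is only a minimal sketch under assumptions that are not taken from the paper: density is taken to be the number of edges divided by the number of vertices, and each edge is assumed to exist independently with a known probability. Under those assumptions, linearity of expectation reduces the expected density of an induced subgraph to a sum of edge probabilities, which is computable in polynomial time, and a brute-force enumeration of possible worlds can be used to check the result on small inputs. The names `expected_density`, `expected_density_bruteforce`, and `EXAMPLE_EDGES` are hypothetical.

```python
from itertools import product

# Hypothetical toy uncertain graph: each edge maps to its existence probability.
EXAMPLE_EDGES = {("a", "b"): 0.9, ("b", "c"): 0.5, ("a", "c"): 0.2}


def induced_edges(vertices, edge_prob):
    """Return (edge, probability) pairs with both endpoints in `vertices`."""
    vs = set(vertices)
    return [(e, p) for e, p in edge_prob.items() if e[0] in vs and e[1] in vs]


def expected_density(vertices, edge_prob):
    """Expected |E| / |V| of the induced subgraph.

    Assumption (not from the paper): density = |E| / |V| and edges are
    independent, so E[|E|] is the sum of the edge probabilities.
    """
    vs = set(vertices)
    if not vs:
        raise ValueError("vertex set must be non-empty")
    return sum(p for _, p in induced_edges(vs, edge_prob)) / len(vs)


def expected_density_bruteforce(vertices, edge_prob):
    """Same quantity, computed by enumerating every possible world.

    Exponential in the number of edges; intended only as a check.
    """
    vs = set(vertices)
    if not vs:
        raise ValueError("vertex set must be non-empty")
    edges = induced_edges(vs, edge_prob)
    total = 0.0
    for world in product([False, True], repeat=len(edges)):
        world_prob = 1.0
        edge_count = 0
        for present, (_, p) in zip(world, edges):
            world_prob *= p if present else 1.0 - p
            edge_count += present
        total += world_prob * edge_count / len(vs)
    return total


if __name__ == "__main__":
    print(expected_density({"a", "b", "c"}, EXAMPLE_EDGES))
    print(expected_density_bruteforce({"a", "b", "c"}, EXAMPLE_EDGES))
```

The two functions should agree up to floating-point error. The closed form is linear in the number of edges, whereas the enumeration doubles with every additional edge.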
Source: Chinese Journal of Computers (《计算机学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2016, No. 8, pp. 1570-1582 (13 pages)
Funding: Supported by the National Natural Science Foundation of China (61173023, 61532015)
Keywords: uncertain graph; top-k dense subgraph; expected density; branch and bound search; data mining