MRSM:挖掘具有代表性的极大频繁子图

MRSM:a new algorithm for mining maximal frequent representative subgraphs

下载PDF

导出

摘要基于随机化思想,提出了一种新的挖掘具有代表性的极大频繁子图的算法——MRSM算法。该算法在第一步挖掘极大频繁子图过程中,采用基于随机化的方法,利用已挖掘到的结果,提高算法的效率;在第二步聚类过程中,综合考虑了频繁模式在支持度和结构上的相似性,使得聚类的质量更好。在真实和模拟数据集上的实验结果证实了MRSM算法的有效性。 A new algorithm for maximal frequent representative subgraph mining （MRSM）, called the MRSM algorithm for short, is proposed based on the randomized strategy. The new algorithm uses the mined patterns to improve its efficiency in the stage of mining maximal frequent subgraphs, and in the stage of clustering, it comprehensively considers the similarity in both structure and support of frequent patterns to improve its clustering performance. The extensive experiments on real and synthetic datasets verified the effectiveness and efficiency of the new algorithm, and showed that it can extract high-quality representative patterns.

作者杨艳屈松刘勇

机构地区黑龙江大学计算机科学技术学院黑龙江省数据库与并行计算重点实验室

出处《高技术通讯》 CAS CSCD 北大核心 2013年第4期337-344,共8页 Chinese High Technology Letters

基金国家自然科学基金(60973081) 黑龙江省自然科学基金(F201011) 黑龙江省高校科技创新团队建设计划项目(2013TD012) 黑龙江省教育厅科学技术研究面上项目(11551352 12531476) 哈尔滨市青年科技创新人才研究(2012RFQXG096 2012RFQXS094)资助项目

关键词数据挖掘极大频繁子图代表模式随机算法 Data mining, maximal frequent subgraph, representative pattern, randomized algorithms

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献14

1Agrawal R, Srikant R. Fast algorithms for mining associa- tion rules in large database. In:Proceedings of the 20th International Conference on Very Large Databases, San- tiago, Chile, 1994. 487-499.
2Zaki M J. Efficiently mining frequent trees in a forest. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, 2002. 71-80.
3Yah X F, Han J W. gSpan: Graph-based substructure Pattern mining. In : Proceedings of the IEEE International Conference on Data Mining, Maebashi City, Japan, 2002. 548-551.
4Huan J, Wang W, Prins J. Efficient mining of frequent subgraphs in the presence of isomorphism. In: Proceeding of the IEEE International Conference on Data Mining, Melbourne, USA, 2003. 549-552.
5Nijssen S, KoK J N. A Quickstart in frequent structure mining can make a difference. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowl- edge Discovery and Data Mining, Seattle, USA, 2004.549-552.
6Huan J, Wang W, Prins J, et al. SPIN: mining maximal frequent subgraphs from graph databases. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, USA, 2004. 286-295.
7Chaoji V, Hasan M A, Salem S, et al. ORIGAMI: a no- vel and effective approach for mining representative or- thogonal graph patterns. Statistical Analysis and Data Mining, 2008, 1 (2) : 67-84.
8Zhang S J, Yang J, Li S R. RING: an integrated method for frequent representative subgraph mining. In: Proceed- ings of the IEEE International Conference on Data Min- ing, Miami, USA, 2009. 1082-1087.
9Hasan M A, Zaki M J. Output space sampling for graph patterns. In: Proceedings of the 35th International Con- ference on Very Large Databases, Loyn, France, 2009. 730-741.
10Hasan M A, Zaki M J. Musk: uniform sampling of k maximal patterns. In: Proceedings of the SIAM Interna- tional Conference on Data Mining, Sparks, USA, 2009. 650-661.

1刘勇,高宏,李建中.基于联合意义度量的Top-K图模式挖掘[J].计算机学报,2010,33(2):215-230. 被引量：3
2李健,叶有培,韩牟.一种基于harris角点的抗几何攻击的数字水印算法[J].太原理工大学学报,2008,39(6):576-580. 被引量：2
3李秦,张馨东,童甲佳,李宇博.基于线性表的闭频繁项集挖掘算法[J].兰州大学学报（自然科学版）,2011,47(4):122-126.
4李明.业务代表模式在业务逻辑集成中的应用[J].微处理机,2013,34(6):39-41.

高技术通讯

2013年第4期

浏览历史

内容加载中请稍等...

MRSM:挖掘具有代表性的极大频繁子图

参考文献14

相关作者

相关机构

相关主题

浏览历史