期刊文献+

一种基于频繁子树的数据库索引方法

A frequent subtree-based indexing method
下载PDF
导出
摘要 为解决带标号的有根无序树的数据库的索引问题,提出一种新的索引方法,首先挖掘频繁子树,并从中挑选出有判别力的子树作为索引属性,然后将索引属性集合中的子树转换成序列,并将索引组织成前缀树的形式.给出了在此类索引树中进行搜索的算法,并用Apriori剪枝和最大的有判别力的子树来减小搜索空间.实验结果表明:与其他基于路径的索引方法相比,这种基于频繁子树的数据库索引在索引大小和查询代价两方面都有较好的优越性. A new indexing method is proposed to solve the problem of indexing labeled rooted unordered trees. In this method, all frequent suhtrees are generated and discriminative suhtrees are selected among them as indexing features; suhtrees in the feature set into sequences are translated and held in a prefix tree. An algorithm of searching in the index tree is also proposed and there are two optimal techniques: apriori pruning and maximum discriminative subtrees, to reduce the search space. Experi- mental results show that our frequent suhtree-hased indexing method performs better and consumes less space than other path-based indexing methods.
作者 王涛
出处 《华中科技大学学报(自然科学版)》 EI CAS CSCD 北大核心 2008年第3期103-106,共4页 Journal of Huazhong University of Science and Technology(Natural Science Edition)
关键词 数据挖掘 频繁子树 数据库索引 子树搜索 索引树 data mining frequent subtree database indexing suhtree search index tree
  • 相关文献

参考文献10

  • 1Yan X, Yu P S, Han J. Graph indexing: a frequent structure based approach[C]//Proc 2004 ACM-SIG-MOD Int Conf on Management of Data (SIGMOD' 04). New York: ACM Press, 2004: 253-264.
  • 2Cheng H, Yan X, Han J. Seqlndex: indexing sequences by sequential pattern analysis[C]//Proc 2005 SIAM Int Conf on Data Mining (SDM'05). London: Springer, 2005: 316-320.
  • 3Wang C, Hong M, Pei J, et al. Efficient patterngrowth methods for frequent tree pattern mining[C] //The Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD' 04 ). Los Alamitos: IEEE; Computer Society, 2004:245-252.
  • 4Nijssen S, Kok J N. Efficient discovery of frequent unordered trees[C] // Proceedings of the First International Workshop on Mining Graphs, Trees and Sequences ( MGTS-2003 ). San Jose: AAAI Press, 2003:55-64.
  • 5Chi Y, Yang Y, Muntz R R. HybridTreeMiner: an efficient algorithm for mining frequent rooted trees and free trees using canonical forms [C] // The 16th International Conference on Scientific and Statistical Database Management (SSDBM' 04). San Francisco : Morgan Kaufmann, 2004:450-465.
  • 6Chalmers R, Almeroth K. On the topology of multicast trees: technical report[R]. New York: University of California, Santa Barbara, 2002.
  • 7Shasha D, Wang J T-L, Giugno R. Algorithms and applications of tree and graph searching [C]//Proc 21th ACM Symp Principles of Dalabase Systems (PODS'02). New York: ACM Press, 2002:39-52.
  • 8朱永泰,王晨,洪铭胜,汪卫,施伯乐.ESPM——频繁子树挖掘算法[J].计算机研究与发展,2004,41(10):1720-1727. 被引量:18
  • 9赵传申,孙志挥,张净.基于投影分支的快速频繁子树挖掘算法[J].计算机研究与发展,2006,43(3):456-462. 被引量:14
  • 10汪卫,周皓峰,袁晴晴,楼宇波,施伯乐.基于图论的频繁模式挖掘[J].计算机研究与发展,2005,42(2):230-235. 被引量:17

二级参考文献42

  • 1朱永泰,王晨,洪铭胜,汪卫,施伯乐.ESPM——频繁子树挖掘算法[J].计算机研究与发展,2004,41(10):1720-1727. 被引量:18
  • 2Rakesh Agrawal, Ramakrishnan Srikant. Fast algorithms for mining association rules in large databases. VLDB1994, Santiago,Chile, 1994.
  • 3Heikki Mannila, et al. Search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery,1997, 1(3): 241~258.
  • 4Jong Soo Park, et al. An effective Hash based algorithm for mining association rules. SIGMOD1995, San Jose, USA, 1995.
  • 5Sergey Brin, et al. Dynamic itemset counting and implication rules for market basket data. SIGMOD1997, Tucson, USA,1997.
  • 6Ramesh C. Agarwal, et al. Depth first generation of long patterns, KDD 2000, Boston, USA, 2000.
  • 7Ramesh C. Agarwal, et al. A tree projection algorithm for generation of frequent itemsets. J. of Parallel and Distributed Computing, 2001, 61(3): 350~371.
  • 8Jiawei Han, Jian Pei, Yiwen Yin. Mining frequent patterns without candidate generation. SIGMOD2000, Dallas, USA, 2000.
  • 9J. Pei, et al.. H-Mine: Hyper-structure mining of frequent patterns in large databases. ICDM'01, San Jose, CA, 2001.
  • 10Mike Perkowitz, Oren Etzioni. Adaptive sites: Automatically learning from user access patterns. WWW' 97, Santa Clara, 1997.

共引文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部