期刊文献+

关联规则的冗余删除与聚类 被引量:15

Pruning and Clustering Discovered Association Rules
下载PDF
导出
摘要 关联规则挖掘常常会产生大量的规则,这使得用户分析和利用这些规则变得十分困难,尤其是数据库中属性高度相关时,问题更为突出.为了帮助用户做探索式分析,可以采用各种技术来有效地减少规则数量,如约束性关联规则挖掘、对规则进行聚类或泛化等技术.本文提出一种关联规则冗余删除算法ADRR和一种关联规则聚类算法ACAR.根据集合具有的性质,证明在挖掘到的关联规则中存在大量可以删除的冗余规则,从而提出了算法ADRR;算法ACAR采用一种新的用项目间的相关性来定义规则间距离的方法,结合DBSCAN算法的思想对关联规则进行聚类.最后将本文提出的算法加以实现,实验结果表明该算法是有效可行的,且具有较高的效率. A common problem in association rule mining is that a large number of rules are often generated from the databases, which makes it difficult for users to analyze and makes use of these rules. This is particularly true for data sets whose attributes are highly correlated. To facilitate exploratory analysis, the number of rules can be reduced significantly by techniques such as mining association rules with constraint items, post-pruning or clustering and summarizing rules. This paper proposed algorithms ADRR and ACAR to overcome this problem. Firstly, algorithm ADRR prunes the discovered associations by removing those redundant associations according to the property of the set, and then algorithm ACAR makes use of the correlation information of the items to measure the distances between rules, Therefore, clustering algorithm DBSCAN is applied to generate the clustering structure suitable for exploratory analysis. Finally, an experiment is conducted on a real-life database and the experimental result shows that the method is practical and effective.
出处 《小型微型计算机系统》 CSCD 北大核心 2006年第1期110-113,共4页 Journal of Chinese Computer Systems
基金 江苏省重点实验室开放基金(KJS03064)资助.
关键词 关联规则 相关性 聚类 association rules correlation clustering
  • 相关文献

参考文献13

  • 1Srikant R, Vu Q, Agrawal R. Mining association rules with item constraints[C]. In: Proc. 1997 Int. Conf. Knowledge Discovery in Databases and Data Mining, Newport Beach, California, 1997,67-73.
  • 2崔立新,苑森淼,赵春喜.约束性相联规则发现方法及算法[J].计算机学报,2000,23(2):216-220. 被引量:62
  • 3Ng R, Lakshmanan L V S, Han Jet al. Exploratory mining and pruning optimizations of constrained associations rules[C]. In:Proc. 1998 ACM-SIGMOD Int. Conf. Management of Data,Seattle, WA, 1998,13-24.
  • 4Aggrawal C, Yu P S. Online generation of association rules[C]. In: Proc. 1998 Int. Conf. Data Engineering, Orlando,FL, 1998,402-411.
  • 5Toivonen H, Klemettinen M, Ronkainen Pet al. Pruning and grouping discovered association rules[Z]. In MLnet Workshop on Statistics, Machine Learning, and Discovery in Databases,Crete, Greece, 1995, 47-52.
  • 6Liu B, Hsu W, Ma Y. Pruning and summarizing the discovered associations[C]. In: Proc. 1999 ACM-SIGKDD Int. Conf.Knowledge Discovery and Data Mining. San Diego, CA. USA,1999,125-134.
  • 7Li J, Shen H, Topor R. Mining the smallest association rule set for predictions[C]. In: Proc. 2001 IEEE Int. Conf. Data Mining. San Jose, California, USA, 2001,361-368.
  • 8Lent B, Swami A, Widom J. Clustering association rules[C].In: Proc. 1997 Int. Conf. Data Engineering, Birminghan, England, 1997,220-231.
  • 9Gupta G K, Strehl A, Ghosh J. Distance based clustering of association rules[C]. In: Proc. Intelligent Engineering Systems Through Artificial Neural Networks , St. Louis, Missouri:ASME Press, 1999, 759-764.
  • 10An A, Khan S, Huang X. Objective and subjective algorithms for grouping association rules [C]. In: Proc 2003 IEEE Int.Conf. Data Mining, Melbourne, Florida, 2003, 477-480.

二级参考文献15

  • 1Han J,Proc of the 21st International Confer-ence on Very L arge Databases,1995年,420页
  • 2E G Hetzler, W M Harris, S Harvre et al. Visualizing the full spectrum of document relationships. In: Proc of the 5th Int'l Society for Knowledge Organization Conference. Würzburg: Ergon, 1998. 168~175
  • 3P C Wong, P Whitney, J Thomas. Visualizing association rules for text mining. In: Proc of IEEE Symposium on Information Visualization(INFOVIS'99). San Francisco: IEEE Computer Society, 1999. 120~123
  • 4M Hao, M Hsu, U Dayal et al. Market basket analysis visualization on a spherical surface. HP Labs, Technical Report: HPL-2001-3, 2001
  • 5H Toivonen, M Klemettinen, P Ronkainen et al. Pruning and grouping discovered association rules. The ECML-95 Workshop on Statistics, Machine Learning, and Knowledge Discovery in Databases, Heraklion, 1995
  • 6G K Gupta, A Strehl, J Ghosh. Distance based clustering of association rules. In: Proc of ANNIE, St. Louis, Missouri: ASME Press, 1999. 759~764
  • 7M Ankerst, M Breunig, H P Kriegel et al. OPTICS: Ordering points to identify the clustering structure. In: Proc of 1999 ACM-SIGMOD Int'l Conf Management of Data (SIGMOD'99). Philadephia: ACM Press, 1999. 49~60
  • 8J Han, Y Fu. Discovery of multiple level association rules from large databases. In: Proc of the 21st Int'l Conf on Very Large Databases(VLDB'95). Zurich: Morgan Kaufmann, 1995. 420~431
  • 9R Srikant, R Agrawal. Mining generalized association rules.In:Proc of the 21st Int'l Conf on Very Large Databases(VLDB'95). Zurich: Morgan Kaufmann, 1995. 407~419
  • 10A Savasere, E Omiecinski, S Navathe. Mining for strong negative associations in a large database of customer transactions. In: Proc of the 14th Int'l Conf on Data Engineering. Orlando: IEEE Computer Society, 494~502

共引文献77

同被引文献92

引证文献15

二级引证文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部