期刊文献+

Analyzing Sequential Patterns in Retail Databases

Analyzing Sequential Patterns in Retail Databases
原文传递
导出
摘要 Finding correlated sequential patterns in large sequence databases is one of the essential tasks in data mining since a huge number of sequential patterns are usually mined, but it is hard to find sequential patterns with the correlation. According to the requirement of real applications, the needed data analysis should be different. In previous mining approaches, after mining the sequential patterns, sequential patterns with the weak affinity are found even with a high minimum support. In this paper, a new framework is suggested for mining weighted support affinity patterns in which an objective measure, sequential ws-confidence is developed to detect correlated sequential patterns with weighted support affinity patterns. To efficiently prune the weak affinity patterns, it is proved that ws-confidence measure satisfies the anti-monotone and cross weighted support properties which can be applied to eliminate sequential patterns with dissimilar weighted support levels. Based on the framework, a weighted support affinity pattern mining algorithm (WSMiner) is suggested. The performance study shows that WSMiner is efficient and scalable for mining weighted support affinity patterns. Finding correlated sequential patterns in large sequence databases is one of the essential tasks in data mining since a huge number of sequential patterns are usually mined, but it is hard to find sequential patterns with the correlation. According to the requirement of real applications, the needed data analysis should be different. In previous mining approaches, after mining the sequential patterns, sequential patterns with the weak affinity are found even with a high minimum support. In this paper, a new framework is suggested for mining weighted support affinity patterns in which an objective measure, sequential ws-confidence is developed to detect correlated sequential patterns with weighted support affinity patterns. To efficiently prune the weak affinity patterns, it is proved that ws-confidence measure satisfies the anti-monotone and cross weighted support properties which can be applied to eliminate sequential patterns with dissimilar weighted support levels. Based on the framework, a weighted support affinity pattern mining algorithm (WSMiner) is suggested. The performance study shows that WSMiner is efficient and scalable for mining weighted support affinity patterns.
作者 Unil Yun
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2007年第2期287-296,共10页 计算机科学技术学报(英文版)
关键词 data mining sequential pattern mining sequential ws-confidence weighted support affinity data mining, sequential pattern mining, sequential ws-confidence, weighted support affinity
  • 相关文献

参考文献24

  • 1Ester M. A top-down method for mining most specific frequent patterns in biological sequence data. In Proc. the 4th SIAM Int. Conf. Data Mining, Lake Buena Vista, Florida, USA,April 22-24, 2004, pp.91-101.
  • 2Wang K, Xu Y, Yu J X. Scalable sequential pattern mining for biological sequences. In Proc. the 2003 ACM CIKM Int.Conf. Information and Knowledge Management, Washington DC, USA, November 8-13, 2004, pp.178-187.
  • 3Cheng H, Yan X, Han J. IncSpan: Incremental mining of sequential patterns in large databases. In Proc. the lOth ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining,Seattle, USA, August 22-25, 2004, pp.527-532.
  • 4Chung H, Yan X, Han J. Seqlndex: Indexing sequences by sequential pattern analysis. In Proc. the 5th SIAM Int.Conf. Data Mining, Newport Beach, USA, April 21-23, 2005,pp.601-605.
  • 5Pinto H, Han J, Pei J, Wang K. Multi-dimensional sequence pattern mining. In Proc. the 2001 A CM CIKM Int. Conf.Information and Knowledge Management, Atlanta, USA,November 5-10, 2001, pp.81-88.
  • 6Kum H C, Pei J, Wang W, Duncan D. ApproxMAP: Approximate mining of consensus sequential patterns. In Proc. the 3rd SIAM Int. Conf. Data Mining, San Francisco, USA, May 1-3, 2003, pp.311-315.
  • 7Yang J, Yu P S, Wang W, Han J. Mining long sequential patterns in a noisy environment. In Proc. the 2002 ACM SIGMOD Int. Conf. Management of Data, Madison, USA, June3-6, 2002, pp.406-417.
  • 8Garofalakis M, Rastogi R, Shim K. SPIRIT: Sequential pattern mining with regular expression constraints. In Proc. 25th Int. Conf. Very Large Data Bases, September 7-10, 1999,Edinburgh, UK, pp.223-234.
  • 9Lorincz H A, Boulicaut J F. Mining frequent sequential patterns under regular expressions: A highly adaptive strategy for pushing constraints. In Proc. the 3rd SIAM Int. Conf. Data Mining, San Francisco, USA, May 1-3, 2003, pp.316 320.
  • 10Pei J, Han J, Wang W. Mining sequential patterns with constraints in large databases. In Proc. the 2002 ACM CIKM Int.Conf. Information and Knowledge Management, McLean,USA, November 4-9, 2002, pp.18-25.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部