期刊文献+

序列模式挖掘算法综述 被引量:5

Research on sequential pattern mining algorithms
下载PDF
导出
摘要 目前的主要序列模式挖掘算法可以分为3类:①基于Apriori的候选码生成-测试的方法;②基于垂直格式的候选码生成-测试的方法;③基于模式增长的方法.在介绍序列模式挖掘基本概念的基础上,描述了典型的挖掘算法,着重分析第②类序列模式挖掘算法的关键技术,并对各种算法进行详细的分析与比较,总结出它们的优缺点:前两类方法因产生巨大的候选序列而致挖掘代价剧增,而第③类模式增长方法避免了候选序列的产生,但挖掘长模式效率低. Recently sequential pattern mining algorithms can be divided into three classes: a candidate generation-and-test approach based on Apriori, a candidate generation-and-test approach based on vertical format, a pattern-growth method. On the foundation of introduction of the basic concept of sequential pattern mining, this paper describes classical algorithms, place emphasis on analyzing pivotal technique of the second class algorithm, then makes a comparison and analysis among these algorithms and finally summarizes pros and cons of the algorithms: the first two methods generate too much candidates leading to high cost, while the method of pattern-growth avoids candidate. However, it is inefficient when mining long sequences.
出处 《扬州大学学报(自然科学版)》 CAS CSCD 2007年第1期41-46,共6页 Journal of Yangzhou University:Natural Science Edition
基金 国家自然科学基金资助项目(60673060) 国家科技基础条件平台项目(2004DKA20310) 江苏省自然科学基金资助项目(BK2005047) 江苏省高校"青蓝工程"优秀青年骨干教师基金资助项目 扬州大学"新世纪人才工程"优秀青年骨干教师基金资助项目
关键词 序列模式挖掘 候选码生成-测试 数据分布 模式增长 sequential pattern mining candidate generation-test data distribution pattern-grow
  • 相关文献

参考文献4

二级参考文献73

  • 1R.Agrawal,R.Srikant.Mining sequential patterns.ICDE 1995,Taipei,Taiwan,1995.
  • 2R.Srikant,R.Agrawal.Mining sequential patterns:Generalizations and performance improvements.EDBT 1996,Avignon,France,1996.
  • 3J.Pei,J.Han,B.Mortazavi-Asl,et al.PrefixSpan mining sequential patterns efficiently by prefix projected pattern growth.ICDE 2001,Heidelberg,Germany,2001.
  • 4M.Garofalakis,R.Rastogi,K.Shim.SPIRIT:Sequential pattern mining with regular expression constraints.In:Proc.VLDB 1999,San Francisco:Morgan Kaufmann.,1999.223~234.
  • 5C.Bettini,X.S.Wang,S.Jajodia.Mining temporal relationships with multiple granularities in time sequences.Data Engineering Bulletin,1998,21 (1):32 ~ 38.
  • 6J.Han,G.Dong,Y.Yin.Efficient mining of partial periodic patterns in time series database.ICDE 1999,Sydney,Australia,1999.
  • 7H.Mannila,H.Toivonen,A.I.Verkamo.Discovering frequent episodes in sequences.KDD 1995,Montreal,Quebec,Canada,1995.
  • 8R.Agrawal,R.Srikant.Fast algorithms for mining association rules in large databases.The 20th Int'l Conf.Very Large Databases,Santiago,Chile,1994.
  • 9M.J.Zaki.Spade:An efficient algorithm for mining frequent sequences.Machine Learning,2001,42(1/2):31~60.
  • 10J.Ayres,J.Flannick,J.Gehrke,et al.Sequential pattern mining using a bitmap representation.SIGKDD,Edmanton,Alberta,Canada,2002.

共引文献46

同被引文献34

  • 1程政,雷霞,廖翔,马一凯,柏晓丽.数据挖掘在电网安全性评价中的应用[J].电气技术,2010,11(8):97-99. 被引量:4
  • 2陆介平,杨明,孙志挥,鞠时光.快速挖掘全局最大频繁项目集[J].软件学报,2005,16(4):553-560. 被引量:27
  • 3杜耀华,李冬冬,王正志.判据搜索算法及其在DNA序列模式发现中的应用(英文)[J].系统仿真学报,2006,18(5):1169-1177. 被引量:2
  • 4李川川,刘衍珩,田大新.基于序列模式的网络入侵检测系统[J].吉林大学学报(工学版),2007,37(1):121-125. 被引量:7
  • 5Park J S, Psy U. An efficient parallel data mining for association rules [ C ]//Proc of the 4th on Information and Knowledge Management. New York: ACM Press, 1995 : 31 - 36.
  • 6Cheung D W, Hart J, Ng V T, et al. A fast distributed algorithm for mining association rules [ C ]//Proc of the 4th International Conference on Parallel and Distributed Information Systems. Los Alamitos, USA:IEEE Computer Society Press, 1996 : 31 - 44.
  • 7Zaki M. Spade: an efficient algorithm for mining frequent sequences [ J]. Machine Learning, 2001, 41 (2) : 31 -60.
  • 8Pei J, Han J, Pinto H, et al. PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth [ J ]. IEEE Transactions on Knowledge & Data Engineering, 2004,16( 1 ) : 1424 - 1440.
  • 9Zhang Changhai, Hu Kongfa, Liu Haidong, et al. FMGSP: an efficient method of mining global sequential patterns[ C ]//Proc of the 4th International Conference on Fuzzy Systems and Knowledge Discovery. Los Alamitos : IEEE Computer Society, 2007 : 761 - 765.
  • 10Srikant R, Agrawal R. Mining sequential patterns: generalizations and performance improvements [ C ]// Proc of 5th International Conference on Extending Database Technology. Heidelberg : Springer, 1996 : 3 - 17.

引证文献5

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部