期刊文献+

改进的PrefixSpan算法及其在序列模式挖掘中的应用 被引量:11

Improved Prefixspan Algorithm and Its Application in Sequential Pattern Mining
下载PDF
导出
摘要 由于序列模式挖掘需要花费大量计算时间,并需要占用大量存储空间.减少计算量、节省存储空间开销成为序列模式挖掘的关键.因PrefixSpan算法不产生候选,而适当应用Bitmap数据结构可避免重复扫描数据库,基于此,本文提出了BM-PrefixSpan算法,用于序列模式挖掘,设计并构造了PFPBM(Prefix of First Position on BitMap)表用于记录序列中的每个项在位图中第1次出现的位置.实验结果表明,BM-PrefixSpan算法综合了PrefixSpan和SPAM算法的优点,能够更快、更好地挖掘出序列模式. Because sequential pattern mining needs a lot of computing time and storage space, how to re duce the amount of calculation and storage space becomes the key of the sequential pattern mining algo rithm. Combining the PrefixSpan algorithm with Bitmap data structure, this text proposes an improved se quential pattern mining algorithm BM-PrefixSpan. The PFPBM ( Prefix of First Position on BitMap) table was designed and implemented. When a new item appeared in a sequence, it was recorded in the PFPBM table. The experimental results show that the BM-prefixspan algorithm mines sequential patterns faster and better than others.
出处 《广东工业大学学报》 CAS 2013年第4期49-54,共6页 Journal of Guangdong University of Technology
基金 教育部重点实验室基金资助项目(110411) 广东省自然科学基金资助项目(10451009001004804 9151009001000007) 广东省科技计划项目(2012B091000173) 广州市科技计划项目(2012J5100054 2013J4500028) 韶关市科技计划项目(2010CXY/C05)
关键词 序列模式 前缀投影序列模式挖掘 序列模式挖掘 位图 数据挖掘 sequence pattern PrefixSpan ( Prefix-projected Sequential Pattern Mining) SPAM ( Se-quence Pattern Mining) bitmap data mining
  • 相关文献

参考文献13

  • 1Han J,Kamber M.数据挖掘概念与技术[M].3版.北京:机械工业出版社,2012.
  • 2Agrawal R, Srikant R. Mining sequential patterns [C] . Proceedings of the n" International Conference on Data Engineering. CA: Los Alamitos, IEEE Computer Society, 1995: 3-14.
  • 3Srikant R, Agrawal R. Mining sequential patterns: generalizations and performance improvements [C]. EDBT 96: Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology. UK, London: Springer-Verlag, 1996: 3 -17 .
  • 4Zaki M . SPADE: An Efficient Algorithm for Mining Frequent Sequences [J]. Macheine Learning, 2001 , 42 ( 1 ) : 31-60.
  • 5Han J, Pei J, Mortazavi-Asl B, et al. Freeopan . Frequent pattern-projected sequential pattern mining [C]. Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. USA, New York: ACM, 2000: 355-359.
  • 6Pei J, Han J, Mortazavi-Asl B, et al. Mining sequential patterns hy pattern-growth: the prefix span approach [J] . IEEE Transactions On Knowledge and Data Engineering, 2004,16(11) : 1424-1440.
  • 7Ayres J, Gehrke J, Yiu T, et al. Sequential pattern mining using a bitmap representation [C]. Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. USA, New York: ACM, 2002: 429-435.
  • 8王观君.基于位图的闭序列模式挖掘[D].郑州:河南大学计算机与信息工程学院,2008.
  • 9Peterson E, Tang P. Mining frequent sequential patterns with first-occurrence forests [C]. ACM-SE 46 Proceedings of the 46^th Annual Southeast Regional Conference on XX, USA, New York: ACM, 2008:34-39.
  • 10公伟,刘培玉,贾娴.基于改进PrefixSpan的序列模式挖掘算法[J].计算机应用,2011,31(9):2405-2407. 被引量:12

二级参考文献90

共引文献56

同被引文献104

引证文献11

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部