
Efficient Incremental Mining Algorithm of Sequential Patterns (一种高效的增量式序列模式挖掘算法)

Cited by: 3
Abstract: Existing incremental mining algorithms must re-mine the sequence database whenever the support threshold changes. To reduce the time and space this repeated mining consumes, this paper proposes an efficient incremental algorithm for mining sequential patterns. The algorithm uses a frequent sequence tree as its storage structure for sequences. When the sequence database or the minimum support changes, an update operation brings the frequent sequence tree up to date, and a depth-first traversal of the tree then finds all sequential patterns in the database. Experimental results show that the algorithm mines more efficiently than IncSpan and PrefixSpan.
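Author: Liu Jiaxin (刘佳新)
Affiliation: Yanshan University Library (燕山大学图书馆)
Source: Computer Engineering (《计算机工程》), indexed in CAS and CSCD, 2012, No. 12, pp. 39-41 (3 pages)
Funding: National Natural Science Foundation of China (61170190); Qinhuangdao Science and Technology Research and Development Program (201001A018)
Keywords: data mining; incremental mining; sequential pattern; projected database; frequent sequence tree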
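
The idea in the abstract (keep a tree of sequences with their supports, update it when the database grows, and answer a changed minimum support by traversing it again depth-first) can be illustrated with a toy sketch. This is not the paper's frequent sequence tree, whose construction is not described in this record. It assumes each sequence is a list of single items, that the database only grows by appending whole sequences, and that the tree stores every subsequence that occurs, which is exponential in sequence length and only suitable for tiny examples. The names `Node`, `SequenceTree`, `add_sequence`, and `patterns` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One tree node; the path from the root to it spells a sequence pattern."""
    support: int = 0                      # number of database sequences containing the pattern
    children: dict = field(default_factory=dict)


class SequenceTree:
    """Toy tree of all occurring subsequences (assumption: single-item elements)."""

    def __init__(self):
        self.root = Node()
        self.size = 0                     # number of sequences inserted so far

    def add_sequence(self, seq):
        """Incremental update: count every distinct subsequence of `seq` once."""
        self.size += 1
        self._extend(self.root, seq, 0)

    def _extend(self, node, seq, start):
        # Using the first occurrence of each item at or after `start` follows the
        # leftmost embedding, so each distinct pattern is visited once per sequence.
        first_pos = {}
        for pos in range(start, len(seq)):
            first_pos.setdefault(seq[pos], pos)
        for item, pos in first_pos.items():
            child = node.children.setdefault(item, Node())
            child.support += 1
            self._extend(child, seq, pos + 1)

    def patterns(self, min_support):
        """Depth-first traversal returning {pattern: support} for support >= min_support.

        `min_support` is an absolute count. A child's support never exceeds its
        parent's, so an infrequent node's whole subtree can be skipped.
        """
        found = {}

        def dfs(node, prefix):
            for item, child in sorted(node.children.items()):
                if child.support >= min_support:
                    pattern = prefix + (item,)
                    found[pattern] = child.support
                    dfs(child, pattern)

        dfs(self.root, ())
        return found


tree = SequenceTree()
for s in (["a", "b", "c"], ["a", "c"], ["b", "c"]):
    tree.add_sequence(s)
before = tree.patterns(2)         # patterns at the initial threshold
tree.add_sequence(["a", "b"])     # database grows: only the tree is updated
after = tree.patterns(2)          # same threshold, updated database
stricter = tree.patterns(3)       # threshold changes: traverse again, no re-mining
```

In this sketch a changed threshold costs only a traversal and an appended sequence only touches the paths it contributes to; bounding the tree's size would take additional machinery that the sketch does not attempt.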


