
An incremental mining algorithm for sequential patterns based on regular expression constraints

Cited by: 1
Abstract: Sequential pattern mining is one of the active research topics in data mining, and user participation in the mining process is becoming increasingly important. To improve the interactivity of the mining process, this paper proposes RE_IncUp, an incremental sequential pattern mining algorithm based on regular expression constraints. The algorithm first uses the constraint to pre-process the frequent sequential patterns that have already been mined, which narrows the search space. It then uses pattern extension to integrate the regular expression constraint with the incremental mining process, and it prunes candidates before counting their support, which narrows the search space further and reduces the amount of support counting. The algorithm lets users change the constraints repeatedly, so mining can be interactive and can focus only on the patterns the user is interested in. Experiments show that the algorithm is effective both for maintaining sequential patterns and for meeting users' requirements.
Source: Journal of Yanshan University (燕山大学学报), CAS, 2007, No. 5, pp. 402-409 (8 pages)
Funding: Doctoral Fund of Hebei Province (No. B200322)
Keywords: data mining; sequential pattern; incremental mining; regular expression
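
The abstract names two ideas: filtering patterns with a regular expression constraint, and pruning candidates before counting their support. The Python sketch below illustrates only those two ideas and is not the RE_IncUp algorithm itself; it does not perform incremental updates or pattern extension. It assumes each item is a single character, so a pattern can be written as a string and checked with a full regex match. All function names here are hypothetical.

```python
import re
from typing import Dict, Iterable, Sequence, Tuple

Pattern = Tuple[str, ...]


def matches_constraint(pattern: Sequence[str], constraint: re.Pattern) -> bool:
    """Return True if the pattern, joined into a string, fully matches the regex.

    Assumption: every item is a single character.
    """
    return constraint.fullmatch("".join(pattern)) is not None


def is_subsequence(pattern: Sequence[str], sequence: Iterable[str]) -> bool:
    """Return True if the pattern's items occur in the sequence in order (gaps allowed)."""
    it = iter(sequence)
    # "item in it" consumes the iterator up to the first hit, preserving order.
    return all(item in it for item in pattern)


def support(pattern: Sequence[str], database: Iterable[Iterable[str]]) -> int:
    """Count the data sequences that contain the pattern as a subsequence."""
    return sum(1 for seq in database if is_subsequence(pattern, seq))


def prune_then_count(
    candidates: Iterable[Pattern],
    constraint: re.Pattern,
    database: Sequence[Iterable[str]],
    min_support: int,
) -> Dict[Pattern, int]:
    """Discard candidates that violate the constraint, then count support for the rest."""
    frequent: Dict[Pattern, int] = {}
    for cand in candidates:
        if not matches_constraint(cand, constraint):
            continue  # pruned without scanning the database
        count = support(cand, database)
        if count >= min_support:
            frequent[cand] = count
    return frequent
```

Because the constraint check touches only the candidate, every pruned candidate saves a full pass over the database. Changing the constraint interactively means re-running the cheap filter first, which is the kind of saving the abstract attributes to pruning before counting.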