期刊文献+

基于项位置索引的闭合连续序列模式挖掘算法

Closed continuous sequence pattern mining algorithm based onitem position index
下载PDF
导出
摘要 为解决现有的闭合序列挖掘算法挖掘大量低效和冗余的序列模式导致的挖掘效率低下,利用频繁项的位置索引,通过索引位置向后扩展频繁连续局部候选项来增长候选序列,采用向前与向后扫描两种策略检查当前候选序列的闭合性,提出了一种具有连续约束的基于频繁项位置索引的闭合连续序列模式挖掘算法LCCS(located closed contiguous sequence data mining)。实验结果表明,LCCS算法通过直接定位频繁项的位置降低数据库的扫描次数,提高了算法挖掘效率,并且得到具有连续约束的闭合序列,有效地解决了产生大量冗余模式的缺陷。 In order to solve the low efficiency problem of mining a large number of inefficient and redundant sequence patterns caused by the existing closed sequence mining algorithms,this paper uses the location index of frequent items to grow candidate sequences by extending the index location backward to frequent continuous local candidates,uses forward and backward scanning strategies to check the closure of current candidate sequences,and proposes a continuous constrained Located Closed Continuous Sequence Data Mining(LCCS)algorithm based on the frequent item position index.The experimental results show that the LCCS algorithm reduces the number of database scans by directly locating the frequent items,improves the efficiency of algorithm mining,and obtains a closed sequence with continuous constraints,which effectively solves the problem of generating a large number of redundant patterns.
作者 矫春兰 刘建宾 JIAO Chunlan;LIU Jianbin(Computer School,Beijing Information Science&Technology University,Beijing 100101,China;Software Engineering Research Center,Beijing Information Science&Technology University,Beijing 100101,China)
出处 《北京信息科技大学学报(自然科学版)》 2020年第6期61-65,70,共6页 Journal of Beijing Information Science and Technology University
基金 “十二五”国家科技支撑计划课题(2014BAH25F03) 北京信息科技大学科研水平提高项目(5211823406)。
关键词 闭合序列 连续约束 项位置索引 序列模式 closed sequence continuous constraint item position index sequence patterns
  • 相关文献

参考文献2

二级参考文献21

  • 1Ayres J,Flannick J,Gehrke J.Sequential pattern mining using a bitmap representation[J].Knowledge Discovery and Data Mining,2002,12(6):429-435.
  • 2Pei J,Han J,Mortazavi B.Prefixspan:mining sequential patterns efficiently by prefix-projected pattern growth[J].Data Engineering,2001,8(4):215-224.
  • 3Han J,Pei J,Mortazavi B.Freespan:frequent pattern-projected sequential pattern mining[J].Knowledge Discovery and Data Mining,2000,14(8):355-359.
  • 4Han J,Pei J,Yin Y.Mining frequent patterns without candidate generation[J].Management of Data,2000,13(5):1-12.
  • 5Zaki M J.SPADE:An efficient algorithm for mining frequent sequences[J].Machine Learning,2001,11(5):31-60.
  • 6Leleu M,Rigotti C,Boulicaut J F.GO-SPADE:Mining sequential patterns over datasets with consecutive repetitions[J].Machine Learning and Data Mining,2003,18(7):293-306.
  • 7Pasquier N,Bastide Y,Taouil R.Discovering frequent closed itemsets for association rules[J].Database Theory,1999,6(1):398-416.
  • 8Yan X,Han J,Afshar R.CloSpan:mining closed sequential patterns in large datasets[J].Data Mining,2003,16(5):40-45.
  • 9Brudick D,Calimlim M,Gehrke J.MAFIA:A maximal frequential itemset algorithm for transactional database[J].Data Engineering,2001,9(4):443-452.
  • 10Pei J,Han J,Mao R.CLOSET:An efficient algorithm for mining frequential closed itemsets[J].Knowledge Discovery,2000,10(5):11-20.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部