期刊文献+

近似保序序列模式挖掘 被引量:1

Approximate Order-preserving Sequential Pattern Mining
下载PDF
导出
摘要 保序序列模式挖掘旨在时间序列中挖掘保序模式完全相同(最精确)的子序列,其可以用来进行疾病发展趋势预测.但只挖掘最精确的保序模式往往会遗漏一些重要信息.有些保序模式虽然不完全相同,但它们之间仍具有很高的相似性.有鉴于此,本文提出了一种近似保序序列模式挖掘算法(Approximate Order Preserving Pattern Mining:AOPM),该算法能根据输入参数值的不同而挖掘出近似程度不同的保序模式.在候选模式生成方面,AOPM算法采用了基于前后缀拼接的模式融合策略,减少了无意义候选模式的数量.在模式支持度计算方面,AOPM算法首选获取候选模式的全部候选序列,然后在进行模式匹配.本文通过在真实数据集上进行对比实验,验证了AOPM算法的完备性和高效性. Order-preserving sequence pattern mining aims to mine sub-sequences with the same(most accurate)order-preserving pattern in a time series, which can be used to predict the development trend of diseases.However, only mining the most accurate order-preserving pattern often misses some important information.Although some preserving patterns are not the same, they still have high similarities.In view of this, this paper proposes an approximate order-preserving pattern mining algorithm(Approximate Order Preserving Pattern Mining, AOPM),which can mine order-preserving patterns with different degrees of approximation according to different input parameter values.In terms of candidate pattern generation, the AOPM algorithm uses a pattern fusion strategy based on prefix and suffix splicing, which reduces the number of meaningless candidate patterns.In terms of pattern support calculation, the AOPM algorithm first obtains all candidate sequences of candidate patterns and then performs pattern matching.This paper verifies the completeness and efficiency of the AOPM algorithm through comparative experiments on real data sets.
作者 刘锦 武优西 王月华 李艳 LIU Jin;WU You-xi;WANG Yue-hua;LI Yan(School of Artificial Intelligence,Hebei University of Technology,Tianjin 300401,China;School of Economics and Management,Hebei University of Technology,Tianjin 300401,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2023年第3期490-496,共7页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61976240)资助。
关键词 模式挖掘 时间序列 保序序列 (δ-γ)距离 模式匹配 pattern mining order-preserving sequence (δ-γ)distance pattern fusion pattern match
  • 相关文献

参考文献5

二级参考文献66

  • 1魏伟杰,张明卫,张斌,王波.基于最小加权支持的加权序列模式挖掘算法[J].吉林大学学报(工学版),2008,38(S2):178-183. 被引量:2
  • 2庄军,李峰,叶世清,刘侃,何皎.HIS系统中药物医嘱序列模式挖掘[J].医疗卫生装备,2007,28(7):34-36. 被引量:2
  • 3Agrawal R, Faloutsos C, Swami A. Efficient similarity search in sequence databases[C]//Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms (FODO 1993). 1993:69-84.
  • 4Azzouzi M, Nabney I T. Analysing time series structure with Hidden Markov Models[C]//Proceedings of the IEEE Confe- rence on Neural Networks and Signal Processing. 1998:402-408.
  • 5Bagnall A, Janacek G J, Powell M. A likelihood ratio distance measure for the similarity between the fourier transform of time series[C]//Proceedings of the Advances in Knowledge Disco- very and Data Mining, 9th Pacific-Asia Conference (PAKDD2005). 2005:737 743.
  • 6Bagnall A, Davis I., Hills J, et al. Transformation based ensem- bles for time series elassification[C]//Proeeedings of the 2012 SIAM International Conference on Data Mining (SDM 2012). 2012:307 318.
  • 7Balakrishnan S, Madigan D. Decision trees for functional varia- bles[C] // Proceedings of the 2006 International Conference on Data Mining (ICDM 2006). 2006:798 802.
  • 8Batista G, Wang X, Keogh E. A complexity invariant distance measure for time series[C]//Proeeedings of the eleventh SIAM conference on data mining (SDM 2011 ). 2011 : 699-710.
  • 9Berndt D J,Clifford J. Using dynamic time warping to find pat terns in time series[C]//KDD Workshop. 1994 : 359 370.
  • 10Buza K. Fusion methods for time-series classification[D]. Uni- versity of Hildesheim,Germany, 2011.

共引文献154

同被引文献3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部