期刊文献+

基于相似性连接的时间序列Shapelets提取 被引量:3

Time Series Shapelets Extraction via Similarity Join
下载PDF
导出
摘要 在时间序列分类问题中,以Shapelets特征为基础的分类算法具有很高的分类准确率和良好的可解释性,因此,高辨别能力Shapelets的提取已成为时间序列研究领域重要的研究热点之一.对于Shapelets提取的研究已取得了很多优秀的成果,但仍存在一些问题,主要是由于通过遍历所有子序列来获取Shapelets的方式非常耗时.尽管可以采取剪枝策略优化该过程,但往往会损失分类准确率.为此,提出一种基于相似性连接的Shapelets提取方法,该方法舍弃逐一判断子序列分类能力的策略,而是以子序列为单位,通过相似性连接的思想构建时序数据间的相似性向量.对于不同类别的时序数据,计算每一对时序数据间的差异向量,进而得到表示时序数据集中不同类别间差异的候选矩阵,然后根据候选矩阵的数值差异,快速筛选出具有高分类能力的Shapelets集合.在真实数据集上的大量实验表明:相比于现有的Shapelets提取方法,这种相似性连接方法所得到的Shapelets在分类任务中不仅具有很好的时间效率,而且能保证高分类准确率. For time series classification,the classifier built by Shapelets has high classification accuracy and,meanwhile,the classification results are easily interpretable. Therefore,the extraction of discriminative Shapelets has attracted a lot of attention in the field of time series data mining. Research on Shapelets extraction has obtained promising achievement,but there are still some problems. The main reason is that the traversal of all time series subsequences to find the discriminative Shapelets is extraordinarily time consuming. Although some pruning techniques can be applied to accelerate the extraction process,they usually reduce the classification accuracy. In this paper,we propose a novel Shapelets extraction method based on similarity join,which abandons the idea of computing each subsequence s discriminative power. In the proposed method,each subsequence is considered as a basic computing unit and the similarity vector of two time series is obtained by the similarity join calculation of their subsequences. For the time series with different class label,we compute the difference vector of each time series pair and merge them into a candidate matrix which represents the differences between different time series class. Thus,we can easily obtain the eligible Shapelets from the candidate matrix. Extensive experimental results in real time series datasets show that,compared with the exist Shapelets extraction methods,the proposed method has high time efficiency while ensuring excellent classification accuracy.
作者 张振国 王超 温延龙 袁晓洁 Zhang Zhenguo;Wang Chao;Wen Yanlong;Yuan Xiaojie(Department of Computer Science and Technology,Yanbian University,Yanji,Jilin 133002;College of Computer Science,Nankai University,Tianjin 300350;College of Cyber Science,Nankai University,Tianjin 300350)
出处 《计算机研究与发展》 EI CSCD 北大核心 2019年第3期594-610,共17页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61772289) 吉林省教育厅"十三五"科学技术项目(JJKH20191125KJ)~~
关键词 时间序列 Shapelets 相似性连接 差异向量 候选矩阵 time series Shapelets similarity join difference vector candidate matrix
  • 相关文献

参考文献3

二级参考文献57

  • 1Shao Jidong, Rong Gang, Lee Jongmin. Learning a data- dependent kernel function for KPCA based nonlinear process monitoring [J]. Chemical Engineering Research & Design, 2009, 87(11A): 1471-1480.
  • 2Zou X, Deng Z, Ge M, et al. GPS data processing of networks with mixed single-and dual-frequency receivers for deformation monitoring [J]. Advances in Space Research, 2010, 46(2): 130-135.
  • 3Barnet V, Lewis T. Outlier in Statistical Data [M]. New York: John Wiley & Sons, 1994.
  • 4Knorr E M, Ng R T. Finding intentional knowledge of distance-based outliers [C] //Proc of the 25th Int Conf on Very Large Data Bases. San Francisco: Morgan Kaufmann, 1999:211-222.
  • 5Ramaswamy S, Rastogi R, Shim K. Efficient algorithms for mining outliers from large data sets [C] //Proc of the ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2000:427-438.
  • 6Markou M, Singh S. Novelty detection: A review--part 2: neural network based approaches [J]. Signal Processing, 2003, 83(12): 2499-2521.
  • 7Mourao-Miranda J, Hardoon D R, Hahn T, et al. Patient classification as an outlier detection problem: An application of the one-class support vector machine [J]. Neuroimage, 2011, 58(3): 793-804.
  • 8Wang J S, Chiang J C. A cluster validity measure with outlier detection for support vector clusteriag [J]. IEEE Trans on Systems Man and Cybernetics, Part B-Cybernetics, 2008, 38(1): 78-89.
  • 9Percival D B, Walden A T. Wavelet Methods for Time Series Analysis [M]. Cambridge: Cambridge University Press, 2006.
  • 10Mallat S, Hwang W L. Singularity detection and processing with wavelets [J]. IEEE Trans on Information Theory, 1992, 38(2): 617-642.

共引文献47

同被引文献12

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部