期刊文献+

两阶段的多元时间序列异常检测算法 被引量:5

Two-stage outlier detection in multivariate time series
下载PDF
导出
摘要 提出了一个两阶段的多元时间序列异常检测算法。该算法通过有界坐标系统(BCS)技术计算多元时间序列样本之间的相似性,采用基于距离的方法实现异常检测。算法第一阶段采用K-means算法对数据进行聚类,并按照一个启发式规则对其进行排序;第二阶段在聚类结果上采用循环嵌套算法进行异常检测,并通过两个剪枝规则进行高效剪枝,提高了算法的效率。在两个实际数据集上进行实验,实验结果验证了算法的有效性。 This paper proposed an efficient two-stage algorithm for detecting outliers in multivariate time series (MTS) data- sets. Used the bounded coordinate system (BCS) metric to measure the similarity between two MTS samples, and measured the outlierness of a sample by average distance to its k-nearest neighbors. It partitioned the data into clusters, and used nested loop algorithm to find top-n outliers. Utilized a heuristic and two pruning rules to quickly remove MTS samples that were not possible outlier candidates, reducing significantly the distance computation among objects. Experiments on real-world datasets show the effectiveness of the proposed algorithm.
作者 王欣
出处 《计算机应用研究》 CSCD 北大核心 2011年第7期2466-2469,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(60879022 60832012) 中国民用航空局科技项目(MHRD200801)
关键词 多元时间序列 有界坐标系统 基于距离的异常检测 multivariate time series bounded coordinate system distance-based outlier detection
  • 相关文献

参考文献11

  • 1HODGE V J, AUSTIN J. A survey of outlier detection methodologies [ J]. Artificial Intelligence Review,2004,22 (2) :85-126.
  • 2YANG K, SHAHABI C. A PCA-based similarity measure for multivariate time series[ C ]//Proc of the 2nd ACM International Workshop on Multimedia Databases. New York : ACM Press ,2004:65-74.
  • 3孔成安,李文华,尹湛.利用QAR数据实施飞机性能监控[J].中国民用航空,2008(10):54-54. 被引量:19
  • 4ANGIULLI F, PIZZUTI C. Outlier mining in large high-dimensional data sets[ J]. IEEE Trans on Knowledge and Data Engineering, 2005,17(2) :203-215.
  • 5RAMASWAMY S, RASTOGI R, SHIM K. Efficient algorithms for mining outliers from large data sets[ C ]//Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press ,2000:427-438.
  • 6ZHANG Xiao-lei, BEGLEITER H, POPdESZ B, et al. Event related potentials during object recognition tasks[ J]. Brain Research Bulletin, 1995,38 (6) :531-538.
  • 7SINGHAL A, SEBORG D E. Pattern matching in historical batch data using PCA[ J]. IEEE Control Systems Magazine,2002,22 (5) : 53-63.
  • 8SHEN H T, ZHOU X, HUANG Z, et al. Statistical summarization of content features for fast near-duplicate video detection [ C ]//Proc of the 15th ACM International Conference on Multimedia. New York: ACM Press,2007 : 164-165.
  • 9JIANG M F,TSENG S S,SU C M. Two-phase clustering process for outlier detection [ J ]. Pattern Recognition Letters, 2001,22 ( 6-7 ) : 691-700.
  • 10VU N H, GOPALKRISHNAN V. Efficient pruning schemes for distance-based outlier detection [ C ]//Proc of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. London : Springer-~erlag,2009:160-175.

共引文献18

同被引文献95

引证文献5

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部