期刊文献+

基于高维数据流的异常检测算法 被引量:22

Anomaly Detection Algorithm Based on High-dimensional Data Stream
下载PDF
导出
摘要 传统基于欧氏距离的异常检测算法在高维数据检测中存在精度无法保证以及运行时间过长的问题。为此,结合高维数据流的特点运用角度方差的方法,提出一种改进的基于角度方差的数据流异常检测算法。通过构建最佳数据集网格和最近数据网格的小规模数据流计算集,以快速即时地衡量最新数据点的异常程度,将改进的算法用于无线传感器网络采集的电梯真实数据流检测,实现电梯故障检测。实验结果表明,与ABOD、HODA等算法相比,改进算法能有效识别高维数据流中的异常点,可适用于实时性要求高的传感器高维数据流。 The traditional outlier detection algorithm based on Euclidean distance can not guarantee the accuracy and the running time is too long in high-dimensional data detection. Based on the characteristics of high-dimensional data flow, an improved outlier detection algorithm based on angular variance is proposed by using the method of angle variance. The optimal data set grid and the nearest data grid are constructed to calculate the small scale data flow to measure the abnormal degree of the latest data points, and the improved algorithm is used to the elevator real data flow detection in wireless sensor network acquisition to achieve elevator fault detection. Experimental results show that compared with ABOD and HODA algorithms, the improved algorithm can effectively identify abnormal points in high-dimensional data streams and can be applied to high-dimensional data streams with high real-time requirements.
出处 《计算机工程》 CAS CSCD 北大核心 2018年第1期51-55,共5页 Computer Engineering
基金 国家自然科学基金(61201212 61272449)
关键词 数据挖掘 高维数据流 异常检测 海量数据 角度方差 data mining high-dimensional data stream anomaly detection massive data angle variance
  • 相关文献

参考文献7

二级参考文献70

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2崔莉,鞠海玲,苗勇,李天璞,刘巍,赵泽.无线传感器网络研究进展[J].计算机研究与发展,2005,42(1):163-174. 被引量:730
  • 3Bolton R J,Hand D J.Statistical fraud detection:A review (with discussion)[J].Statistical Science,2002,17(3):235-255.
  • 4Patcha A,Park J -M.An overview of anomaly detection techniques:Existing solutions and latest technological trends[J].Computer Networks,2007,51(12):3448-3470.
  • 5Eskin E,Arnold A,Prerau M,et al.A geometric framework for unsupervised anomaly detection:Detecting intrusions in unlabeled data[G].Data Mining for Security Applications.Norvell:Kluwer,2002:77-99.
  • 6Lane T,Brodley C E.Temporal sequence learning and data reduction for anomaly detection[J].ACM Trans on Information and System Security,1999,2(3):295-331.
  • 7Li X,Han J.Mining approximate top-k subspace anomalies in multidimensional time-series data[C] //Proc of the 33rd Int Conf on Very Large Data Bases.New York:ACM,2007:447-458.
  • 8Aggarwal R,Gehrke J,Gunopulos D,et al.Automatic subspace clustering of high dimensional data for data mining applications[C] //Proc of ACM SIGMOD Conf on Management of Data.New York:ACM,1998:94-105.
  • 9Barnett V,Lewis T.Outliers in Statistical Data[M].New York:John Wiley and Sons,1994.
  • 10Preparata F,Shamos M.Computational Geometry:An Introduction[M].Berlin:Springer,1985.

共引文献99

同被引文献192

引证文献22

二级引证文献66

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部