期刊文献+

一种新的基于邻居样本分布特征的异常值检测算法研究 被引量:4

A New Outlier Detection Algorithm Based on Neighbor Sample Distribution
下载PDF
导出
摘要 在数据挖掘领域中,研究的重点之一为异常检测技术。针对数据挖掘中实际采集到的样本数据受众多因素的影响所出现的异常值,本文提出一种新的基于邻居样本分布特征的异常检测算法,该算法通过引用邻域的概念能够处理混合属性的数据集,并且单纯考虑一个样本的的邻域大小和样本的邻域密度不能合理的反映该样本的异常程度,该算法考虑邻域中的所有样本。仿真实验结果表明,在处理混合数据中本文提出的数据异常检测算法相较其他异常检测算法有明显的优势。 In the field of data mining, one of the key research for anomaly detection. The abnormal value appears in view of the influence of many factors of sample data in the data mining the actual audience of the collected, this paper proposes a new anomaly detection algorithm based on neighbor distribution feature, the algorithm by reference to the concept of mixed neighborhood the properties of the data set,and only consider the neighborhood density cannot be a sample of the size of the neighborhood and the samples reasonably reflect the abnormal degree of the sample, the algorithm considers all the samples in the neighborhood. The simulation results show that the anomaly detection algorithm compared to other anomaly detection algorithm has obvious advantages in processing the mixed data is put forward the data.
作者 张军 刘文杰 Zhang Jun;Liu Wenjie(Jiangsu Maritime Institute,Nanjing 211170,China;School of Computer and Software,Nanjing University of Information Science & Technology,Nanjing 210044,China)
出处 《科技通报》 北大核心 2017年第1期86-88,141,共4页 Bulletin of Science and Technology
基金 2015年江苏省现代教育技术重点研究课题(项目编号:2015-R-42639)
关键词 混合数据 异常检测 异常值 邻居样本 mixed data anomaly detection outlier neighbor sample
  • 相关文献

参考文献3

二级参考文献31

  • 1薛安荣,鞠时光,何伟华,陈伟鹤.局部离群点挖掘算法研究[J].计算机学报,2007,30(8):1455-1463. 被引量:96
  • 2Knorr E M, Ng R T, Tucakov V. Distance-based outliers: Algorithms and applications. VLDB Journal, 2000, 8 (3) : 237-253.
  • 3Ramaswamy S, Rastogi R, Shim K. Efficient algorithms for mining outliers from large data sets//Proceedings of the 2000 ACM SIGMOD International Conference. Dallas, TX, USA, 2000:427-438.
  • 4Breunig M M, Kriegel H P, Ng R T, Sander J. LOF: lden tifying density-based local outliers//Proceedings of the 2000 ACM SIGMOD International Conference. Dallas, TX, USA, 2000:93-104.
  • 5Papadimitriou S, Kitagawa H, Gibbons P B, Faloutsos C. LOCI: Fast outlier detection using the local correlation into gral//Proceedings of the 19th International Conference on Data Engineering. Bangalore, India, 2003:315-326.
  • 6Aggarwal C C, Yu P S. Outlier detection for high dimensional data//Proceedings of the 2001 ACM SIGMOI) International Conference. Santa Barbara, CA USA, 2001:37 -46.
  • 7Li X, Han J, Kim S, Gonzalez H. ROAM: Rule and motifbased anomaly detection in massive moving object data sets// Proceedings of the 7th SIAM International Conferencc on Data Mining. Minneapolis, Minnesota, 2007:296-307.
  • 8Lee J, Han J, Li X. Trajectory outlier detection: A parti tion-and-detect framework//Proceedings of the 24th Interna tional Conference on Data Engineering. Cancun, Mexico,2008:140-149.
  • 9Huttenlocher D P, Klanderman G A, Rucklidge W A. Corn paring images using the hausdorff distance. IEEE Transac tions on Pattern Analysis and Machine Intelligence, 1993 15(9) : 850-863.
  • 10Tao Yufei, Papadias Dimitris, Sun Jimeng. The TPR * tree: An optimized spatio temporal access method for predictive queries//Proeeedings of the 29th International Conference on Very Large Databases. Berlin, Germany, 2003:790 -801.

共引文献31

同被引文献24

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部