期刊文献+

基于局部偏离因子的孤立点检测算法 被引量:5

Outlier Detection Algorithm Based on Local Deviation Factor
下载PDF
导出
摘要 孤立点检测是知识发现中的一个活跃领域,如信用卡欺诈、入侵检测等。研究孤立点的异常行为能发现隐藏在数据集中更有价值的知识。该文提出基于局部偏离因子(LDF)的孤立点检测算法,利用每个数据点的LDF衡量该数据点的偏离程度。实验结果表明,该算法能有效检测孤立点,其效率高于LSC算法。 Outlier detection is a hot research field in knowledge discovery in databases, such as credit card fraud, and intrusion detection, etc. Finding the rare abnormal behaviors or the outliers can be more interesting than finding the common patterns. This paper proposes a new outlier detection algorithm based on Local Deviation Factor(LDF). This algorithm counts the number of each point's LDF to reflect its isolation degree. The experimental results show that this algorithm can efficiently detect outliers and has higher efficiency than outliers detection algorithm LSC.
作者 谭庆 张瑞玲
出处 《计算机工程》 CAS CSCD 北大核心 2008年第17期59-61,共3页 Computer Engineering
基金 河南省科技攻关基金资助项目(0524220059)
关键词 孤立点 k距离邻居 局部偏离因子 outlier k-distance neighbors local deviation factor
  • 相关文献

参考文献6

  • 1Lazarevic A, Srivastava J, Kumar V. PAKDD 2004 Tutorial: Data Mining for Analysis of RareEvents[EB/OL]. (2004-03-26). http://www.deakin.edu.au/-pakdd04/pdf/Tutorial2.pdf.
  • 2Hawkins D. Identification of Outliers[M]. London, England: Chapman and Hall, 1980.
  • 3Laurikkala J, Juhola M, Kentata E. Informal Identification of Outliers in Medical Data[C]//Proc. of the 15th Int'l Workshop on Intelligent Data Analysis in Medicine and Pharmacology. Berlin, Germany:[s. n.], 2000.
  • 4Breunig M M, Kriegel H P, Ng R T, et al. LOF: Identifying Density-based Local Outliers[C]//Proceedings of ACM SIGMOD International Conference on Management of Data. [S. l.]: ACM Press, 2000.
  • 5Data Sets from the Math Forum@Drexel University. American Colleges and Universities[EB/OL]. (2000-01-29). http://mathforum. org/workshops/sum96/data.collections/datalibrary/data.set6.html.
  • 6StatLib Datasets Archive. smoothmeth.[EB/OL]. (1996-03-13). http://stat.cmu.edu/datasets.

同被引文献29

  • 1文琪,彭宏.小波变换的离群时序数据挖掘分析[J].电子科技大学学报,2005,34(4):556-558. 被引量:7
  • 2黄添强,秦小麟,叶飞跃.基于方形邻域的离群点查找新方法[J].控制与决策,2006,21(5):541-545. 被引量:16
  • 3翁小清,沈钧毅.基于滑动窗口的多变量时间序列异常数据的挖掘[J].计算机工程,2007,33(12):102-104. 被引量:16
  • 4Q/GDW 373-2009,电力用户用电信息采集系统功能规范[S].
  • 5Hans P K,Matthias S,Arthur Z.Angle-based Outlier Detection in High-dimensional Data[C]//Proc.of KDD'08,Las Vegas,Nevada,USA:[s.n.],2008.
  • 6Christian B,Katrin H,Nikola S M,et al.CoCo:Coding Cost for Parameter-free Outlier Detection[C]//Proc.of KDD'09.Paris,France:[s.n.],2009.
  • 7Ankur A.Local Subspace Based Outlier Detection[C]//Proc.of the 2nd International Conference on Communications in Computer and Information Science.Noida,India:[s.n.],2009.
  • 8Ye Mao,Li Xue,Maria E O.Projected Outlier Detection in High-dimensional Mixed-attributes Data Set[J].Expert Systems with Applications,2009,36(3):7104-7113.
  • 9Hans P K,Peer K,Erich S,et al.Outlier Detection in Axis-parallel Subspaces of High Dimensional Data[C]//Proc.of the 13th PAKDD'09.Bangkok,Thailand:[s.n.],2009.
  • 10D.Hawkins. Identification of Outliers[M]. London: Chapman and Hall, 1980.

引证文献5

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部