
Outlier Detection Algorithm Based on Neighborhood and Density (cited by: 12)
Abstract: Density-based outlier detection algorithms perform a large number of neighborhood queries. ODBSN (Outlier Detection Based on Square Neighborhood) has two further shortcomings: it may lose meaningful outliers, and it may judge outliers incorrectly when objects from a sparse cluster lie close to a dense cluster. To reduce the number of neighborhood queries and to avoid these shortcomings of ODBSN, a neighborhood and density based outlier detection algorithm named NDOD (Neighborhood and Density based Outlier Detection) is proposed. Borrowing the idea of grid-based methods, NDOD expands square neighborhoods in breadth-first order, which reduces the number of neighborhood queries severalfold, quickly eliminates cluster points, and overcomes the "curse of dimensionality" of grid-based methods. A newly introduced neighborhood-based local outlier factor represents the degree to which each candidate outlier is an outlier and is used to refine the candidates, which avoids the defects of ODBSN and finds more meaningful outliers. Tests on large-scale two-dimensional spatial data sets of arbitrary shape show that the algorithm is feasible and effective.
Source: Journal of Jilin University (Information Science Edition) (《吉林大学学报(信息科学版)》), CAS, 2008, No. 4, pp. 398-403 (6 pages)
Funding: National High Technology Research and Development Program of China (863 Program), grant 2007AA01Z404
Keywords: data mining; outlier; square neighborhood; density; local outlier factor
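
The two stages named in the abstract (breadth-first expansion of square neighborhoods to exclude cluster points, then scoring of the remaining candidates) can be illustrated with a minimal Python sketch. The abstract gives no definitions, so everything here is an assumption made for illustration: the function names, the parameters `r`, `min_pts` and `k`, the density test, and in particular `stand_in_score`, which is a simple density ratio and not the paper's neighborhood-based local outlier factor. The sketch also queries every reached point once and does not attempt NDOD's query-saving strategy.

```python
from collections import deque

def square_neighbors(points, i, r):
    """Indices of points inside the axis-aligned square of half-side r centred on points[i]."""
    px, py = points[i]
    return [j for j, (x, y) in enumerate(points)
            if abs(x - px) <= r and abs(y - py) <= r]

def candidate_outliers(points, r, min_pts):
    """Expand dense square neighborhoods breadth-first; return indices never marked as cluster points."""
    in_cluster = [False] * len(points)
    queried = [False] * len(points)
    for seed in range(len(points)):
        if queried[seed]:
            continue
        queue = deque([seed])
        while queue:
            i = queue.popleft()
            if queried[i]:
                continue
            queried[i] = True
            nbrs = square_neighbors(points, i, r)
            if len(nbrs) < min_pts:
                continue  # sparse neighborhood: do not expand from this point
            for j in nbrs:
                in_cluster[j] = True  # everything in a dense square is a cluster point
                if not queried[j]:
                    queue.append(j)
    return [i for i, flag in enumerate(in_cluster) if not flag]

def stand_in_score(points, i, r, k):
    """Hypothetical score (NOT the paper's factor): mean neighbor density / own density."""
    own = len(square_neighbors(points, i, r))  # includes i itself, so own >= 1
    px, py = points[i]
    others = sorted(
        (j for j in range(len(points)) if j != i),
        key=lambda j: max(abs(points[j][0] - px), abs(points[j][1] - py)),
    )
    nearest = others[:k]
    if not nearest:
        return 1.0  # a single-point data set has nothing to compare against
    mean_nbr = sum(len(square_neighbors(points, j, r)) for j in nearest) / len(nearest)
    return mean_nbr / own

# Toy data: a 5 x 5 unit grid (one dense cluster) plus one isolated point.
points = [(float(x), float(y)) for x in range(5) for y in range(5)] + [(20.0, 20.0)]
candidates = candidate_outliers(points, r=1.0, min_pts=4)
print(candidates)  # [25]
scores = {i: stand_in_score(points, i, r=1.0, k=3) for i in candidates}
```

In this toy run only the isolated point survives the first stage, and its stand-in score is larger than that of a point inside the grid, which is the qualitative behavior the abstract describes for candidate refinement.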

References (10)

  • 1. HAN J, KAMBER M. Data Mining: Concepts and Techniques [M]. New York: Morgan Kaufmann Publishers, 2001.
  • 2. LI Chuanchuan, LIU Yanheng, TIAN Daxin. Network Intrusion Detection System Based on Sequential Patterns [J]. Journal of Jilin University (Engineering and Technology Edition), 2007, 37(1): 121-125.
  • 3. HODGE V J, AUSTIN J. A Survey of Outlier Detection Methodologies [J]. Artificial Intelligence Review, 2004, 22(6): 85-126.
  • 4. XU Yan, ZHU Hengmin. Methods for Integrating Data Mining with Databases [J]. Journal of Jilin University (Information Science Edition), 2007, 25(2): 228-232.
  • 5. BREUNIG M M, KRIEGEL H P, NG R T, et al. LOF: Identifying Density-Based Local Outliers [C]//2000 ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 2000: 93-104.
  • 6. ESTER M, KRIEGEL H P, SANDER J, et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise [C]//2nd ACM SIGKDD Int'l Conf on Knowledge Discovery and Data Mining. New York: ACM Press, 1996: 226-231.
  • 7. HUANG Tianqiang, QIN Xiaolin, YE Feiyue. A New Outlier Detection Method Based on Square Neighborhood [J]. Control and Decision, 2006, 21(5): 541-545.
  • 8. JIN W, TUNG A K H, HAN J, et al. Ranking Outliers Using Symmetric Neighborhood Relationship [C]//10th Pacific-Asia Conf on Knowledge Discovery and Data Mining. Berlin: Springer, 2006: 93-104.
  • 9. ZHOU S, ZHAO Y, GUAN J, et al. A Neighborhood-Based Clustering Algorithm [C]//9th Pacific-Asia Conf on Knowledge Discovery and Data Mining. Berlin: Springer, 2005: 361-371.
  • 10. KATAYAMA N, SATOH S. The SR-Tree: An Index Structure for High-Dimensional Nearest Neighbor Queries [C]//1997 ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1997: 369-380.

