摘要
离群点检测在是数据挖掘的重要领域,广泛应用在信用卡欺诈检测、网络入侵检测等重要方面,文中在结合层次聚类和相似性,给出高维数据的相似度量函数与类密度的概念,并基于类密度重新定义高维数据的离群点,从而提出一种基于相似度量的离群点检测算法;实验表明:算法对高维数据中的离群点检测有一定的价值。
Outlier detection is an important content in data mining and is widely used in the field of credit card fraud detection, network invasion detection and so on. According to hierarchical clustering and similarity, this paper presents the concept of high dimensional data similarity measurement function and class density, based on class density,the outlier of high dimensional data is redefined so that a kind of outlier detection algorithm based on similarity measurement is proposed. Experiment shows that this algorithm has certain value on outlier detection in high dimensional data.
出处
《重庆工商大学学报(自然科学版)》
2012年第10期96-100,共5页
Journal of Chongqing Technology and Business University:Natural Science Edition
基金
安徽省教育厅自然科学基金项目(05010428)
关键词
离群点
网络入侵
数据挖掘
层次聚类
相似性度量
outlier
network invasion
data mining
hierarchical clustering
similarity measurement