期刊文献+

一种基于加权概率密度的上下文离群检测算法

A CONTEXTUAL OUTLIER DETECTION ALGORITHM BASED ON WEIGHTED PROBABILITY DENSITY
下载PDF
导出
摘要 采用加权概率密度,提出一种上下文离群数据检测算法。利用高斯混合模型和稀疏度矩阵,确定相关子空间;在相关子空间中,采用加权概率密度局部异常因子公式,计算数据对象的离群因子,可以有效反映和刻画数据对象与其周围数据对象的不一致程度;选取离群因子最大的N个数据对象为离群数据,并将离群因子、相关子空间属性取值、局部数据集作为其上下文信息,有效地改善了离群数据的可解释性;采用人工和UCI数据集,实验验证了算法的有效性。 A contextual outlier data detection algorithm is proposed by using weighted probability density.In the algorithm,the Gaussian mixture model and the sparsity matrix were used to determine the correlation subspace.The weighted probability density local anomaly factor formula was used to calculate the outlier factor of the data object in the relevant subspace,which could effectively reflect and describe the degree of inconsistency between data objects and their surrounding data objects.N data objects with the largest outlier factor value were selected as outliers,and the value of outlier factor,correlation subspace attributes and local data sets were taken as their contextual information,effectively improving the interpretability and understandability of outlier data objects.Experimental results validate the effectiveness of this algorithm by using artificial data set and UCI data sets.
作者 白慧 张继福 Bai Hui;Zhang Jifu(School of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan 030024,Shanxi,China)
出处 《计算机应用与软件》 北大核心 2024年第2期279-285,共7页 Computer Applications and Software
基金 国家自然科学基金项目(61876122)。
关键词 离群检测 相关子空间 加权概率密度 上下文信息 Outlier detection Correlation subspace Weighted probability density Contextual information
  • 相关文献

参考文献2

二级参考文献3

共引文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部