摘要
利用基于密度的离群数据挖掘算法离群数据不在非离群数据指定的邻域内的特点,改进了原有的离群数据挖掘算法:首先判断数据是否在某个非离群数据指定的邻域内,如果不在,再判断其邻域内数据的个数。通过对二维空间数据测试表明,改进的算法能够快速有效地挖掘出数据集中的离群数据,速度上数倍于原来的算法。
Based on the characteristic that outliers are not included in the appointed neighborhood of inliers, an improved algorithm for outlier mining was proposed. Data was judged whether it was included in the appointed neighborhood of inliers firstly. If the answer was negative, the number of data that was included in the appointed neighborhood was counted. Experimental results show that the improved algorithm is effective and efficient in outlier mining and it is faster than the original algorithm.
出处
《计算机应用》
CSCD
北大核心
2007年第3期559-560,573,共3页
journal of Computer Applications
基金
国家自然科学基金资助项目(60403009)
重庆市自然科学基金资助项目(2005BB2224)