期刊文献+

基于DCNDA算法的数据异常检测 被引量:5

Data anomaly detection based on DCNDA algorithm
下载PDF
导出
摘要 针对CURE算法聚类过程中对噪音点敏感,随机抽样存在局限性,以及对收缩因子敏感且在大型数据集聚类方面效率欠佳的问题,提出一种基于MeanShift核函数平移模型DBSCAN算法改进的CURE算法,即DCNDA(density-based CURE noise detection clustering algorithm)。自适应参数的DBSCAN算法提高初步聚类精度和可靠性,引入质心公式改进CURE算法,避免受收缩因子影响,降低时间复杂度,提高算法全局收敛性和可靠性。仿真结果表明,DCNDA算法在时间复杂度、聚类准确率、异常值检测效率方面优于改进分区CURE算法和PDBSCAN算法。 Aiming at the problem of noise point sensitivity,random sampling limitation and sensitivity to shrink factor and inefficiency in large data set clustering in CURE clustering process,an improved CURE based on MeanShift kernel function translation model DBSCAN algorithm was proposed,namely DCNDA(density-based CURE noise detection clustering algorithm).The adaptive parameter DBSCAN algorithm improved the initial clustering accuracy and reliability.The centroid formula was introduced to improve the CURE algorithm,avoiding the shrinkage factor,reducing the time complexity and improving the global convergence and reliability of the algorithm.The simulation results show that the DCNDA algorithm outperforms the improved partition CURE algorithm and PDBSCAN algorithm in terms of time complexity,clustering accuracy and outlier detection efficiency.
作者 蒋华 季丰 王鑫 王慧娇 JIANG Hua;JI Feng;WANG Xin;WANG Hui-jiao(School of Computer Science and Information Security,Guilin University of Electronic Technology,Guilin 541000,China)
出处 《计算机工程与设计》 北大核心 2018年第11期3425-3430,3485,共7页 Computer Engineering and Design
基金 2016广西高校中青年教师基础能力提升基金项目(ky2016YB150) 桂林电子科技大学研究生教育创新计划基金项目(2017YJCX48)
关键词 DCNDA算法 CURE算法 DBSCAN算法 MeanShift核函数 异常值检测 DCNDA CURE DBSCAN MeanShift kernel function outliers detection
  • 相关文献

参考文献3

二级参考文献22

  • 1Chen M S.Data mining:an overview from database perspective[].IEEE Transactions on Knowledge and Data Engineering.1996
  • 2ROEMMICH D, BOEBEL O, DESAUBIES Y, et aL Argo: the global array of profiling floats [ M]. Observing the Oceans in the 21 st Century. Melbourne, Australia: Godae Project Office, Bureau of Meteorology, 2001:248 -258.
  • 3HASTIE T, STUETZLE W. Principal curves [ J]. Journal of the A- merican Statistical Association, 1989, 84(406): 502-516.
  • 4KEGL B, KRZYZAK A, LINDER T, et al. Learning and design of principal curves [ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence 2000, 22(3): 281-297.
  • 5SHI X, LV Y, FEI Z, et al. A muhivariable statistical process mo-nitoring method based on multiscale analysis and principal curves [ J]. International Journal of Innovative Computing, Information and Control, 2013, 9(4): 1781-1800.
  • 6FERREIRA D D, SEIXAS J M D, CERQUEIRA A S, et al. A new power quality deviation index based on principal curves [ J]. Elec- tric Power Systems Research, 2015, 125:8 - 14.
  • 7CHEN D, CHEN L. Practical constraint K-segment principal curve algorithms for generating railway GPS digital map [ J]. Mathematical Problems in En,ineerin,. 2013, 51 3) : 205 - 209.
  • 8DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters [ C] // Proceedings of the 6th conference on Sym- posium on Opearting Systems Design & Implementation. Berkeley: USENIX Association, 2004, 6: 10.
  • 9Zhang T,Ramakrishnan R,Livny M.BIRCH: An efficient data clustering method for very large databases[].Proceedings of ACM-SIGMOD International Conference on Management of Data.1996
  • 10Ng R,Han J.Efficient and Effective Clustering Methods for Spatial Data Mining. Proc. 1994 Int. Conf. Very Large Data Bases (VLDB’ 94) . 1994

共引文献107

同被引文献38

引证文献5

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部