期刊文献+

S-DBSCAN:一种基于DBSCAN发现高密度簇的算法 被引量:5

S-DBSCAN: an algorithm for finding high density clusters based on DBSCAN
下载PDF
导出
摘要 针对基于密度的带有噪声的空间聚类(DBSCAN)算法用于交互式数据挖掘时用户经常调整算法参数以发现感兴趣的知识以及数据集相对稳定的特点,提出了一种基于DBSCAN发现高密度簇的算法—S-DBSCAN算法,确定了需调整的算法参数——对象的邻域范围8(Eps)和满足核心对象条件的£邻域内最小对象个数MinPts,阐述了参数8与MinPts的3种适合S-DBSCAN算法的变化情况,并给出了相应的证明,同时分析了算法的时间复杂度。在对真实和合成数据集的测试中,S-DBSCAN算法相比DBSCAN算法具有较好的效率。 Considering that when the algorithm based on density-based spatial clustering of applications with noise (DB- SCAN) is applied to interactive data mining, certain algorithm parameters are usually adjusted to find new knowl- edge, and the data set used in data mining is relatively stable, this paper presents an algorithm for finding high density clusters based on DBSCAN, called the S-DBSCAN algorithm, and determines the parameters needing to be adjusted, the e, neighborhood of an object, and the MinPts, minimal number of objects of e-neighborhood to form a core object. Then three different combinations of the variations of e-neighborhood and MinPts fit for the S-DB- SCAN algorithm are introduced, and the rightness is demonstrated and the time complexity is analyzed. The experi- ments on real and synthetic data were performed to verify the efficiency and the results show that the S-DBSCAN al- gorithm has a better efficiency than DBSCAN.
出处 《高技术通讯》 CAS CSCD 北大核心 2012年第6期589-595,共7页 Chinese High Technology Letters
基金 863计划(2009AA122220,2009AA122226)资助项目
关键词 基于密度的带有噪声的空间聚类(DBSCAN) S-DBSCAN 高密度簇 聚类 数可变 density-based spatial clustering of applications with noise ( DBSCAN), S-DBSCAN, high densityclusters, clustering, parameter changing
  • 相关文献

参考文献12

  • 1Han J W, Kamber M. Data Mining: Concepts and Tech- niques. 2nd ed. San Francisco: Morgan Kaufmann, 2006. 383-384.
  • 2Yasmina S M, Adriano M. Automatic classification of lo- cation contexts with decision trees. In : Proceedings of the Conference on Mobile and Ubiquitous Systems, Guima- raes, Portugal, 2006. 74-88.
  • 3Zhou C Q, Frankowski D, Ludford P, et al. Discovering personally meaningful places: An interactive clustering approach. ACM Trans InfSyst, 2007, 25(3) : 12.
  • 4Ester M, Kriegel H P. A density-based algorithm for dis- covering clusters in large spatial databases with noise. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, Porland, USA, 1996. 226-231.
  • 5Wu F, Gardarin G. Gradual clustering algorithms. In: Proceedings of the 7th International Conference on Data- base Systems for Advanced Applications, Los Alamitos, USA, 2001. 48-55.
  • 6Ester M, Kriegel H P, Sander J, et al. Incremental clus- tering for,mining in a data warehousing environment. In: Proceedings of the 24th International Conference on Very Large Data Bases, New York, USA, 1998. 323-333.
  • 7Bose J H, Andrzejak A, Hogqvist M. Beyond online ag- gregation: parallel and incremental data mining with on- line Map-Reduce. In: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, Raleigh, USA, 2010. 1-6.
  • 8Goyal N, Goyal P, Venkatramaiah K, et al. An efficient density based incremental clustering algorithm in data warehousing environment. In : Proceedings of 2009 Inter- national Conference on Computer Engineering and Appli- cations, Manila, Phillipines, 2009. 556-560.
  • 9Li F, Liu S, Dou Z T, et al. An inheritable clustering al- gorithm suited for parameter changing. In: Proceedings of the 2004 International Conference on Machine Learning and Cybernetics, New York, USA, 2004. 1198-1203.
  • 10Liu S, Feng X J, Feng X. An inheritable algorithm for repeated clustering. In: Proceedings of the International Conference on Computer Science and Software Engineer- ing, Wuhan, China, 2008. 340-343.

二级参考文献8

  • 1Breunig M,Kriegel H P,Ng R T,et al.LOF:Identifying Density-based Local Outliers[A].In:Proc.ACM SIGMOD 2000 Int.Conf.On Management of Data[C],Dalles,TX,2000.
  • 2Liu Q B,Deng Su,Lu C H,et al.Relative Density Based K-nearest Neighbors Clustering Algorithm[A].In:Proc.2003 Int.Conf.on Machine Learning and Cybernetics[C],Xi' an,China,2003,133-137.
  • 3Tang J,Chen Z X,et al.A Robust Outlier Detection Scheme for Large Data Sets[EB/OL].In:http://www.cs.panam.edu/chen/papers.html.
  • 4邵峰晶,于忠清.数据挖掘:原理与算法[M].北京:中国水利水电出版社,2004.
  • 5Ester M,Kriegel H P,Sander J,et al.A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise[A].In:Proc.2^nd Int.Conf.on Knowledge Discovery and Data Mining[C],Portland,OR,1996:226-231.
  • 6Han J W,Kamber M,Fan M,et al.Data Ming:Conception and Technology[M].Beijin:Machine Press,2001.
  • 7Ester M,Kriegel H P,Sander J,et al.Incremental Clustering for Mining in a Data Warehousing Environment[A].In:Gupta A,Shmueli O,Widom J,eds.,the 24th International Conference on Very Large Data Bases[C],New York,Morgan Kaufmann Publishers Inc.,1998:323-333.
  • 8Ankerst M,Breunig M,Kriegel H P,et al.OPTICS:Ordering Points To Identify the Clustering Structure[A].In:Proc.ACM SIGMOD'99,Int.Conf.On Management of Data[C],Philadelphia,PA,1999.

共引文献12

同被引文献28

引证文献5

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部