摘要
针对现有聚类算法计算复杂度普遍较高的问题,提出了一种基于定位的方法。该算法采用空间定位的方法将数据对象映射到特征空间中,并利用空间立方体的某些特殊顶点定位任一数据点;通过计算数据点与空间立方体顶点群的距离差异,完成聚类过程。在电信数据集上的实验结果表明,算法的时间复杂度降至O(N)级别。
出处
《电子技术应用》
北大核心
2007年第4期118-120,123,共4页
Application of Electronic Technique
基金
国家自然科学基金项目(No.60432010)
参考文献9
-
1GUHA S, RASTOGI R, SHIM K. CURE: An effcient clustering algorithm for large databases. In: Haas LM,Tiwary A, eds. Proc. of the ACM SIGMOD Int'l Conf.on Management of Data. Seattle: ACM Press, 1998:73-84http://citeseer.ist.psu.edu/guha98cure.html
-
2ZHANG T, RAMAKRISHNAN R, LIVNY M. BIRCH: An efficient data clustering method for very large databases.In: Jagadish HV, Mumick IS, eds. Proc. of the 1996 ACM SIGMOD Int'l Conf. on Management of Data. Montreal: ACM Press, 1996:103-114.
-
3MCQUEEN J. Some methods for classfication and Analysis of Multivariate Observations. In: LeCam, L., Neyman, J.,eds.Proceddings of the 5^th Berkeley Symposium on Mathematical Statistics and Probability, 1967:281-297.
-
4ANKERST M, BREUNIG M M, KRIEGEL H P, et al.OPTICS: Ordering points to identify the clustering structure In: Delis A, Faloutsos C, Ghandeharizadeh S, eds. Proc.ACM SIGMOD Int'l Conf. on Management of Data.Philadelphia; ACM Press, 1999:49-60.
-
5ESTER M, KRIEGEL H P, SANDER J,et al. A density-based algorithm for discovering clusters in large spatial databases with noises. Proceedings of the 2^th International Conference on Knowledge Discovery and Data Mining(KDD'96), AAAI Press,1996:226-231.
-
6WANG W, YANG J, MUNTZ R R. STING: A statistical information grid approach to spatial data mining. Proc. of the 23rd Int'l Conf. on Very Large Data Bases. Athens:Morgan Kaufmann, 1997:186-195.
-
7Integral Solutions Limited. ISL clementine user guide and reference manual, 1994.
-
8DASH M, LIU H. Feather selection for clustering. In Pacific-Asia Conference on Knowledge Discouvery and Data Mining, 2000:110-121.
-
9钱卫宁,周傲英.从多角度分析现有聚类算法(英文)[J].软件学报,2002,13(8):1382-1394. 被引量:86
二级参考文献36
-
1[1]Fasulo, D. An analysis of recent work on clustering algorithms. Technical Report, Department of Computer Science and Engineering, University of Washington, 1999. http://www.cs.washington.edu.
-
2[2]Baraldi, A., Blonda, P. A survey of fuzzy clustering algorithms for pattern recognition. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), 1999,29:786~801.
-
3[3]Keim, D.A., Hinneburg, A. Clustering techniques for large data sets - from the past to the future. Tutorial Notes for ACM SIGKDD 1999 International Conference on Knowledge Discovery and Data Mining. San Diego, CA, ACM, 1999. 141~181.
-
4[4]McQueen, J. Some methods for classification and Analysis of Multivariate Observations. In: LeCam, L., Neyman, J., eds. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. 1967. 281~297.
-
5[5]Zhang, T., Ramakrishnan, R., Livny, M. BIRCH: an efficient data clustering method for very large databases. In: Jagadish, H.V., Mumick, I.S., eds. Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data. Quebec: ACM Press, 1996. 103~114.
-
6[6]Guha, S., Rastogi, R., Shim, K. CURE: an efficient clustering algorithm for large databases. In: Haas, L.M., Tiwary, A., eds. Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. Seattle: ACM Press, 1998. 73~84.
-
7[7]Beyer, K.S., Goldstein, J., Ramakrishnan, R., et al. When is 'nearest neighbor' meaningful? In: Beeri, C., Buneman, P., eds. Proceedings of the 7th International Conference on Data Theory, ICDT'99. LNCS1540, Jerusalem, Israel: Springer, 1999. 217~235.
-
8[8]Ester, M., Kriegel, H.-P., Sander, J., et al. A density-based algorithm for discovering clusters in large spatial databases with noises. In: Simoudis, E., Han, J., Fayyad, U.M., eds. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD'96). AAAI Press, 1996. 226~231.
-
9[9]Ester, M., Kriegel, H.-P., Sander, J., et al. Incremental clustering for mining in a data warehousing environment. In: Gupta, A., Shmueli, O., Widom, J., eds. Proceedings of the 24th International Conference on Very Large Data Bases. New York: Morgan Kaufmann, 1998. 323~333.
-
10[10]Sander, J., Ester, M., Kriegel, H.-P., et al. Density-Based clustering in spatial databases: the algorithm GDBSCAN and its applications. Data Mining and Knowledge Discovery, 1998,2(2):169~194.
共引文献85
-
1刘英林,陈玉柱,丁文静,程红云.钢卷表面缺陷分布特征发现方法研究[J].冶金自动化,2020,44(1):27-31. 被引量:2
-
2毛颖颖,杨新凯.融合拓扑势的自适应层次聚类算法研究[J].计算机应用研究,2020,37(S01):37-39.
-
3李华,贾雪.基于FM度量的自适应K-Means聚类的工业生产运行基准挖掘[J].长春大学学报,2022,32(4):22-27.
-
4Qi Zhang,Jianshe Cao,Yanfeng Sui.Development of a research platform for BEPCⅡ accelerator fault diagnosis[J].Radiation Detection Technology and Methods,2020,4(3):269-276.
-
5郭景峰,赵玉艳,边伟峰,李晶.基于改进的凝聚性和分离性的层次聚类算法[J].计算机研究与发展,2008,45(z1):202-206. 被引量:15
-
6王建会,申展,胡运发.一种实用高效的聚类算法[J].软件学报,2004,15(5):697-705. 被引量:26
-
7张虎,郑家恒,刘江.语料库词性标注一致性检查方法研究[J].中文信息学报,2004,18(5):11-16. 被引量:9
-
8杨涛,李龙澍.一种基于粗糙集聚类的数据约简算法[J].系统仿真学报,2004,16(10):2195-2197. 被引量:5
-
9张虎,郑家恒,刘江.汉语语料库词性标注自动校对方法研究[J].计算机应用,2005,25(1):17-19. 被引量:1
-
10栾丽华,吉根林.一种基于四叉树的快速聚类算法[J].计算机应用,2005,25(5):1001-1003. 被引量:6
-
1日本推出世界上最小电脑[J].初中生(锐作文),2009(3):62-62.
-
2世界上最小的电脑[J].中国计算机用户,2008(31):68-68.
-
3日本推出世界上最小电脑[J].新闻天地(上半月刊),2008(12):54-54.
-
4骰子大小的计算机[J].科学世界,2008(10):11-11.
-
5日本推出世界上最小电脑[J].英才,2008(10):26-26.
-
6汪佳玲,胡本木,孙越泓.一种基于定位更新技术的人工蜂群聚类算法[J].南京师大学报(自然科学版),2015,38(4):95-102. 被引量:2
-
7薛宁.放在手心里的电脑[J].科技展望(幻想大王),2008(11):6-6.
-
8世界上最小的电脑[J].发明与创新(大科技),2008(10):28-28.
-
9日本推出世界上最小的计算机[J].系统工程与电子技术,2008,30(11):2102-2102.
-
10掌控“星际网络”的世界最小计算机[J].读写算(科技知识动漫),2008(12):4-4.