An Improved Algorithm Based on the Divide-and-Merge Clustering Algorithm

Abstract: The BNAK-Divide-and-Merge clustering algorithm is an improvement on the Divide-and-Merge clustering algorithm proposed by David et al. Divide-and-Merge is a clustering methodology that combines a top-down divide method with a bottom-up merge method. Although many experiments have demonstrated the efficiency and quality of its clustering, its divide phase consumes a great deal of time and space when applied to very large data sets, and its reliance on a threshold to determine the number of clusters is also not ideal. This paper improves the original algorithm to address these two main shortcomings.
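The divide-then-merge idea described in the abstract can be illustrated with a minimal sketch. It is neither the original Divide-and-Merge implementation nor the BNAK variant, and everything in it is an assumption made for illustration: the points are 1-D numbers, the divide phase splits at the widest gap between sorted values (a simple stand-in for the cut used in the original method), the merge phase scores clusters by sum of squared distances to the cluster mean, and the number of clusters `k` is passed in directly rather than derived from a threshold. The function names `divide`, `sse`, and `merge` are hypothetical.

```python
def divide(points):
    """Top-down phase: build a binary tree by recursive bisection.

    Assumption: points are 1-D numbers and each split is made at the
    widest gap in sorted order (a stand-in, not the original cut rule).
    A node is a tuple (points, left_child, right_child).
    """
    pts = sorted(points)
    if len(pts) == 1:
        return (tuple(pts), None, None)
    # cut index i splits pts into pts[:i] and pts[i:]
    cut = max(range(1, len(pts)), key=lambda i: pts[i] - pts[i - 1])
    return (tuple(pts), divide(pts[:cut]), divide(pts[cut:]))


def sse(cluster):
    """Sum of squared distances to the cluster mean (assumed objective)."""
    mean = sum(cluster) / len(cluster)
    return sum((x - mean) ** 2 for x in cluster)


def merge(tree, k):
    """Bottom-up phase: pick the cheapest partition into exactly k clusters
    in which every cluster is the full point set of some tree node.

    Returns (cost, clusters). Plain recursion without memoization, which is
    adequate for a small illustration.
    """
    def best(node, j):
        pts, left, right = node
        if j == 1:
            return sse(pts), [list(pts)]
        if left is None or j > len(pts):
            return float("inf"), []  # a leaf cannot be split further
        candidates = []
        for i in range(1, j):  # i clusters from the left, j - i from the right
            cost_l, part_l = best(left, i)
            cost_r, part_r = best(right, j - i)
            candidates.append((cost_l + cost_r, part_l + part_r))
        return min(candidates, key=lambda c: c[0])

    return best(tree, k)


if __name__ == "__main__":
    data = [1.0, 1.2, 0.9, 5.0, 5.1, 9.8, 10.0, 10.3]
    cost, clusters = merge(divide(data), 3)
    print(clusters)
```

In this sketch the tree is built once and the merge step can then be rerun for different values of `k` without repeating the divide phase, which is the part the abstract identifies as expensive on large data sets.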

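Authors: Huang Zhiwu (黄智武), Zhang Dongzhan (张东站), Duan Jiangjiao (段江娇)

Affiliation: School of Information Science and Engineering, Xiamen University

Source: Modern Computer (《现代计算机》), 2010, No. 5, pp. 4-8 (5 pages)

Funding: National Natural Science Foundation of China (No. 50604012)

Keywords: clustering algorithm; divide method; merge method; time and space resources; number of clusters

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]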

References (12)

1. David Cheng, Ravi Kannan. A Divide-and-Merge Methodology for Clustering. ACM, New York, NY, USA, 2006, pp. 1499-1525; 2007: 37-65.
2. Ravi Kannan, Santosh Vempala, Adrian Vetta. On Clusterings: Good, Bad and Spectral. Journal of the ACM (JACM), 51(3), May 2004, pp. 497-515.
3. Charles J. Alpert, So-Zen Yao. Spectral Partitioning: The More Eigenvectors, the Better. 32nd Design Automation Conference (DAC 95), 1995.
4. Meila, M., Shi, J. A Random Walks View of Spectral Segmentation. International Conference on AI and Statistics (AISTATS), Key West, FL, January 4-7, 2001.
5. Von Luxburg, U., Bousquet, O., Belkin, M. Limits of Spectral Clustering. In: Saul, L. K., Weiss, Y., Bottou, L. (Eds.), Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference, pp. 857-864. MIT Press, Cambridge, MA, USA (07 2005).
6. Pang-Ning Tan, Michael Steinbach. Introduction to Data Mining. Pearson Education, Inc., publishing as Addison Wesley.
7. Blake, C. L., Merz, C. J. UCI Repository of Machine Learning Databases. 1998. http://www.ics.uci.edu/-mlearn/MLSummary.html.
8. K. Lang. 20 Newsgroups Data Set. http://www.ai.mit.edu/people/jrennie/20newsgrups/.
9. Shi, J. B., Malik, J. Normalized Cuts and Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 888-905.
10. Fan R. K. Chung. Spectral Graph Theory. AMS Bookstore, ISBN 0821803158, 9780821803158: 2-5.
