Distributed Clustering Algorithm Based on Centers and Density

Abstract: To address the shortcomings of the distributed clustering algorithm DBDC, a distributed clustering algorithm based on centers and density, called DCUCD, is proposed. Virtual points computed from the data distribution serve as core objects, and the core objects become more representative the more times the algorithm is executed. Clustering then amounts to classifying all of the core objects. Theoretical analysis and experimental results show that DCUCD effectively handles local noise and irregularly distributed data points, discovers clusters of arbitrary shape, and achieves good time efficiency and clustering quality.
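The abstract gives only the outline of DCUCD, so the Python sketch below illustrates the general idea of center-and-density distributed clustering and is not the authors' algorithm. It assumes that each site summarizes its local 2-D points as virtual centers (here the means of grid cells, a stand-in chosen for brevity), that cells with fewer than a hypothetical `min_pts` points are discarded as noise, and that a coordinator groups the centers from all sites by linking any two that lie within a hypothetical radius `eps`. All function and parameter names are assumptions made for illustration.

```python
from collections import defaultdict
from math import dist, floor


def local_centers(points, cell=1.0, min_pts=3):
    """Summarize one site's 2-D points as virtual centers (grid-cell means).

    Cells holding fewer than min_pts points are treated as noise and dropped.
    The grid summary is an assumption; the paper's construction is not given.
    """
    cells = defaultdict(list)
    for x, y in points:
        cells[(floor(x / cell), floor(y / cell))].append((x, y))
    centers = []
    for members in cells.values():
        if len(members) >= min_pts:
            n = len(members)
            centers.append((sum(p[0] for p in members) / n,
                            sum(p[1] for p in members) / n))
    return centers


def merge_centers(centers, eps=1.5):
    """Label centers so that two centers share a cluster whenever a chain
    of centers, each within eps of the next, connects them."""
    labels = [-1] * len(centers)
    cluster = 0
    for i in range(len(centers)):
        if labels[i] != -1:
            continue  # already reached from an earlier center
        labels[i] = cluster
        stack = [i]
        while stack:
            j = stack.pop()
            for k in range(len(centers)):
                if labels[k] == -1 and dist(centers[j], centers[k]) <= eps:
                    labels[k] = cluster
                    stack.append(k)
        cluster += 1
    return labels


def distributed_cluster(sites, cell=1.0, min_pts=3, eps=1.5):
    """Collect the virtual centers of every site, then cluster the centers."""
    centers = [c for site in sites for c in local_centers(site, cell, min_pts)]
    return centers, merge_centers(centers, eps)


if __name__ == "__main__":
    site_a = [(0.1, 0.1), (0.2, 0.3), (0.3, 0.2), (0.4, 0.4), (9.5, 9.5)]
    site_b = [(0.6, 0.6), (0.7, 0.8), (0.8, 0.7),
              (5.1, 5.1), (5.2, 5.3), (5.3, 5.2)]
    centers, labels = distributed_cluster([site_a, site_b])
    for center, label in zip(centers, labels):
        print(label, center)
```

In this sketch only the centers cross site boundaries, not the raw points, which is the usual motivation for summary-based distributed clustering. The isolated point `(9.5, 9.5)` falls in a sparse cell and yields no center.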

Authors: Feng Shaorong (冯少荣), Zhang Dongzhan (张东站)

Affiliation: School of Information Science and Technology, Xiamen University

Source: Computer Engineering (《计算机工程》), 2010, No. 19, pp. 56-58 (3 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (Grant No. 50604012)

Keywords: data mining; distributed clustering; centers; noise

Classification code: TP311.133 [Automation and Computer Technology: Computer Software and Theory]

References (7)

1. Huang Xueyu, Wei Na, Tao Jianfeng. Anomaly Detection Algorithm Based on Artificial Immune Clustering[J]. Computer Engineering, 2010, 36(1): 166-169.
2. Ji Zhoupeng, Zhou Jun, He Ming. Web User Clustering Method Based on Variable Precision Rough Set[J]. Computer Engineering, 2010, 36(3): 44-46.
3. Januzaj E, Kriegel H P, Pfeifle M. DBDC: Density Based Distributed Clustering[C]//Proc. of EDBT'04. [S. l.]: Springer, 2004: 88-105.
4. Januzaj E, Kriegel H P, Pfeifle M. Scalable Density-based Distributed Clustering[C]//Proc. of PKDD'04. Pisa, Italy: Springer, 2004: 231-244.
5. Zheng Jinbin, Zhuo Yibao. Research on Density-Based Distributed Clustering Algorithms[J]. Computer Engineering, 2008, 34(17): 65-67.
6. Zhou Jun, Liu Zhijing. Distributed Clustering Based on K-means and CPGA[C]//Proc. of FSKD'08. Jinan, China: [s. n.], 2008.
7. Jiang Guoxing, Yang Zhiya. A Distributed Clustering Algorithm Based on Cluster Stability for Mobile Ad Hoc Networks[C]//Proc. of WiCOM'08. Dalian, China: [s. n.], 2008: 1-6.

