摘要
粗糙K-means聚类算法具有较强的处理边界不确定数据能力,但该算法也存在对初始聚类中心选取敏感,以及采用固定权重和阈值方式而导致聚类结果不稳定、精度下降等问题。许多研究工作从不同角度致力于解决这些问题。引入人工蜂群算法(ABC)从三方面对算法进行了改进:首先,以下近似和边界集中数据对象个数与对象在数据集中空间分布的差异性乘积的比值为基础,设计了一种更为合理的动态调整下近似和边界集的权重方法。其次,为加快算法的收敛速度,给出了一种与迭代次数相关联的自适应阈值ε的实现方法。最后,通过构造蜜源位置的适应度函数,引导蜂群向高质量蜜源全局搜索,把蜂群每次迭代得到的最优源位置作为初始聚类中心,并在此基础上进行交替聚类。实验结果表明,改进后的算法提高了聚类结果的稳定性,获得了较好的聚类效果。
The rough K-means clustering algorithm has strong ability to deal with data with uncertain boundaries.However,this algorithm also has limitations such as sensitivity to the selection of initial clustering centers,and use of fixed weights and thresholds resulting in unstable clustering results and decreased accuracy.A lot of research has been devoted to solving these problems from different angles.With introduction of artificial bee colony(ABC)algorithm,the algorithm is improved from three aspects.Firstly,based on the ratio of the number of objects in lower approximate set and the boundary set to the product of the difference of the objects in the dataset,a more reasonable method of dynamically adjusting the weights of approximation and boundary set is designed.Secondly,in order to speed up the convergence speed of the algorithm,an implementation method of adaptive thresholdεassociated with the number of iterations is given.Thirdly,by constructing the fitness function of the nectar source location,the bee colony is guided to search for high-quality nectar sources globally.The best position of honey source obtained by each iteration is taken as the initial cluster center,and the cluster is carried out on the basis of this.Experimental results show that the improved algorithm improves the stability of the clustering results and obtains better clustering effect.
作者
叶廷宇
叶军
王晖
王磊
YE Tingyu;YE Jun;WANG Hui;WANG Lei(School of Information Engineering,Nanchang Institute of Technology,Nanchang 330000,China;Jiangxi Province Key Laboratory of Water Information Cooperative Sensing and Intelligent Processing(Nanchang Institute of Technology),Nanchang 330000,China)
出处
《计算机科学与探索》
CSCD
北大核心
2022年第8期1923-1932,共10页
Journal of Frontiers of Computer Science and Technology
基金
国家自然科学基金(61562061,61663028)
江西省自然科学基金(20212BAB202022)
江西省教育厅科技项目(GJJ170995)。