双端聚类的自动调整聚类联邦学习

Automatically adjusted clustered federated learning for double-ended clustering

下载PDF

导出

摘要联邦学习(FL)是一种分布式机器学习方法,旨在共同训练全局模型,然而全局模型难以胜任多数据分布情况。为应对多分布挑战,引入聚类联邦学习,以客户端分组方式优化共享多模型。其中,服务器端聚类难以修正分类错误,而客户端聚类则对初始模型的选择至关重要。为解决这些问题,提出自动调整聚类联邦学习(AACFL)框架,所提框架采用双端聚类整合服务器端和客户端聚类。首先用双端聚类将客户端分为可调整集群,其次自动调整局部客户端身份,最后获取正确的客户集群。在非独立同分布下,在3个经典联邦数据集上的评估实验结果表明,AACFL能够在双端聚类结果存在错误的情况下通过调整获得正确集群,当簇数为4,客户端数为100时,与联邦平均(FedAvg)算法、聚类联邦学习(CFL)和IFCA(Iterative Federated Clustering Algorithm)等方法相比,有效地提高模型收敛速度和获得正确聚类结果的速度,准确率平均提升0.20~23.16个百分点。验证了所提框架能够高效聚类,并提高模型收敛速度和准确率。 Federated Learning(FL)is a distributed machine learning method that aims to jointly train a global model,but the global model is difficult to handle multi-data distribution situations.To deal with the multi-distribution challenge,clustered federated learning was introduced to optimize shared multiple models in a client grouping manner.Among them,server-side clustering was difficult to correct classification errors,while client-side clustering was crucial to the selection of the initial model.To solve these problems,an Automatically Adjusted Clustered Federated Learning(AACFL)framework was proposed,which used double-ended clustering to integrate server-side and client-side clustering.Firstly,double-ended clustering was used to divide client ends into adjustable clusters.Then,local client end identities were adjusted automatically.Finally,the correct client clusters were obtained.AACFL was evaluated on three classical federated datasets under non-independent and identically distributed conditions.Experimental results show that AACFL can obtain correct clusters through adjustment when there are errors in the double-ended clustering results.Compared with FedAvg(Federated Averaging)algorithm,CFL(Clustered Federated Learning),IFCA(Iterative Federated Clustering Algorithm)and other methods,AACFL can effectively improve the model convergence speed and the speed of obtaining correct clustering results,and has the accuracy improved by 0.20-23.16 percentage points on average with the number of clusters is 4 and the number of clients is 100.Therefore,the proposed framework can cluster efficiently and improve model convergence speed and accuracy.

作者尹春勇周永成 YIN Chunyong;ZHOU Yongcheng(School of Computer Science,School of Cyberspace Security,Nanjing University of Information Science and Technology,Nanjing Jiangsu 210044,China;School of Software,Nanjing University of Information Science and Technology,Nanjing Jiangsu 210044,China)

机构地区南京信息工程大学计算机学院、网络空间安全学院南京信息工程大学软件学院

出处《计算机应用》 CSCD 北大核心 2024年第10期3011-3020,共10页 journal of Computer Applications

基金国家自然科学基金资助项目(6177282)。

关键词联邦学习聚类异构数据分布式机器学习神经网络 Federated Learning(FL) clustering heterogeneous data distributed machine learning neural network

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1窦真兰,江晶晶,张春雁,张洋.基于双层博弈理论的跨区域综合能源型微电网能量优化共享策略[J].电气应用,2024,43(9):75-83.
2马骞,郭庆平,孙衍,李翀,程雯婕,渠海.康复管理在肝胆胰术患者中的运用效果及手术部位感染预防作用[J].中华实验外科杂志,2024,41(9):2050-2050.
3朱韵卓,张义富.面向医疗物联网的一种通信高效联邦学习方法设计[J].中阿科技论坛（中英文）,2024(10):49-53.
4Jia Yan,Xiaoquan Yang,Peifen Weng.A robust implicit high-order discontinuous Galerkin method for solving compressible Navier-Stokes equations on arbitrary grids[J].Acta Mechanica Sinica,2024,40(8):96-119.
5郭倩,赵津,过弋.基于分层聚类的个性化联邦学习隐私保护框架[J].信息网络安全,2024(8):1196-1209.
6孙艳华,王子航,刘畅,杨睿哲,李萌,王朱伟.个性化联邦学习的相关方法与展望[J].计算机工程与应用,2024,60(20):68-83.
7李璇,任娟.中心群组化孕期保健模式在初产妇中的应用及效果探析[J].中国社区医师,2024,40(23):101-103.
8张文敏,黄友丽.中西医结合围产保健对孕产妇围产期并发症的影响[J].中国社区医师,2024,40(23):152-154.
9张锐.家族办公室:金融产业链顶端的华丽之舞[J].中关村,2024(9):26-29.
10吴维鑫,侯会文,石乐义.基于深度学习和联邦学习的工控入侵检测研究[J].微电子学与计算机,2024,41(9):22-31.

计算机应用

2024年第10期

浏览历史

内容加载中请稍等...

双端聚类的自动调整聚类联邦学习

相关作者

相关机构

相关主题

浏览历史