面向多源数据的个性化联邦学习框架

Personalized Federal Learning Framework for Multi-Source Data

下载PDF

导出

摘要在联邦学习中,中心服务器聚合来自不同的客户端经过差分隐私扰动后的模型,其中差分隐私噪声添加的大小和隐私预算的分配直接影响到模型的可用性,现有的研究大多基于平衡的数据和固定的隐私预算,在处理多源不平衡数据时难以权衡精度与隐私保护水平,针对该问题提出了一种具有自适应差分隐私噪声添加的联邦学习框架,采取基于沙普利值的贡献度证明算法计算不同数据来源的客户端的贡献度,并依据贡献度为不同客户端在梯度更新的过程中添加差异化的差分隐私噪声,继而实现个性化的隐私保护。理论和实验分析表明该框架面对多源不平衡数据时不仅可以为不同参与方提供更加细化的隐私保护水平,同时在模型性能方面也比传统的FL-DP算法高出1.3个百分点。 In federated learning,the central server aggregates and models from different clients after differential privacy perturbation,in which the size of differential privacy noise addition and the allocation of the privacy budget directly affect the usability of the model,most of the existing studies are based on balanced data and fixed privacy budgets,which makes it difficult to trade-off the accuracy and the level of privacy protection when dealing with imbalanced data from multiple sources.To address this problem,a federated learning framework with adaptive differential privacy noise addition is proposed,which adopts a contribution proof algorithm based on the Shapley value to compute the contribution degree of clients with different data sources,and based on the contribution degree,differentiated differential privacy noise is added for different clients in the process of gradient updating,and then personalized privacy protection is achieved.Theoretical and experimental analyses show that this framework can not only provide a more fine-grained level of privacy protection for different participants when facing multi-source unbalanced data,but also outperforms the traditional FL-DP algorithm by 1.3 percentage points in terms of model performance.

作者裴浪涛陈学斌任志强翟冉 PEI Langtao;CHEN Xuebin;REN Zhiqiang;ZHAI Ran(School of Science,North China University of Science and Technology,Tangshan,Hebei 063210,China;Hebei Province Key Laboratory of Data Science and Application(North China University of Science and Technology),Tangshan,Hebei 063210,China;Tangshan Data Science Laboratory(North China University of Science and Technology),Tangshan,Hebei 063210,China)

机构地区华北理工大学理学院河北省数据科学与应用重点实验室(华北理工大学) 唐山市数据科学重点实验室(华北理工大学)

出处《计算机工程与应用》 CSCD 北大核心 2024年第19期278-287,共10页 Computer Engineering and Applications

基金国家自然科学基金(U20A20179)。

关键词联邦学习差分隐私沙普利值不平衡数据 federated learning differential privacy Shapley value unbalanced data

分类号 TP309 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

1刘旭,刘颂凯,杨超,张磊,段雨舟,晏光辉.基于逐步特征增广梯度提升的暂态功角稳定评估及可解释性分析[J].现代电力,2024,41(5):844-853.
2龚懿昀,于海波,王韵,康丽,曾建潮.双模型驱动的多偏好策略自适应差分演化算法[J].中北大学学报（自然科学版）,2024,45(5):638-646.
3康海燕,王骁识.基于数据特征相关性和自适应差分隐私的深度学习方法研究[J].电子学报,2024,52(6):1963-1976.
4李启涛,张世明,孙振勇,郑亚慧.多波束点云中复杂河道断面地形的自动提取方法[J].海洋测绘,2024,44(4):30-34.
5吕鲲,罗星雨,靖继鹏.融合CFDP和CPM分析的关键核心技术识别及其路径分析——以芯片光刻领域为例[J].图书情报工作,2024,68(16):75-89.
6刘宜欣,周蕾,徐梦明,金肖依,路晓.基于TVS和电容的浪涌保护电路[J].传感器技术与应用,2024,12(5):752-756.
7方国华,郑旺,吴承君,颜敏.梯级水库优化调度的改进ODDDP算法研究[J].水电能源科学,2024,42(9):179-184.

计算机工程与应用

2024年第19期

浏览历史

内容加载中请稍等...

面向多源数据的个性化联邦学习框架

相关作者

相关机构

相关主题

浏览历史