期刊文献+

兼顾通信效率与效用的自适应高斯差分隐私个性化联邦学习

Communication-efficient and Utility-A ware Adaptive Gaussian Differential Privacy for Personalized Federated Learning
下载PDF
导出
摘要 近年来,由于联邦学习中的通信参数(或梯度)会给参与方本地敏感数据带来重大的隐私泄露风险,联邦学习隐私保护引起了广泛的关注.然而,梯度交换频繁、数据分布异构、参与方本地硬件资源受限等一系列不可避免的因素给联邦学习隐私保护增加了挑战难度.为了以一种统一的方式同时有效地解决数据隐私、模型效用、通信效率以及参与方数据非独立同分布等四个方面的问题,本文提出了一种新的兼顾通信效率与效用的自适应高斯差分隐私个性化联邦学习(Communication-efficient and Utility-aware Adaptive Gaussian Differential Privacy for Personalized Federated Learning,CUAG-PFL)方法.具体而言,本文提出一种动态层级压缩模型梯度的方案先为通信模型梯度每一层动态生成特定的压缩率,再根据压缩率构造对应的确定性二进制测量矩阵去除梯度冗余信息.随后,通过同时优化裁剪阈值、敏感度和噪声尺度等隐私相关参数来对压缩的模型梯度执行自适应高斯差分隐私操作.此外,本文对CUAG-PFL进行了严格的隐私分析.为了验证CUAG-PFL在隐私、效用、通信效率以及个性化四个方面的优势,本文在CIFAR-10和CIFAR-100两个真实联邦数据集上进行了大量实验模拟、对比和分析,结果表明CUAG-PFL能够提高参与方本地数据隐私性、通信效率和模型效用,同时解决了数据非独立同分布的问题.特别地,即使在隐私预算仅为0.92且上行通信量减少68.6%时,CUAG-PFL因隐私保护和梯度压缩所引起的模型效用损失仅为1.66%. In recent years,there has been an increasing focus on the privacy protection in the field of federated learning(FL).This widespread attention is mainly due to the fact that commu-nication parameters(or gradients)during the process of collaborative learning among the central server and various participants can cause the significant risk of the privacy leakage.In other words,the communication process in the FL system poses a potential threat of exposing the sensitive data belonging to local participants,which has raised heightened concerns among researchers and practitioners.Furthermore,in addition to the challenge of the privacy protection in FL,a series of other unavoidable factors such as the frequent gradients exchange,the hetero-geneous data distribution among local participants,and limited resources available on the local hardware need to be simultaneously taken into consideration.These factors obviously add diffi-culties to the challenge of the privacy protection in FL.In order to effectively address four critical issues of data privacy,model utility,communication efficiency,and non-independently and iden-tically distributed data among local participants in a unified manner,this paper proposes a novel Communication-efficient and Utility-aware Adaptive Gaussian Differential Privacy for Personal-ized FL method,called CUAG-PFL.Specifically,a dynamic layer-compression scheme for model gradients in the FL system is proposed.This scheme aims to improve the communication efficien-cy as much as possible and reduce the loss of the model utility caused by compression and recon-struction through dynamically customizing the compression rate for each layer of communication gradients,and then constructing the corresponding deterministic binary measurement matrix based on the compression rate.This designed deterministic binary measurement matrix can effec-tively remove the redundant information of model gradients that needs to be uploaded to the cen-tral server.Subsequently,the adaptive Gaussian differential privacy operation is performed on compressed model gradients of local participants.This operation involves optimizing the main privacy-related parameters such as the clipping threshold,the sensitivity,and the noise scale.By optimizing these parameters at the same time,this operation ensures that the privacy of the local data is preserved,while allowing each model of the corresponding local participant to have the satisfactory performance.In addition,the rigorous privacy analysis of the proposed CUAG-PFL is presented in this paper.In order to validate the superiority of the proposed CUAG-PFL in four critical aspects of data privacy,model utility,communication efficiency,and personalization,a large number of experimental simulations,comparisons,and analyses are conducted on two classic real-world federated datasets,i.e.,CIFAR-10 and CIFAR-100.All experimental results and analyses show that the proposed CUAG-PFL can simultaneously improve the privacy of local sensitive data,the communication efficiency and the model utility,as well as address the problem of non-independently and identically distributed data among local participants in the FL system.In particular,it is worth emphasizing that even when the privacy budget is only 0.92 and the amount of the upstream communication is reduced by 68.6%,the loss of the model performance caused by both the privacy protection and the communication gradients compression is just 1.66%for the proposed CUAG-PFL.
作者 李敏 肖迪 陈律君 LI Min;XIAO Di;CHEN LÜ-Jun(College of Com puter Science,Chongqing University,Chongqing 400044)
出处 《计算机学报》 EI CAS CSCD 北大核心 2024年第4期924-946,共23页 Chinese Journal of Computers
基金 国家重点研发项目(No.2020YFB1805400) 国家自然科学基金(No.62072063) 重庆市研究生科研创新项目(No.CYB22063)资助.
关键词 自适应高斯差分隐私 隐私-效用权衡 动态层级压缩 通信高效 个性化联邦学习 隐私计算 adaptive Gaussian differential privacy privacy-utility trade-off dynamic hierarchical compression high-efficient communication personalized federated learning private computing
  • 相关文献

参考文献7

二级参考文献70

  • 1Donoho D L.Compressed sensing.IEEE Transactions on Information Theory,2006,52(4):1289-1306.
  • 2Baraniuk R,et al.A simple proof of the restricted isometry property for random matrices.Constructive Approximation,2008,28(3):253-263.
  • 3Candes E J.The restricted isometry property and its implications for compressed sensing.Comptes Rendus Mathematique,2008,346(9-10):589-592.
  • 4Candes E J et al.Robust uncertainty principles:Exact signal reconstruction from highly incomplete frequency information.IEEE Transactions on Information Theory,2006,52(2):489-509.
  • 5Candes E J,Tao T.Near-optimal signal recovery from randora projections,Universal encoding strategies?IEEE Transactions on Information Theory,2006,52(12):5406-5425.
  • 6Romberg J.Imaging via compressive sampling.IEEE Signal Processing Magazine,2008,25(2):14-20.
  • 7Candes E J,Tao T.Decoding by linear programming.IEEE Transactions on Information Theory,2005,51(3):4203-4215.
  • 8Cand,et al.Sparsity and incoherence in compressive sampiing.Inverse Problems,2007,23(3):969-985.
  • 9Candes E,Tao T.The dantzig selector:Statistical estimation when P is much larger than n.Annals of Statistics,2007,35(6):2313-2351.
  • 10Chen S S,Donoho D L,Saunders M A.Atomic decomposition by basis pursuit.SIAM Journal on Scientific Computing,2001,43(1):129-159.

共引文献338

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部