期刊文献+

基于联邦学习的Gamma回归算法 被引量:1

FL-GRM:Gamma Regression Algorithm Based on Federated Learning
下载PDF
导出
摘要 在水文学、气象学以及保险理赔评估等领域中,通常假设因变量服从Gamma分布,相比多元线性回归,在Gamma分布假设下建立起的Gamma回归具有更出色的拟合效果。以往获得Gamma回归模型的方法是将数据集中起来进行训练,当数据是由多方提供时,在不交换数据的情况下训练满足隐私保护的Gamma回归模型成为需要解决的问题。为此,提出了一种多方安全的纵向联邦Gamma回归算法,该算法首先使用迭代法推导出纵向联邦Gamma回归模型的对数似然估计表达式,然后结合工程实际确定模型的连接函数,进而构造损失函数建立参数的梯度更新策略,最后对同态加密后的各方参数进行融合更新,获得联邦学习后的Gamma回归模型。在两种公开数据集上进行性能测试,实验结果表明,所提联邦Gamma回归算法在不交换数据的前提下,可有效利用多方数据的价值生成Gamma回归模型,该模型对数据的拟合效果逼近数据在集中情况下学习到的Gamma回归模型,优于单方独立学习获得的Gamma回归模型。 People commonly hypothesize that an independent variable follows a Gamma distribution in many areas,including hydrology,meteorology and insurance claim.Under the Gamma distribution assumption,Gamma regression model enables an outstanding fitting effect,compared with multivariate linear-regression model.Previous studies may be able to obtain a Gamma regression model trained only on a public dataset.However,when the datasets are provided by multiple parties,how to seek to address the problem of data privacy by training Gamma regression model without exchanging the data itself?A secure multi-party federated Gamma regression algorithm has been applied to this area.Firstly,the log-likelihood function is derived with the iterative method.Secondly,the link function is determined according to the fact,and the gradient updating strategy is constructed by the loss function.Finally,the parameters with homomorphic encryption are updated,then the training is completed.The model is tested on two public datasets,and the results show that under the premise of privacy protection our method can effectively use the value of multi-party data to generate Gamma regression model.The fitting performance of our method is better than that of Gamma regression model implements in a single part,and is close to the result yielded by centralized data learning model.
作者 郭艳卿 李宇航 王湾湾 付海燕 吴铭侃 李祎 GUO Yan-qing;LI Yu-hang;WANG Wan-wan;FU Hai-yan;WU Ming-kan;LI Yi(School of Information and Communication Engineering,Dalian University of Technology,Dalian,Liaoning 116024,China;Research Center of InsightOne Tech Co.,Ltd.,Beijing 100028,China)
出处 《计算机科学》 CSCD 北大核心 2022年第12期66-73,共8页 Computer Science
基金 国家自然科学基金(62076052,62106037,U1936117) 中央高校基本科研业务费(DUT20TD110,DUT20RC(3)088) 国家社科基金重大项目(19ZDA127) 模式识别国家重点实验室开放课题项目(202100032)。
关键词 联邦学习 Gamma回归 同态加密 隐私保护 多方安全计算 Federated learning Gamma regression Homomorphic encryption Privacy protection Secure multi-party computation
  • 相关文献

参考文献5

二级参考文献28

  • 1管汉雄,熊颖,申楠茜,樊艳青,邵剑波,李宏军,李小明,胡道予,朱文珍,金征宇.新型冠状病毒肺炎(COVID-19)临床影像学特征[J].放射学实践,2020,0(2):125-130. 被引量:255
  • 2文英.人类活动强度定量评价方法的初步探讨[J].科学与社会,1998(4):56-61. 被引量:56
  • 3胡志斌,何兴元,李月辉,朱教君,李小玉.岷江上游地区人类活动强度及其特征[J].生态学杂志,2007,26(4):539-543. 被引量:45
  • 4Anderson D, Feldblum S, Modlin C, et al. A Practitioner's Guide to Generalized Linear Models [J]. CAS Exam Study Note Casualty Actuarial Society - Arlington, 2007, 2: 1-116.
  • 5卡尔斯R,胡法兹M,达呐J,等著.唐启鹤,胡太忠,成世学译.现代精算风险理论(第一版)[M].北京:科学出版社,2005.
  • 6Nelder J A, Mccullagh P. Generalized Linear Models [M]. London: Chapman & Hall, 1989.
  • 7Lemaire J. Bonus-Malus systems in automobile insurance [M]. Netherlands: Kluwer Academic Publishers, 1995.
  • 8Fu L, Moncher R B. Severity distributions for GLMs: gamma or lognormal: evidence from monte carlo simulations [J]. Casualty Actuarial Society Discussion Paper Program Casualty Actuarial Society- Arlington, 2004:149 230.
  • 9吴征镒 朱彦丞.云南植被[M].北京:科学出版社,1987..
  • 10徐志刚,庄大方,杨琳.区域人类活动强度定量模型的建立与应用[J].地球信息科学,2009,11(4):452-460. 被引量:37

共引文献79

同被引文献23

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部