Transfer learning is an effective method to predict the energy consumption of information-poor buildings by learning transferable knowledge from operational data of information-rich buildings.However,it is not recomme...Transfer learning is an effective method to predict the energy consumption of information-poor buildings by learning transferable knowledge from operational data of information-rich buildings.However,it is not recommended to directly use the operational data without protection due to the risk of leaking occupants’privacy.To address this problem,this study proposes a federated learning-based method to learn transferable knowledge from building operational data without privacy leaking.It trains a transferable federated model based on the operational data from the buildings similar to the target building with limited data.An advanced secure aggregation algorithm is adopted in the training process to ensure that no one can infer private information from the training data.The federated model obtained is evaluated by comparing it with the standalone model without federated learning based on 13 similar office buildings from the Building Data Genome Project.The results show that the federated model outperforms the standalone model concerning the prediction accuracy and training time.On average,the federated model achieves a 25.4%decrease in CV-RMSE when the target building has limited operational data.Even if the target building has no operational data,the federated model still achieves acceptable accuracy(CV-RMSE is 22.2%).Meanwhile,the training time of the federated model is 90%less than that of the standalone model.The research insights can help develop federated learning-based methods for solving the data silos problem in building energy management.The methodology and analysis procedures are reproducible and all codes and data sets are available on Github.展开更多
Federated learning is a new type of distributed learning framework that allows multiple participants to share training results without revealing their data privacy.As data privacy becomes more important,it becomes dif...Federated learning is a new type of distributed learning framework that allows multiple participants to share training results without revealing their data privacy.As data privacy becomes more important,it becomes difficult to collect data from multiple data owners to make machine learning predictions due to the lack of data security.Data is forced to be stored independently between companies,creating“data silos”.With the goal of safeguarding data privacy and security,the federated learning framework greatly expands the amount of training data,effectively improving the shortcomings of traditional machine learning and deep learning,and bringing AI algorithms closer to our reality.In the context of the current international data security issues,federated learning is developing rapidly and has gradually moved from the theoretical to the applied level.The paper first introduces the federated learning framework,analyzes its advantages,reviews the results of federated learning applications in industries such as communication and healthcare,then analyzes the pitfalls of federated learning and discusses the security issues that should be considered in applications,and finally looks into the future of federated learning and the application layer.展开更多
基金supported by the National Key Research and Development Program of China(No.2018YFE0116300)the National Natural Science Foundation of China(No.51978601).
文摘Transfer learning is an effective method to predict the energy consumption of information-poor buildings by learning transferable knowledge from operational data of information-rich buildings.However,it is not recommended to directly use the operational data without protection due to the risk of leaking occupants’privacy.To address this problem,this study proposes a federated learning-based method to learn transferable knowledge from building operational data without privacy leaking.It trains a transferable federated model based on the operational data from the buildings similar to the target building with limited data.An advanced secure aggregation algorithm is adopted in the training process to ensure that no one can infer private information from the training data.The federated model obtained is evaluated by comparing it with the standalone model without federated learning based on 13 similar office buildings from the Building Data Genome Project.The results show that the federated model outperforms the standalone model concerning the prediction accuracy and training time.On average,the federated model achieves a 25.4%decrease in CV-RMSE when the target building has limited operational data.Even if the target building has no operational data,the federated model still achieves acceptable accuracy(CV-RMSE is 22.2%).Meanwhile,the training time of the federated model is 90%less than that of the standalone model.The research insights can help develop federated learning-based methods for solving the data silos problem in building energy management.The methodology and analysis procedures are reproducible and all codes and data sets are available on Github.
文摘传统联邦学习训练模型时假定所有参与方可信,但实际场景存在恶意参与方或恶意攻击模型,现有的联邦学习算法面对投毒攻击时,存在模型性能严重下降的问题。针对模型投毒问题,本文提出一种基于联邦平均(federated averaging,Fedavg)与异常检测的联邦检测算法——FedavgCof,该算法考虑到所有参与方之间的差异对比,在中心服务器和本地模型之间添加异常检测层,通过基于聚类的本地异常检测因子(cluster-based local outlier factor,COF)异常检测算法剔除影响模型性能的异常参数,提升模型鲁棒性。实验结果表明,虽然新型投毒方式攻击性更强,但是FedavgCof能够有效防御投毒攻击,降低模型性能损失,提高模型抗投毒攻击能力,相较于Median和模型清洗算法平均提升精度达到10%以上,大幅提升了模型的安全性。
基金supported by National Natural Science Foundation of China (NO.51974131)Hebei Province Natural Science Fund for Distinguished Young Scholars (NO.E2020209082).
文摘Federated learning is a new type of distributed learning framework that allows multiple participants to share training results without revealing their data privacy.As data privacy becomes more important,it becomes difficult to collect data from multiple data owners to make machine learning predictions due to the lack of data security.Data is forced to be stored independently between companies,creating“data silos”.With the goal of safeguarding data privacy and security,the federated learning framework greatly expands the amount of training data,effectively improving the shortcomings of traditional machine learning and deep learning,and bringing AI algorithms closer to our reality.In the context of the current international data security issues,federated learning is developing rapidly and has gradually moved from the theoretical to the applied level.The paper first introduces the federated learning framework,analyzes its advantages,reviews the results of federated learning applications in industries such as communication and healthcare,then analyzes the pitfalls of federated learning and discusses the security issues that should be considered in applications,and finally looks into the future of federated learning and the application layer.