期刊文献+

Optimized Consensus for Blockchain in Internet of Things Networks via Reinforcement Learning

原文传递
导出
摘要 Most blockchain systems currently adopt resource-consuming protocols to achieve consensus between miners;for example,the Proof-of-Work(PoW)and Practical Byzantine Fault Tolerant(PBFT)schemes,which have a high consumption of computing/communication resources and usually require reliable communications with bounded delay.However,these protocols may be unsuitable for Internet of Things(IoT)networks because the IoT devices are usually lightweight,battery-operated,and deployed in an unreliable wireless environment.Therefore,this paper studies an efficient consensus protocol for blockchain in IoT networks via reinforcement learning.Specifically,the consensus protocol in this work is designed on the basis of the Proof-of-Communication(PoC)scheme directly in a single-hop wireless network with unreliable communications.A distributed MultiAgent Reinforcement Learning(MARL)algorithm is proposed to improve the efficiency and fairness of consensus for miners in the blockchain system.In this algorithm,each agent uses a matrix to depict the efficiency and fairness of the recent consensus and tunes its actions and rewards carefully in an actor-critic framework to seek effective performance.Empirical results from the simulation show that the fairness of consensus in the proposed algorithm is guaranteed,and the efficiency nearly reaches a centralized optimal solution.
出处 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2023年第6期1009-1022,共14页 清华大学学报(自然科学版(英文版)
基金 This work was partially supported by the National Key Research and Development Program of China(No.2020YFB1005900) the National Natural Science Foundation of China(Nos.62102232,62122042,and 61971269) the Natural Science Foundation of Shandong Province(No.ZR2021QF064).
  • 相关文献

参考文献4

二级参考文献23

共引文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部