Optimized Consensus for Blockchain in Internet of Things Networks via Reinforcement Learning

导出

摘要 Most blockchain systems currently adopt resource-consuming protocols to achieve consensus between miners;for example,the Proof-of-Work(PoW)and Practical Byzantine Fault Tolerant(PBFT)schemes,which have a high consumption of computing/communication resources and usually require reliable communications with bounded delay.However,these protocols may be unsuitable for Internet of Things(IoT)networks because the IoT devices are usually lightweight,battery-operated,and deployed in an unreliable wireless environment.Therefore,this paper studies an efficient consensus protocol for blockchain in IoT networks via reinforcement learning.Specifically,the consensus protocol in this work is designed on the basis of the Proof-of-Communication(PoC)scheme directly in a single-hop wireless network with unreliable communications.A distributed MultiAgent Reinforcement Learning(MARL)algorithm is proposed to improve the efficiency and fairness of consensus for miners in the blockchain system.In this algorithm,each agent uses a matrix to depict the efficiency and fairness of the recent consensus and tunes its actions and rewards carefully in an actor-critic framework to seek effective performance.Empirical results from the simulation show that the fairness of consensus in the proposed algorithm is guaranteed,and the efficiency nearly reaches a centralized optimal solution.

作者 Yifei Zou Zongjing Jin Yanwei Zheng Dongxiao Yu Tian Lan

机构地区 Institute of Intelligent Computing Department of Electrical and Computer Engineering

出处《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2023年第6期1009-1022,共14页 清华大学学报（自然科学版（英文版）

基金 This work was partially supported by the National Key Research and Development Program of China(No.2020YFB1005900) the National Natural Science Foundation of China(Nos.62102232,62122042,and 61971269) the Natural Science Foundation of Shandong Province(No.ZR2021QF064).

关键词 consensus in blockchain Proof-of-Communication(PoC) MultiAgent Reinforcement Learning(MARL) Internet of Things(IoT)networks

分类号 TP183 [自动化与计算机技术—控制理论与控制工程] TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1Weiping Wang,Zhaorong Wang,Zhanfan Zhou,Haixia Deng,Weiliang Zhao,Chunyang Wang,Yongzhen Guo.Anomaly Detection of Industrial Control Systems Based on Transfer Learning[J].Tsinghua Science and Technology,2021,26(6):821-832. 被引量：13
2Ziarmal Nazar Mohammad,Fadi Farha,Adnan O.M Abuassba,Shunkun Yang,Fang Zhou.Access Control and Authorization in Smart Homes: A Survey[J].Tsinghua Science and Technology,2021,26(6):906-917. 被引量：2
3Xiaolong Xu,Haoyuan Li,Weijie Xu,Zhongjian Liu,Liang Yao,Fei Dai.Artificial Intelligence for Edge Service Optimization in Internet of Vehicles:A Survey[J].Tsinghua Science and Technology,2022,27(2):270-287. 被引量：10
4Li Yang,Yifei Zou,Minghui Xu,Yicheng Xu,Dongxiao Yu,Xiuzhen Cheng.Distributed Consensus for Blockchains in Internet-of-Things Networks[J].Tsinghua Science and Technology,2022,27(5):817-831. 被引量：3

二级参考文献23

1Zhiyao Hu,Dongsheng Li,Deke Guo.Balance Resource Allocation for Spark Jobs Based on Prediction of the Optimal Resource[J].Tsinghua Science and Technology,2020,25(4):487-497. 被引量：6
2Zhao Tong,Feng Ye,Ming Yan,Hong Liu,Sunitha Basodi.A Survey on Algorithms for Intelligent Computing and Smart City Applications[J].Big Data Mining and Analytics,2021,4(3):155-172. 被引量：3
3Youssef Nait Malek,Mehdi Najib,Mohamed Bakhouya,Mohammed Essaaidi.Multivariate Deep Learning Approach for Electric Vehicle Speed Forecasting[J].Big Data Mining and Analytics,2021,4(1):56-64. 被引量：6
4Mourade Azrour,Jamal Mabrouki,Azedine Guezzaz,Yousef Farhaoui.New Enhanced Authentication Protocol for Internet of Things[J].Big Data Mining and Analytics,2021,4(1):1-9. 被引量：6
5Yadong Huang,Yueting Chai,Yi Liu,Jianping Shen.Architecture of Next-Generation E-Commerce Platform[J].Tsinghua Science and Technology,2019,24(1):18-29. 被引量：3
6张思聪,谢晓尧,徐洋.基于dCNN的入侵检测方法[J].清华大学学报（自然科学版）,2019,59(1):44-52. 被引量：22
7YANGFangchun WANG Shangguang LI Jinglin LIU Zhihan SUN Qibo.An Overview of Internet of Vehicles[J].China Communications,2014,11(10):1-15. 被引量：52
8刘万军,秦济韬,曲海成.基于改进单类支持向量机的工业控制网络入侵检测方法[J].计算机应用,2018,38(5):1360-1365. 被引量：18
9Yan Cao,Zhiqiu Huang,Shuanglong Kan,Dajuan Fan,Yang Yang.Specification and Verification of a Topology-Aware Access Control Model for Cyber-Physical Space[J].Tsinghua Science and Technology,2019,24(5):497-519. 被引量：4
10Jinhui Liu,Yong Yu,Jianwei Jia,Shijia Wang,Peiru Fan,Houzhen Wang,Huanguo Zhang.Lattice-Based Double-Authentication-Preventing Ring Signature for Security and Privacy in Vehicular Ad-Hoc Networks[J].Tsinghua Science and Technology,2019,24(5):575-584. 被引量：10

共引文献24

1Peng Zhi,Rui Zhao,Haoran Zhou,Yanwu Zhou,Nam Ling,Qingguo Zhou.Analysis on the development status of intelligent and connected vehicle test site[J].Intelligent and Converged Networks,2021,2(4):320-333. 被引量：2
2郝志强,刘冬,王冲华.工业领域网络流量安全分析关键技术研究[J].工业信息安全,2022(3):27-35. 被引量：4
3程孟菲,高淑萍.基于深度迁移学习的多尺度股票预测[J].计算机工程与应用,2022,58(12):249-259. 被引量：1
4Mohamed H.Mousa,Mohamed K.Hussein.Effcient UAV-Based MEC Using GPU-Based PSO and Voronoi Diagrams[J].Computer Modeling in Engineering & Sciences,2022(11):413-434. 被引量：2
5邓雨康,张磊,李晶.车联网隐私保护研究综述[J].计算机应用研究,2022,39(10):2891-2906. 被引量：12
6Abednego Acheampong,Yiwen Zhang,Xiaolong Xu,Daniel Appiah Kumah.A Review of the Current Task Offloading Algorithms,Strategies and Approach in Edge Computing Systems[J].Computer Modeling in Engineering & Sciences,2023(1):35-88.
7刘辉,张磊,李晶.基于车联网的隐私保护数据聚合研究综述[J].计算机应用研究,2022,39(12):3546-3554. 被引量：4
8余冰雁,雷凯茹,毛琦祺.车联网多级平台体系架构与关键技术[J].移动通信,2022,46(11):8-13. 被引量：4
9付钰,段雪源,王坤,徐浩.基于深度学习的系统异常检测综述[J].海军工程大学学报,2022,34(5):45-53. 被引量：1
10杨庆新,张献,章鹏程.电动车智慧无线电能传输云网[J].电工技术学报,2023,38(1):1-12. 被引量：4

1李斌.基于多智能体强化学习的多无人机边缘计算任务卸载[J].无线电工程,2023,53(12):2731-2740.
2夏家伟,刘志坤,朱旭芳,刘忠.基于多智能体强化学习的无人艇集群集结方法[J].北京航空航天大学学报,2023,49(12):3365-3376. 被引量：1
3Intelligent Internet of Things with Reliable Communication and Collaboration Technologies[J].China Communications,2023,20(12).
4Mila Ilieva-Obretenova,Radi Pipev.IoT for Streetlighting-Requirements for Modelling of Management Services-Part I[J].Management Studies,2023,11(5):245-252.
5Qiang Wu,Jianqing Wu,Bojian Kang,Bo Du,Jun Shen,Adriana Simona Mihăiţă.An integrated and cooperative architecture for multi-intersection traffic signal control[J].Digital Transportation and Safety,2023,2(2):150-163.
6Guangwu Yang,Long Yang,Han Zhao,Haoxu Ding,Bing Yang,Shoune Xiao.Method for Evaluating Bolt Competitive Failure Life Under Composite Excitation[J].Chinese Journal of Mechanical Engineering,2023,36(4):372-384.
7钱立军,宣亮,陈健,陈晨.基于SAC算法的多交叉口交通信号控制研究[J].天津大学学报（自然科学与工程技术版）,2024,57(1):105-111.
8罗睿卿,曾坤,张欣景.稀疏异质多智能体环境下基于强化学习的课程学习框架[J].计算机科学,2024,51(1):301-309.
9Jiawen Xu,Rong Zhang,Jie Ma,Hanting Zhao,Lianlin Li.In-situ manipulation of wireless link with reinforcement-learningdriven programmable metasurface in indoor environment[J].Journal of Information and Intelligence,2023,1(3):217-227.
10Fan Yuan,Luhong Diao,Donglei Du,Lei Liu.Fair k-Center Problem with Outliers on Massive Data[J].Tsinghua Science and Technology,2023,28(6):1072-1084.

Tsinghua Science and Technology

2023年第6期

浏览历史

内容加载中请稍等...

Optimized Consensus for Blockchain in Internet of Things Networks via Reinforcement Learning

参考文献4

二级参考文献23

共引文献24

相关作者

相关机构

相关主题

浏览历史