期刊文献+

基于多主体博弈和强化学习的多微网系统协同优化研究 被引量:10

Collaborative Optimization of Multi-microgrid System Based on Multi-agent Game and Reinforcement Learning
下载PDF
导出
摘要 面向分属于不同投资和运营主体的多个冷热电联供型微网构成的多微网系统,该文提出了一种基于多主体博弈的多微网系统协同优化方法,通过博弈论建立多微网系统的协同优化模型,实现各个微网的利益均衡。针对该模型Nash均衡求解困难的问题,提出了一种改进的Nash-Q学习算法。该算法采用深度神经网络来拟合Nash-Q学习算法中的价值函数,不仅有效解决了Nash-Q学习算法直接应用于复杂环境时遇到的维数灾难问题,并且保证了算法的后效性,能快速完成合理有效的在线优化。实验结果表明,相较于传统数学规划方法和贪婪算法,改进的Nash-Q学习算法能够学习到Nash均衡策略,实现各微网间电能互补,降低各微网的运行成本,验证了所提模型和算法的有效性。 Aiming at the multi-microgrid system composed of multiple microgrids with combined cooling,heating and power,which belong to different investment and operating entities,this paper proposes a collaborative optimization for multi-microgrid systems based on the multi-agent game.By using the game theory,a collaborative optimization model for multi-microgrid system is established to achieve the balance of the interests of all the microgrids.For the difficulty of solving the Nash equilibrium of the model,an improved Nash-Q learning algorithm is proposed.In this algorithm the deep neural network is used to fit the value function in the Nash-Q learning algorithm.Then the dimension disaster problem is solved when Nash-Q learning algorithm is directly applied to complex environment,and also the aftereffect of the algorithm is ensured,which quickly completes the reasonable and effective online optimization.The experimental results show that compared with the traditional mathematical programming and the greedy algorithm,the improved Nash-Q learning algorithm learns well the Nash equilibrium strategy,realizes the power complementarity among microgrids,reduces the operation cost of microgrids,which verifies the effectiveness of the proposed model and the algorithm.
作者 刘俊峰 王晓生 卢俊菠 曾君 LIU Junfeng;WANG Xiaosheng;LU Junbo;ZENG Jun(School of Automation Science and Engineering,South China University of Technology,Guangzhou 510640,Guangdong Province,China;School of Electric Power,South China University of Technology,Guangzhou 510640,Guangdong Province,China)
出处 《电网技术》 EI CSCD 北大核心 2022年第7期2722-2732,共11页 Power System Technology
基金 国家自然科学基金项目(62173148,51877085) 广州市科技计划科学研究专项(No.202002030373)。
关键词 多微网系统 协同优化 博弈论 强化学习 Nash-Q multi-microgrid system collaborative optimization game theory reinforcement learning Nash-Q
  • 相关文献

参考文献18

二级参考文献285

共引文献822

同被引文献131

引证文献10

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部