Improved Proximal Policy Optimization Algorithm for Sequential Security-constrained Optimal Power Flow Based on Expert Knowledge and Safety Layer

Abstract In recent years, reinforcement learning (RL) has emerged as a solution for model-free dynamic programming problems that cannot be effectively solved by traditional optimization methods. It has gradually been applied in fields such as economic dispatch of power systems owing to its strong self-learning and self-optimizing capabilities. However, existing RL-based economic scheduling methods ignore the security risks that the agent may introduce during exploration, so the agent may issue instructions that threaten the safe operation of the power system. Therefore, we propose an improved proximal policy optimization algorithm for sequential security-constrained optimal power flow (SCOPF) based on expert knowledge and a safety layer to determine the active power dispatch strategy, the voltage optimization scheme of the units, and the charging/discharging dispatch of energy storage systems. Expert experience is introduced to improve the ability to enforce constraints such as power balance during training while guiding the agent to effectively improve the utilization rate of renewable energy. Additionally, to avoid line overload, we add a safety layer at the end of the policy network that introduces transmission constraints to avoid dangerous actions and tackle the sequential SCOPF problem. Simulation results on an improved IEEE 118-bus system verify the effectiveness of the proposed algorithm.
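
The abstract does not say how the safety layer is built, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes line flows respond linearly to changes in unit active power (as in a DC power flow approximation) and shrinks a proposed dispatch change until no line exceeds its limit. The function name `safety_layer`, its arguments, and the numbers in the usage lines are all hypothetical.

```python
import numpy as np


def safety_layer(delta_p, base_flow, sens, flow_limit):
    """Scale a proposed dispatch change so linearized line flows stay in limits.

    Hypothetical inputs (not taken from the paper):
      delta_p    -- proposed change in unit active power from the policy (MW)
      base_flow  -- current line flows (MW), assumed to be within limits
      sens       -- line-by-unit sensitivity matrix (assumed linear model)
      flow_limit -- per-line flow limits (MW)
    """
    delta_p = np.asarray(delta_p, dtype=float)
    base_flow = np.asarray(base_flow, dtype=float)
    flow_limit = np.asarray(flow_limit, dtype=float)
    if np.any(np.abs(base_flow) > flow_limit):
        raise ValueError("base operating point already violates a line limit")

    # Predicted change in each line flow if the full action were applied.
    delta_flow = np.asarray(sens, dtype=float) @ delta_p

    # Largest step fraction alpha in [0, 1] that keeps every line within limits.
    alpha = 1.0
    for f0, df, fmax in zip(base_flow, delta_flow, flow_limit):
        if df > 0:
            alpha = min(alpha, (fmax - f0) / df)
        elif df < 0:
            alpha = min(alpha, (-fmax - f0) / df)
    return alpha * delta_p


# Usage with made-up numbers: the first line would be overloaded by the full
# action, so the action is scaled back until that line sits at its limit.
sens = [[0.5, -0.5], [0.2, 0.3]]
safe = safety_layer([60.0, 0.0], [80.0, 10.0], sens, [100.0, 50.0])
```

Uniform scaling is chosen here for simplicity: it preserves the direction of the policy's action and, if the proposed changes sum to zero, keeps that balance. A projection onto the constraint set would be a closer fit to safety layers in the safe RL literature, but it needs a solver.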

Authors Yanbo Chen, Qintao Du, Honghai Liu, Liangcheng Cheng, Muhammad Shahzad Younis

Affiliations State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources and School of Electrical & Electronic Engineering; School of Engineering; China Electric Power Research Institute; National University of Sciences and Technology

Source Journal of Modern Power Systems and Clean Energy (现代电力系统与清洁能源学报（英文）), indexed in SCIE, EI, CSCD; 2024, Issue 3, pp. 742-753 (12 pages)

Funding Supported in part by the National Natural Science Foundation of China (No. 52077076) and in part by the National Key R&D Plan (No. 2021YFB2601502).

Keywords sequential security-constrained optimal power flow (SCOPF); expert experience; safety layer; renewable energy; safe reinforcement learning

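Classification TM73 [Electrical Engineering: Power Systems and Automation]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]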
