To solve the optimal power flow (OPF) problem, reinforcement learning (RL) emerges as a promising new approach. However, the RL-OPF literature is strongly divided regarding the exact formulation of the OPF problem as an RL environment. In this work, we collect and implement diverse environment design decisions from the literature regarding training data, observation space, episode definition, and reward function choice. In an experimental analysis, we show the significant impact of these environment design options on RL-OPF training performance. Further, we derive initial recommendations regarding the choice of these design decisions. The created environment framework is fully open-source and can serve as a benchmark for future research in the RL-OPF field.
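Flexible AC transmission system (FACTS) technology, typified by the unified power flow controller (UPFC), can distribute transmitted power sensibly, optimize system resources, and improve system stability and reliability. Based on the interior point optimization method, this paper proposes a reactive power optimization model that accounts for the UPFC. The objective function is the minimization of the system's active power network loss. The UPFC is represented by a voltage source model, and its effect is expressed as a set of voltage and power constraints that are placed directly among the constraints of the interior point method. The optimization is analyzed under different load operating modes. Tests on the IEEE 30-bus system show that introducing the UPFC increases the dimension of the coefficient matrix but does not affect convergence. The case study compares system network loss and voltage indices before and after installing the UPFC, and gives the UPFC parameter values under the optimal control scheme. The results show that the method is feasible and effective.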