Optimization of Inverter Multi-Frequency Control Parameters Based on Deep Reinforcement Learning

Abstract: Conventional closed-loop inverter control offers good static and dynamic performance, but it depends heavily on an accurate mathematical model of the system and adapts poorly to the model-parameter disturbances that arise when the inverter is connected to different loads or grid environments. This paper applies deep reinforcement learning to the tuning of the inverter's multi-frequency control parameters. A control model of the single-phase voltage-source inverter is established, and a method for approximately calculating the resonant coefficients from a steady-state error index is proposed; the resonant coefficient values at different frequency points and under different load requirements are then analyzed to guide their design. The instability features that appear during parameter adjustment are analyzed, and an FFT-based method for identifying them is designed, enabling online monitoring of the inverter's stability state during tuning. The adaptive tuning of the control parameters is then modeled as a Markov process, and the agent's state, action, and reward function are designed. To address the imbalance in the agent's samples, prioritized experience replay and an action-masking mechanism are introduced to improve learning efficiency. After training in simulation, the agent self-tunes the proportional-resonant (PR) controller parameters to obtain the best multi-frequency tracking performance. Finally, experiments on a purpose-built experimental platform show that the trained agent obtains control parameters meeting the accuracy requirement at each frequency point after relatively few training iterations, and that the system remains stable throughout the training process.
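The abstract does not state the paper's actual FFT-based criterion, so the following is only a minimal sketch of one plausible form such a check could take: it measures how much of the output spectrum lies away from the frequencies the controller is meant to track and flags the waveform when that share is large. The function names, the 10 Hz tolerance band, and the 5% threshold are all hypothetical choices made for illustration.

```python
import numpy as np

def unexpected_energy_ratio(v, fs, tracked_hz, tol_hz=10.0):
    """Share of spectral energy (DC region excluded) lying outside the tracked frequencies.

    v          : sampled output voltage (1-D sequence)
    fs         : sampling rate in Hz
    tracked_hz : frequencies the controller is expected to track (assumed known)
    tol_hz     : half-width of the band treated as "expected" around each frequency
    """
    v = np.asarray(v, dtype=float)
    # A Hann window limits spectral leakage when the window is not an exact period.
    spectrum = np.abs(np.fft.rfft(v * np.hanning(len(v)))) ** 2
    freqs = np.fft.rfftfreq(len(v), d=1.0 / fs)

    keep = freqs > tol_hz                      # ignore the DC region
    expected = np.zeros_like(keep)             # boolean mask of expected bins
    for f in tracked_hz:
        expected |= np.abs(freqs - f) <= tol_hz

    total = spectrum[keep].sum()
    if total == 0.0:
        return 0.0
    return float(spectrum[keep & ~expected].sum() / total)

def looks_unstable(v, fs, tracked_hz, threshold=0.05):
    """Hypothetical rule: flag the waveform if too much energy is off-target."""
    return unexpected_energy_ratio(v, fs, tracked_hz) > threshold
```

In a tuning loop of the kind the abstract describes, a flag like this could be evaluated on each new window of samples, so that a parameter choice producing an unexpected oscillation is detected before the next adjustment; how the paper couples its own detector to the agent is not specified here.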

Authors: QIN Risheng (覃日升); KUANG Hua (况华); JIANG He (姜訸); YU Hui (于辉); LI Hong (李虹); WAN Mingkai (万明凯); YIN Yilin (殷一林); LEI Wanjun (雷万钧) (Electric Power Science Research Institute of Yunnan Power Grid Co., Ltd., Kunming 650214, Yunnan, China; Yunnan Power Grid Co., Ltd., Kunming 650011, Yunnan, China; Honghe Power Supply Bureau of Yunnan Power Grid Co., Ltd., Honghe 661199, Yunnan, China; Dali Power Supply Bureau of Yunnan Power Grid Co., Ltd., Dali 672699, Yunnan, China; School of Electrical Engineering, Xi'an Jiaotong University, Xi'an 710049, Shaanxi, China)

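Affiliations: Electric Power Science Research Institute of Yunnan Power Grid Co., Ltd.; Yunnan Power Grid Co., Ltd.; Honghe Power Supply Bureau of Yunnan Power Grid Co., Ltd.; Dali Yongping Power Supply Bureau of Yunnan Power Grid Co., Ltd.; School of Electrical Engineering, Xi'an Jiaotong University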

Source: Power System and Clean Energy (《电网与清洁能源》), CSCD, Peking University Core Journal, 2024, No. 7, pp. 124-132 (9 pages)

Funding: National Key Research and Development Program of China (2018YFB0905800).

Keywords: inverter; proportional-resonant control; deep reinforcement learning; parameter optimization

Classification number: TM464 [Electrical Engineering: Electrical Apparatus]

