Deep Q network optimization control method for urban rail DC traction power supply photovoltaic-energy storage systems considering voltage stabilization rate and economy

Abstract: Applying photovoltaics to DC traction power supply systems can raise the penetration of renewable energy and reduce system energy consumption, but the uncertainty of renewable output and the strong fluctuation of train load make the control strategy harder to optimize. To address this problem, a control strategy optimization method based on deep reinforcement learning is proposed. Built on the deep Q network (DQN), the method treats the source-storage-load energy management system as an intelligent agent and trains it on external states such as the photovoltaic output, the state of charge of the energy storage, and the traction network voltage, yielding an optimized strategy for economic and reliable system operation. The framework and traditional control strategies of the integrated source-storage-load system are introduced, and the external characteristics of each device are modeled. The energy management problem of the integrated system is then modeled as a Markov decision process to establish the reinforcement learning framework. The method is validated by simulation on the MATLAB platform using data from a municipal rail line. The results show that the proposed method optimizes the control strategy by dynamically adjusting the voltage threshold of the energy storage. Compared with several traditional control strategies, it is better at balancing the system voltage stabilization level against operating economy. A comparison of convergence behavior in different environments demonstrates the inheritability of the method, and tests on multiple sets of samples verify its generality.
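The abstract names the ingredients of the Markov decision process (states built from photovoltaic output, storage state of charge, and traction network voltage; actions that adjust the storage voltage threshold; a reward trading off voltage stabilization against economy) but does not give their exact form. The Python sketch below is therefore only an illustration under assumptions, not the authors' MATLAB implementation: the `State` fields, the `ACTIONS` step sizes, the 1500 V nominal voltage, the tolerance band, the weight `w_voltage`, and the price are all hypothetical. It shows a weighted reward, epsilon-greedy action selection, and the one-step temporal-difference target that a DQN regresses its Q-values toward.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    """Hypothetical agent observation; field names and units are assumptions."""
    pv_power_kw: float    # photovoltaic output
    soc: float            # storage state of charge, in [0, 1]
    net_voltage_v: float  # traction network voltage


# Hypothetical discrete actions: shift the storage voltage threshold (volts).
ACTIONS = (-10.0, 0.0, 10.0)


def reward(net_voltage_v, grid_energy_kwh, v_nominal=1500.0, v_band=100.0,
           w_voltage=0.5, price=1.0):
    """Weighted trade-off: bonus for in-band voltage, penalty for energy cost."""
    in_band = 1.0 if abs(net_voltage_v - v_nominal) <= v_band else 0.0
    cost = price * grid_energy_kwh
    return w_voltage * in_band - (1.0 - w_voltage) * cost


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action index with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


def td_target(r, next_q_values, gamma=0.95, done=False):
    """One-step target r + gamma * max_a' Q(s', a'); just r at episode end."""
    return r if done else r + gamma * max(next_q_values)
```

In a full DQN, `q_values` and `next_q_values` would come from a neural network evaluated on the current and next `State`, and the network would be trained to minimize the squared gap between its prediction for the chosen action and `td_target`.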

Authors: LÜ Zongpu; DAI Chaohua; YAO Zhigang; ZHOU Binbin; GUO Ai; WU Lei (School of Electrical Engineering, Southwest Jiaotong University, Chengdu 610031, China; China Academy of Railway Sciences, Beijing 100080, China)

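Affiliations: School of Electrical Engineering, Southwest Jiaotong University; China Academy of Railway Sciences Corporation Limited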

Source: Electric Power Automation Equipment (《电力自动化设备》); indexed in EI, CSCD, and the Peking University Core Journals list; 2024, No. 10, pp. 46-52 (7 pages)

Funding: Beijing Natural Science Foundation-Fengtai Rail Transit Frontier Research Joint Fund (L221002); Sichuan Science and Technology Program (2020YJ0250)

Keywords: photovoltaic power generation; DC traction power supply system; improved control strategy; deep reinforcement learning; deep Q network (DQN)

Classification number: TM922.3 [Electrical Engineering: Power Electronics and Electric Drives]
