基于深度强化学习的铁路纵断面智能设计模型研究

Study on Deep Reinforcement Learning Model for Railway Vertical Alignment Design

下载PDF

导出

摘要传统智能算法通常要求变量维度在计算过程中不变,而铁路纵断面智能设计中的变坡点数量需要根据地形等变化自适应确定。考虑到强化学习能从地面高程和已经生成的线形等环境数据中获得最优策略的特点,将深度强化学习方法应用于纵断面智能设计,研究智能体决策变坡点的方法,提出铁路纵断面设计的变坡点决策模型,确定模型中的状态、动作、奖励等表达形式。结合纵断面设计约束多的特点,引入动作屏蔽机制处理约束,加快收敛并提高模型性能。将计算期引入模型的状态,提出通过单网络产生多个多目标策略的单网络多策略的多目标处理方法。通过实际工程案例验证了本文所提模型的正确性和有效性。 Traditional intelligent algorithms require a fixed number of variables to remain unchanged during the calcula-tion process,while the number of slope-change points in the intelligent design of railway vertical alignment needs to be adaptively determined according to changes in terrain.Considering the characteristics of reinforcement learning being able to learn and interact with environmental data such as ground elevations and generated alignments to obtain the opti-mal strategies,in this paper,the method of deep reinforcement learning was applied to the intelligent design of the verti-cal alignments,and the method for the intelligent agent to decide the slope-change points in sequence from front to back was studied.A grade change point decision-making model was proposed for railway vertical alignment design to determine the expression forms of states,actions and rewards in the model.At the same time,combined with the char-acteristics of many design constraints in the vertical alignment design,an action masking mechanism was introduced to deal with the constraints,accelerate the convergence and improve the performance of the model.In addition,by intro-ducing the computation period into the state of the model,a single-network multi-strategy multi-objective processing method was proposed to generate multiple multi-objective strategies through a single network.The correctness and ef-fectiveness of the models for single-objective and multi-objective profile problems were verified through practical engi-neering cases.

作者缪鹍戴炎林高鸿剑 MIAO Kun;DAI Yanin;GAO Hongjian(School of Civil Engineering,Central South University,Changsha 410075,China)

机构地区中南大学土木工程学院

出处《铁道学报》 EI CAS CSCD 北大核心 2024年第9期102-110,共9页 Journal of the China Railway Society

基金国家自然科学基金(51478480)。

关键词铁路纵断面设计深度强化学习安全强化学习动作屏蔽 railway vertical alignment design deep reinforcement learning safe reinforcement learning action mask

分类号 U212.34 [交通运输工程—道路与铁道工程]

引文网络
相关文献

1王成青.高速铁路穿越大型垃圾场存在问题与对策[J].铁道勘察,2023,49(6):50-55.
2王灿.污水管道纳入综合管廊设计要点分析[J].工程技术研究,2024,9(15):208-210.
3刘伟玲,张慧.基于多目标策略的高速铁路闭塞分区优化研究[J].铁路计算机应用,2024,33(4):65-70.
4张宾.基于Civil 3D和HEC-Ras的山洪沟纵断面设计[J].河南水利与南水北调,2024,53(9):53-55.
5刘宏伟,何庆成,李状,韩博,高伊航.雄安新区地下空间利用地质安全风险评价[J].水文地质工程地质,2024,51(5):207-220.
6曲道鹏,张涛,华晨曦,宋欣雨,程昌利,刘禹,王震宇.高强电磁屏蔽环氧复合材料的3D打印工艺研究[J].中国塑料,2024,38(9):24-29.
7胡辉,张军燕.新时代高职院校“双创”教育与劳动教育融合路径研究[J].湖北开放职业学院学报,2024,37(20):4-6.
8卢聪,罗扬,郭建春,曾凡辉.融合物理约束的压裂水平井产能智能预测框架构建与应用[J].天然气工业,2024,44(9):99-107.
9姚天磊,陈希亮,余沛毅.基于序列建模的生成式强化学习研究综述[J].计算机科学,2024,51(11):213-228.
10贾东昇,吴盼,于斐涵,刘峰.相变与变形中的约束[J].中国有色金属学报,2024,34(10):3209-3227.

铁道学报

2024年第9期

浏览历史

内容加载中请稍等...

基于深度强化学习的铁路纵断面智能设计模型研究

相关作者

相关机构

相关主题

浏览历史