Yard Crane Scheduling Method Based on Deep Reinforcement Learning for the Automated Container Terminal
Abstract: The yard crane is the core handling equipment in the yard of an automated container terminal, and scheduling it well is key to improving container handling efficiency. The yard crane scheduling problem exhibits complex spatio-temporal coupling and is highly dynamic. To address it, a mathematical programming model is built with the objective of minimizing the waiting time of automated guided vehicles (AGVs) and external container trucks, and a novel deep reinforcement learning method is proposed to solve it. The agent is designed to closely reflect the actual yard operating environment, and the agent-environment interaction uses a pointer network, an attention mechanism, and an actor-critic (A-C) architecture to improve the ability to extract hidden patterns from the state. Experiments are carried out on instances of different scales generated from actual data of the Yangshan Phase IV Automated Terminal. The results show that the proposed algorithm outputs yard crane scheduling schemes efficiently, providing an approximately optimal scheme in a relatively short time, and performs about 17% better than the heuristic rule algorithms it is compared against. The proposed scheduling method is therefore effective and superior, and it can provide dynamic decision support for yard operations in practice.
Authors: WANG Wuyin; HUANG Zizhao; ZHUANG Zilong; FANG Huaijin; QIN Wei (Institute of Industrial Engineering and Management, Shanghai Jiao Tong University, Shanghai 200240; Shanghai International Port (Group) Co., Ltd., Shanghai 200080)
Source: Journal of Mechanical Engineering (机械工程学报), 2024, No. 6, pp. 44-57 (14 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: Supported by the National Key Research and Development Program of China (2019YFB1704401).
Keywords: automated container terminal; yard; yard crane scheduling; deep reinforcement learning
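To make the optimization objective named in the abstract more concrete, the following is a minimal sketch of how the total vehicle waiting time could be evaluated for one crane service order. It is not the paper's mathematical programming model or its deep reinforcement learning method. The single-crane setting, the `Task` fields, the `travel_per_bay` parameter, and the three sample tasks are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """One container move requested by an AGV or external truck (hypothetical fields)."""
    name: str
    arrival: float   # time the vehicle arrives at the block
    bay: int         # bay where the crane must work
    handling: float  # crane handling time once it is in position


def total_waiting_time(sequence, start_bay=0, travel_per_bay=1.0):
    """Sum of vehicle waiting times when a single crane serves tasks in the given order.

    Assumed simplifications: one crane, constant travel time per bay,
    and no interference between cranes.
    """
    clock, bay, waiting = 0.0, start_bay, 0.0
    for task in sequence:
        ready = clock + abs(task.bay - bay) * travel_per_bay  # crane reaches the bay
        start = max(ready, task.arrival)   # crane idles if the vehicle is not there yet
        waiting += start - task.arrival    # vehicle waits if the crane arrives late
        clock, bay = start + task.handling, task.bay
    return waiting


def first_come_first_served(tasks):
    """A simple heuristic rule: serve vehicles in order of arrival."""
    return sorted(tasks, key=lambda t: t.arrival)


# Hypothetical instance: three vehicles arriving close together at distant bays.
a = Task("A", arrival=0.0, bay=10, handling=5.0)
b = Task("B", arrival=1.0, bay=0, handling=5.0)
c = Task("C", arrival=2.0, bay=9, handling=5.0)

fcfs_wait = total_waiting_time(first_come_first_served([a, b, c]))
alt_wait = total_waiting_time([b, c, a])
print(fcfs_wait, alt_wait)
```

In this toy instance the reordered sequence gives a lower total waiting time than first-come-first-served, which illustrates why the service order is worth optimizing. A learned policy such as the one described in the abstract would choose the next task from the current state rather than from a fixed rule.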