基于深度强化学习的自动化码头堆场场桥调度方法

Yard Crane Scheduling Method Based on Deep Reinforcement Learning for the Automated Container Terminal

导出

摘要场桥是自动化码头堆场中的核心作业机械,场桥的合理调度是集装箱作业效率提升的关键。针对场桥调度问题具有的复杂时空耦合特性和高度的动态性,以最小化自动导引车(Automatic guided vehicle,AGV)和外集卡的等待时间为优化目标构建数学规划模型,并提出一种新颖的深度强化学习方法进行求解。算法设计贴近实际堆场作业环境的智能体,并在智能体与环境的交互部分通过指针网络、注意力机制和演员-评论家(Actor-critic,A-C)架构的设计提高了获取状态中的隐藏模式的能力。在基于洋山四期自动化码头实际数据生成的不同规模的算例上展开试验,所提算法能实现场桥调度方案的高效输出,相较于一些启发式规则算法有17%左右的性能提升。试验结果表明所提调度方法是有效且优越的,能够在实际中为堆场作业提供动态决策支持。 As the core working machinery of automated terminal yard,the dispatching of yard crane is the key to improve the efficiency of container operation.In order to minimize the waiting time of AGVs and external container trucks,a mathematical programming model for the yard crane scheduling problem is established considering complex spatio-temporal coupling characteristics and high dynamic,and a novel deep reinforcement learning method is proposed to solve the problem.The algorithm describes the yard environment close to reality through the agent definition,and improves the ability of extracting hidden state patterns through pointer network,attention mechanism and A-C architecture in the interaction design between the agent and the environment.Experiments are carried out on examples of different scales based on the actual data of Yangshan Phase IV Automated Terminal.The results show that the proposed algorithm can provide an approximately optimal crane scheduling scheme in a relatively short time,and the performance of it is about 17%better compared with state-of-art heuristic rule algorithms.Therefore,the proposed scheduling method is effective and superior,and it can provide dynamic decision support for yard operation in practice.

作者王无印黄子钊庄子龙方怀瑾秦威 WANG Wuyin;HUANG Zizhao;ZHUANG Zilong;FANG Huaijin;QIN Wei(Institute of Industrial Engineering and Management,Shanghai Jiao Tong University,Shanghai 200240;Shanghai International Port(Group)Co.,Ltd.,Shanghai 200080)

机构地区上海交通大学工业工程与管理系上港国际港务(集团)股份有限公司

出处《机械工程学报》 EI CAS CSCD 北大核心 2024年第6期44-57,共14页 Journal of Mechanical Engineering

基金国家重点研发计划资助项目(2019YFB1704401)。

关键词自动化集装箱码头堆场场桥调度深度强化学习 automated container terminal yard yard crane scheduling deep reinforcement learning

分类号 U691 [交通运输工程—港口、海岸及近海工程]

引文网络
相关文献

参考文献4

1黄子钊,庄子龙,滕浩,秦威,秦涛,邹鹰.自动化码头出口箱箱位分配优化超启发式算法[J].计算机集成制造系统,2022,28(8):2619-2632. 被引量：8
2刘朝阳,穆朝絮,孙长银.深度强化学习算法与应用研究现状综述[J].智能科学与技术学报,2020(4):314-326. 被引量：45
3肖鹏飞,张超勇,孟磊磊,洪辉,戴稳.基于深度强化学习的非置换流水车间调度问题[J].计算机集成制造系统,2021,27(1):192-205. 被引量：30
4范华丽,熊禾根,蒋国璋,李公法.动态车间作业调度问题中调度规则算法研究综述[J].计算机应用研究,2016,33(3):648-653. 被引量：25

二级参考文献62

1熊禾根,李建军,孔建益,杨金堂,蒋国璋.考虑工序相关性的动态Job shop调度问题启发式算法[J].机械工程学报,2006,42(8):50-55. 被引量：33
2刑文训,谢金星.现代优化计算方法[M].北京:清华大学出版社,2001.
3Johnson S M. Optimal two and three-stage production schedules with setup times included[J] . Naval Research Logistics, 1954, 1(1):61-68.
4郑大钟, 赵千川. 离散事件动态系统[M] . 北京:清华大学出版社, 1999.
5Manne A S. On the Job-Shop scheduling problem[J] . Operations Research, 1960, 8(2):219-223.
6Van Hulle M M. A goal programming network for mixed integer linear programming:a case study for the Job-Shop scheduling problem[J] . International Journal of Neural Networks, 1991, 2(3):201-209.
7Balas E. Machine scheduling via disjunctive graphs:an implicit enumeration algorithm[J] . Operations Research, 1969, 17(6):941-957.
8McMahon G B, Florian M. On scheduling with ready times and due dates to minimize maximum lateness[J] . Operations Research, 1975, 23(3):475-482.
9Cheng Runwei, Gen M, Tsujimura Y. A tutorial survey of Job-Shop scheduling problems using genetic algorithms, part Ⅱ:hybrid genetic search strategies[J] . Computers and Industrial Engineering, 1999, 37(2):343-364.
10Xiong Hegen, Fan Huali, Li Gongfa. Genetic algorithm-based hybrid methods for a flexible single-operation serial-batch scheduling problem with mold constraints[J] . Sensors and Transducers, 2013, 155(8):232-241.

共引文献102

1曹红倩.应用改进Q-learning算法解决柔性作业车间调度问题[J].国外电子测量技术,2022,41(4):164-169. 被引量：3
2乔东平,裴杰,肖艳秋,周坤.蚁群算法及其应用综述[J].软件导刊,2017,16(12):217-221. 被引量：29
3张春燕.基于改进遗传进化算法的复杂作业流程调度[J].软件,2017,38(12):98-103. 被引量：2
4范华丽,熊禾根,蒋国璋,李公法,李梓响.基于遗传规划的动态作业车间调度规则生成[J].计算机集成制造系统,2018,24(4):876-885. 被引量：14
5王雄伟,陈春良,曹艳华,陈伟龙,吴同晗.考虑优先级的维修任务动态调度方法[J].兵工自动化,2018,37(6):83-87. 被引量：3
6解明利,胡占齐,马宁.基于最大熵神经网络算法的柔性制造系统调度策略研究[J].计算机应用研究,2018,35(12):3697-3700. 被引量：3
7周琪森,林杰,白翱.考虑班组负荷均衡的智能制造车间工序级作业任务排程模型研究[J].制造业自动化,2018,40(3):101-105. 被引量：5
8曾强,邓敬源,常梦辉,张进春.混合工作日历下作业车间调度遗传进化方法[J].中国机械工程,2018,29(22):2690-2702. 被引量：2
9罗弦,廖荣涛,查志勇,王逸兮,焦尧毅.云平台下电力系统能量备用实时调度模型研究[J].电子设计工程,2019,27(2):175-178. 被引量：10
10赵宏涛,许伟,陈峰,王涛.高速铁路列车运行计划自动调整系统研究[J].铁道运输与经济,2019,41(2):59-64. 被引量：10

1周宇涛,陈强,崔希良,刘耀徽,张枫.自动化集装箱码头堆场调度规则研究与优化[J].水运工程,2024(6):193-196.
2赖杭燕.深入挖掘大数据技术潜力全面提升企业人力资源管理效能[J].中国商界,2024(4):136-137. 被引量：2
3贾盼.传统节日走进幼儿园的实践探索[J].安徽教育科研,2024(10):122-123.
4郑心.力争做好一名星辰“监护人”——记北京航空航天大学航空科学与工程学院教授于洋[J].科学中国人,2024(5):82-83.
5余辰熠,魏洪乾,张幽彤.基于关联规则与离群点的新能源汽车动力域入侵检测[J].汽车工程学报,2024,14(3):412-421.
6你好长影[J].电影文学,2024(10).
7彭善鑫,刘婷婷,孙志清,朱晓松,左志文,廖瑞兰,付立芳,周卫萍.2013—2023年临沂地区伯克霍尔德菌的临床分布、感染特征及耐药分析[J].中国抗生素杂志,2024,49(4):463-468.
8贺鸿飞.探寻主流媒体短视频新闻的未来进路——以高流量短视频新闻为例[J].喜剧世界（上）,2023(10):102-104. 被引量：1
9张远庆.基于AGV的螺栓自动收集转运流水线的设计与应用[J].铁道建筑,2024,64(5):163-167.
10马芮萍,段筱筠,杨阳,王雪娇,李晓东,周春红.基于UPLC-MS/MS对3种类型洋葱中有机酸和氨基酸成分的分析[J].中国调味品,2024,49(6):168-175. 被引量：1

机械工程学报

2024年第6期

浏览历史

内容加载中请稍等...

基于深度强化学习的自动化码头堆场场桥调度方法

参考文献4

二级参考文献62

共引文献102

相关作者

相关机构

相关主题

浏览历史