Flexible Job Shop Scheduling Method Based on Collaborative Agent Reinforcement Learning Algorithm

Abstract: To improve the efficiency of flexible job shop scheduling, this paper constructs a Markov decision process (MDP) that incorporates the constraints of the flexible job shop scheduling problem (FJSP) and proposes a collaborative agent reinforcement learning method to handle the simultaneous selection of jobs and machines. In constructing the MDP, a disjunctive graph represents the state features, two agents select the job and the machine, and the reward for the whole scheduling process is derived from the difference between the predicted minimized maximum completion times (makespans) at different time steps. In the solving stage, a graph isomorphism network (GIN) is embedded to extract state features, and the job agent and the machine agent are each given an encoder-decoder component that outputs its action policy. The proximal policy optimization (PPO) and D3QN algorithms are used to train the decision-network parameters of the job and machine agents. Hyperparameters are selected with the orthogonal experiment method, and the method is compared on standard benchmark instances against results reported in other literature. The experimental results show that the proposed method clearly outperforms the compared algorithms in solving the FJSP, which supports its feasibility and effectiveness.
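The abstract describes the reward only in words: it is derived from the difference between makespan predictions at different time steps. The sketch below shows one common way such a reward can be built, and it is not taken from the paper. It assumes a lower-bound makespan estimate (each job's current completion time plus the shortest processing time of its remaining operations) and a per-step reward equal to the estimate before the action minus the estimate after it. The toy instance `JOBS`, the action list `ACTIONS`, and all function names are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical toy FJSP instance: JOBS[j][k] maps machine id -> processing
# time for the k-th operation of job j (an operation may have several options).
Instance = List[List[Dict[int, int]]]
JOBS: Instance = [
    [{0: 3, 1: 5}, {1: 2}],
    [{0: 4}, {0: 3, 1: 6}],
]
# Hypothetical joint actions: (job chosen by one agent, machine chosen by the other).
ACTIONS: List[Tuple[int, int]] = [(0, 0), (1, 0), (0, 1), (1, 1)]


def makespan_estimate(jobs: Instance, job_ready: List[int], next_op: List[int]) -> int:
    """Assumed estimator: a lower bound on the final makespan of a partial schedule."""
    bounds = []
    for j, ops in enumerate(jobs):
        # Shortest possible processing time of every operation not yet scheduled.
        remaining = sum(min(op.values()) for op in ops[next_op[j]:])
        bounds.append(job_ready[j] + remaining)
    return max(bounds)


def run_episode(jobs: Instance, actions: List[Tuple[int, int]]):
    """Apply the actions in order; return (rewards, initial estimate, final makespan)."""
    job_ready = [0] * len(jobs)          # completion time of each job so far
    machine_ready: Dict[int, int] = {}   # time at which each machine becomes free
    next_op = [0] * len(jobs)            # index of each job's next operation
    initial = makespan_estimate(jobs, job_ready, next_op)
    rewards = []
    for job, machine in actions:
        before = makespan_estimate(jobs, job_ready, next_op)
        duration = jobs[job][next_op[job]][machine]
        start = max(job_ready[job], machine_ready.get(machine, 0))
        job_ready[job] = machine_ready[machine] = start + duration
        next_op[job] += 1
        after = makespan_estimate(jobs, job_ready, next_op)
        rewards.append(before - after)   # negative when the estimate grows
    return rewards, initial, max(job_ready)


if __name__ == "__main__":
    rewards, initial, final = run_episode(JOBS, ACTIONS)
    print(rewards, initial, final)
```

Under these assumptions the per-step rewards telescope: their sum equals the initial estimate minus the final makespan, so maximizing the cumulative reward is equivalent to minimizing the makespan.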

Authors: Li Jian; Li Huankun; He Pengbo; Wang Huabei; Xu Liping; He Kui (School of Mechatronics Engineering, Henan University of Science and Technology, Luoyang 471000, China; Henan Collaborative Innovation Center for Advanced Manufacturing of Mechanical Equipment, Luoyang 471000, China)

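Affiliations: School of Mechatronics Engineering, Henan University of Science and Technology; Henan Collaborative Innovation Center for Advanced Manufacturing of Mechanical Equipment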

Source: Journal of System Simulation, 2024, No. 11, pp. 2699-2711 (13 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.

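Funding: National Key Research and Development Program of China (2018YFB1701205); Henan Province Science and Technology Research Project (212102210356).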

Keywords: flexible job shop scheduling problem; graph neural network (GNN); Markov decision process; collaborative agent reinforcement learning; orthogonal experiment method

Classification code: TP278 [Automation and Computer Technology: Detection Technology and Automatic Devices]
