期刊文献+

协同智能体强化学习算法的柔性作业车间调度方法研究

Flexible Job Shop Scheduling Method Based on Collaborative Agent Reinforcement Learning Algorithm
下载PDF
导出
摘要 为提高柔性作业车间调度效率,构建一种具有柔性作业车间调度问题约束条件的马尔可夫决策过程,针对工件与机器的同时选择问题,提出一种协同智能体强化学习方法进行求解。在构建马尔可夫决策过程中,引入析取图表述状态特征,采用两种智能体执行工件与机器的选取,预测不同时刻最小化最大完工时间的差值来映射整个调度过程的奖励参数;求解时,嵌入GIN(graph isomorphic network)图神经网络提取状态,为工件与机器智能体分别设置编码器-解码器构件输出两种动作策略,以PPO(proximal policy optimization)算法与D3QN算法训练工件与机器智能体的决策网络参数。通过正交试验法选取算法超参数,以标准实例与其他文献进行对比,实验结果表明,所提方法在求解FJSP方面明显优于其他算法,进一步验证所提方法的可行性与有效性。 To enhance the efficiency of flexible job shop scheduling,this paper develops a Markov decision process with specific constraints tailored to the scheduling problem.A cooperative agent reinforcement learning method is proposed to solve the problem of concurrent selection of workpieces and machines.During the construction of the Markov decision process,a disjunctive graph is introduced to represent the state characteristics.Two agents are introduced to select the workpieces and machines.The reward parameters governing the entire scheduling process are established by predicting variations in the minimum-maximum completion time across different time points.A GIN(graph isomorphic network)graph neural network is embedded in the solving procedure to extract the relevant state information.Encoder and decoder components are respectively set for the workpiece and machine agent to output two action strategies.The PPO(proximal policy optimization)algorithm and D3QN algorithm are used to train the decision network parameters for these agents.Algorithm hyperparameters,determined through the orthogonal experiment method,are compared with standard benchmarks and those in existing literature.The results demonstrate the significant superiority of the proposed method in solving the flexible job shop scheduling problem,further substantiating the feasibility and effectiveness of the method.
作者 李健 李洹坤 何鹏博 王化北 徐莉萍 何奎 Li Jian;Li Huankun;He Pengbo;Wang Huabei;Xu Liping;He Kui(School of Mechatronics Engineering,Henan University of Science and Technology,Luoyang 471000,China;Henan Collaborative Innovation Center for Advanced Manufacturing of Mechanical Equipment,Luoyang 471000,China)
出处 《系统仿真学报》 CAS CSCD 北大核心 2024年第11期2699-2711,共13页 Journal of System Simulation
基金 国家重点研发计划(2018YFB1701205) 河南省科技攻关项目(212102210356)。
关键词 柔性作业车间调度问题 图神经网络 马尔可夫决策过程 协同智能体强化学习 正交试验法 flexible job shop scheduling problem graph neural network(GNN) Markov decision process collaborative agent reinforcement learning orthogonal experiment method
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部