期刊文献+

应用改进Q-learning算法解决柔性作业车间调度问题 被引量:3

Application of improved Q-learning algorithm to solve flexible job shop scheduling problem
下载PDF
导出
摘要 柔性作业车间调度是目前加工系统中的一个重要调度问题,对于该问题的解决方法大都存在速度较慢等缺陷,提出一种改进的Q-learning算法,设计了动态的状态空间及动作集,通过引入“Sigmoid”函数作为动态选择策略改进Q-learning算法,使改进后的算法前期随机选择动作,后期在随机选择动作和选择奖励值最高的动作中动态变化,有效改善了传统Q-learning算法容易陷入局部最优且收敛速度慢等缺陷。将改进Q-learning算法应用到TSP问题中,证实改进算法的普适性和可行性,再将其应用解决柔性调度问题中,证实了其改进的有效性,提升了解决柔性作业车间调度问题的速度和精度。 Flexible job shop scheduling is an important scheduling problem in the current processing system.Most of the solutions to this problem have problems such as slow speed.This paper proposes an improved Q-learning algorithm and designs a dynamic state space and action set.Improve the Q-learning algorithm by introducing the"Sigmoid"function as a dynamic selection strategy,so that the improved algorithm randomly selects actions in the early stage,and dynamically changes in the later stage of the random selection of actions and the action with the highest reward value,effectively improving the traditional Q-learning algorithm It is easy to fall into defects such as local optimum and slow convergence speed.This article first applies the improved Q-learning algorithm to the TSP problem,confirms the universality and feasibility of the improved algorithm,and then applies it to solve the flexible scheduling problem,confirms the effectiveness of its improvement,and improves the solution to flexible job shop scheduling.The speed and accuracy of the problem.
作者 曹红倩 Cao Hongqian(School of Information and Control Engineering,Liaoning Petrochemical University,Fushun 130001,China)
出处 《国外电子测量技术》 北大核心 2022年第4期164-169,共6页 Foreign Electronic Measurement Technology
基金 辽宁省自然科学基金指导计划重点项目(20170540589)资助
关键词 Q-LEARNING 柔性 作业车间调度 Q-learning flexibility job shop scheduling
  • 相关文献

参考文献14

二级参考文献141

  • 1李辉,祁宇明.一种复杂环境下基于深度强化学习的机器人路径规划方法[J].计算机应用研究,2020,37(S01):129-131. 被引量:13
  • 2戴博,肖晓明,蔡自兴.移动机器人路径规划技术的研究现状与展望[J].控制工程,2005,12(3):198-202. 被引量:75
  • 3江雷.基于并行遗传算法的弹性TSP研究[J].微电子学与计算机,2005,22(8):130-133. 被引量:10
  • 4周涛.基于改进遗传算法的TSP问题研究[J].微电子学与计算机,2006,23(10):104-106. 被引量:19
  • 5Hung Dinh Nguyen, Kunihito Yamamori, Moritoshi Yasunaga, et al. Implementation of an effective hybrid GA for large-scale traveling salesman problems [J]. IEEE Transactions on Systems, Man, And Cybernetics, 2007,37(1) : 92--99.
  • 6Tsai C F, Tsai C W, Tseng C C. A new approach for sovling large traveling salseman problem [ C]// Pro ceedings of the 2002 Congress on Evolutionary Computation. Honolulu, HI: IEEE, 2002(8):1636-1641.
  • 7Firbes J. Burkowski, proximity and priority: applying a gene expression algorithm to the traveling salesperson problem [C]// International Parallel and Distributed Processing Symposium. Nice:IEEE, 2003,30(5): 803 - 816.
  • 8Lin S, Kernighan B W. An effective heuristic algorithm for the traveling-salesman problem[J]. Operations Re search, 1973,21(2) :498-516.
  • 9He Xiaoxian, Zhu Yunlong, Hu Hechun. A new algo rithm for TSP based on swarm intelligence[C]// Proceeding of the 6th World Congress on Intelligent Cont reland Automation. Dalian: IEEE, 2006 :3241 -- 3244.
  • 10GAREY M R, JOHNSON D S, SETHI R. The complexity of flow shop and job shop scheduling[ J]. Mathematics of Operations Research, 1976,1 (2) :117-129.

共引文献187

同被引文献28

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部