期刊文献+

同顺序Flow-shop问题的一种遗传强化学习算法 被引量:5

A Genetic Reinforcement Learning Algorithm for Permutation Flow-shop Scheduling Problem
原文传递
导出
摘要 针对Flow-shop排序问题的固有复杂性,设计了一种遗传强化学习算法.首先,引入状态变量和行动变量,把组合优化的排序问题转换成序贯决策问题加以解决;其次,设计了一个Q-学习算法和基于组合算子的遗传算法相集成,遗传算法利用染色体的优良模式及其适应值信息来指导智能体的学习过程,提高学习效率和效果,强化学习则对染色体进行局部优化进而改良遗传群体,二者有机结合共同解决Flow-shop排序问题;再次,提出了多种适应性策略,使算法关键参数能够周期性递变,以更好地在深度搜索和广度搜索之间均衡;最后,仿真优化实验结果验证了该算法的有效性. Considering the inherent complexity of Flow-shop scheduling problem, an algorithm named Genetic Reinforcement Learning, GRL, is designed to solve it. First, state variable and action variable are employed to transform the combinational-optimization scheduling problem into sequential-decision problem. Secondly, a Q-Learning algorithm is proposed to integrate with a Genetic Algorithm based on combined operators. The agent is supervised by chromosomes' good modes and their fitness information. As a result, the agent' s learning performance is improved. The genetic population is also meliorated by the local optimization of Reinforcement Learning to each chromosome. So GA and RL are integrated in GRL to solve the Flow-shop scheduling problem. Thirdly, several self-adaptive policies are introduced into GRL algorithm to make it balance in exploitation and exploration. Finally, the algorithm is validated by simulation experiments.
出处 《系统工程理论与实践》 EI CSCD 北大核心 2007年第9期115-122,共8页 Systems Engineering-Theory & Practice
基金 国家自然科学基金(70371005 70521001) 高等学校博士学科点专项科研基金(20020006-4) 新世纪优秀人才支持计划(NCET-04-0175)
关键词 FLOW-SHOP 遗传算法 强化学习 自适应 flow-shop genetic algorithm reinforcement learning self- adaptive
  • 相关文献

参考文献11

  • 1French S.Sequencing and Scheduling:an Introduction to the Mathematics of Job Shop[M].England:Ellis Horwood Ltd.,West Sussex,1982.
  • 2Garey M R,Johnson D S,Sethi R.The complexity of flowshop and jobshop scheduling[J].Mathematics of Operations Research,1976,1(2):117-129.
  • 3韦有双,杨湘龙,冯允成.一种新的求解Flow Shop问题的启发式算法[J].系统工程理论与实践,2000,20(9):41-47. 被引量:11
  • 4Gen M,Cheng R W.Genetic Algorithms and Engineering Optimization[M].America:John Wiley & Sons,Inc.2000.
  • 5Richard S S,Andrew G B.Reinforcement Learning:An Introduction[M].America:MIT Press,Cambridge,MA,1998.
  • 6Tom M M.Machine Learning[M].New York:McGraw-Hill Press,1997.
  • 7Wang Y C,Usher J M.Learning policies for single Machine Job dispatching[J].Robotics and Computer-Integrated Manufacturing,2004,20(6):553-562.
  • 8Wang Y C,Usher J M.Application of reinforcement learning for agent-based production scheduling[J].Engineering Applications of Artificial Intelligence,2005,18(1):73-82.
  • 9吴继伟,杨定鹏,萧蕴诗.多智能体协作方法及其应用研究[J].控制与决策,2004,19(2):216-218. 被引量:5
  • 10吕赐兴,朱云龙,尹朝万,于海斌.基于多Agent的敏捷生产调度中的协商策略[J].计算机集成制造系统,2006,12(4):579-584. 被引量:5

二级参考文献18

  • 1王艳红,尹朝万.一类基于多Agent和分布式规则的敏捷生产调度[J].控制理论与应用,2004,21(4):526-530. 被引量:8
  • 2Hoon Shik Woo,Computers Operations Research,1998年,25卷,3期,175页
  • 3沈英俊,硕士学位论文,1996年
  • 4陈荣秋,排序的理论与方法,1987年
  • 5石锦惠.[D].上海:同济大学电气工程系,1999.
  • 6RABELO R J,CAMARINHA-MATOS L M,AFSARMANESH H.Multi-Agent-based agile scheduling[J].Robotic and Autonomous System,1999,27(1-2):15-28.
  • 7WANG Yanhong,YIN Chaowan,ZHANG Y.A multi-Agent and distributed ruler based approach to production scheduling of agile manufacturing systems[J].International Journal Computer Integrated Manufacturing,2003,16(2):81-92.
  • 8SHEN W.Distributed manufacturing scheduling using intelligent Agents[J].IEEE Intelligent Systems,2002,17 (1):88-94.
  • 9SANDHOLM T,LESSER V.Leveled commitment contracting:a backtracking instrument for multi-Agent systems[J].AI Magazine,2002,23(3):89-100.
  • 10SHEN W,DOUGLAS H N.Agent-based systems for intelligent manufacturing:a state of-the-art survey[J].Knowledge and Information Systems,1999,1 (2):129-156.

共引文献18

同被引文献91

引证文献5

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部