期刊文献+

基于Q-学习的动态单机调度 被引量:11

Q-Learning Based Dynamic Single Machine Scheduling
下载PDF
导出
摘要 针对当前基于Q-学习的Agent生产调度优化研究甚少的现状,利用Q-学习对动态单机调度问题在3种不同系统目标下的调度规则动态选择问题进行了研究.在建立Q-学习与动态单机调度问题映射机制的基础上,通过MATLAB实验仿真,对算法性能进行了评价.仿真结果表明,对于不同的系统调度目标,Q-学习能提高Agent的适应能力,达到单一调度规则无法达到的性能,适合基于Agent的动态生产调度环境. Q-learning was applied to a dynamic single-machine scheduling problem. Corresponding to the environment status change and three predefined system performance measurement, the machine agent that is embedded with Q-learning can select an appropriate dispatching rule dynamically. Based on the model between Q-learning and the dynamic single-machine scheduling problem, the performance of Q-learning was evaluated through simulations in MATLABa environment. The simulation results demonstrate that Q-learning can perform well for different system objectives, which is impossible for single dispatching rule. Therefore, Q-learning is promising for application to the agent-based dynamic production scheduling.
出处 《上海交通大学学报》 EI CAS CSCD 北大核心 2007年第8期1227-1232,1243,共7页 Journal of Shanghai Jiaotong University
基金 国家自然科学基金资助项目(60574054) 2006年新世纪优秀人才支持计划
关键词 Q-学习 强化学习 动态单机调度 调度规则选择 Q-learning reinforcement learning dynamic single machine scheduling dispatching rules selection
  • 相关文献

参考文献10

  • 1Shen W M,Wang L H,Hao Q.Agent-based distributed manufacturing process planning and scheduling:A state-of-the-art survey[J].IEEE Transactions on Systems,Man,and Cybernetics-Part C:Applications and Reviews,2006,36(4):563-577.
  • 2Liu J M.Autonomous agents and multi-agent systems:Explorations in learning,self-organization and adaptive computation[M].Singapore:World Scientific,2001.
  • 3李冬梅,陈卫东,席裕庚.基于强化学习的多机器人合作行为获取[J].上海交通大学学报,2005,39(8):1331-1335. 被引量:4
  • 4范波,潘泉,张洪才.一种基于分布式强化学习的多智能体协调方法[J].计算机仿真,2005,22(6):115-117. 被引量:5
  • 5Crites R H,Barto A G.Improving elevator performance using reinforcement learning[C]// Advances in Neural Information Processing Systems.Cambridge,MA:MIT Press,1996:1017-1023.
  • 6Aydin M E,Oztemel E.Dynamic job-shop scheduling using reinforcement learning agents[J].Robotics and Autonomous Systems,2000,33:169-178.
  • 7Wang Y C,Usher J M.Application of reinforcement learning for agent-based production scheduling[J].Engineering Applications of Artificial Intelligence,2005,18:73-82.
  • 8魏英姿,赵明扬.资源受限单机动态调度的并行GA算法研究[J].系统仿真学报,2005,17(4):827-830. 被引量:2
  • 9Watkin C,Dayan P.Q-Learning[J].Machine Learning,1992,8:279-292.
  • 10Kaelbling L P,Littman M L,Moore A W.Reinforcement learning:A survey[J].Journal of Artificial Intelligence Research,1996,4:237-285.

二级参考文献21

  • 1贾建强,陈卫东,席裕庚.全自主足球机器人系统关键技术综述[J].上海交通大学学报,2003,37(z1):45-49. 被引量:13
  • 2玄光南 程润伟.遗传算法与工程设计[M].北京:科学出版社,2000..
  • 3Andreas S Schulz; Martin Skutella The power of -points in preemptive single machine scheduling [J]. Journal of Scheduling 2002, 5: 121-133.
  • 4Sonke Hartmann. A Competitive Genetic Algorithm for Resource- Constrained Project Scheduling [J]. Naval Research Logistics, 1998, 45: 733-750.
  • 5Miyashita K. Job-Shop Scheduling with Genetic Programming [A]. Proceeding of the Genetic and Evolutionary Computation Conference (GECCO), 2000, 505-512.
  • 6P L Kaelbling, L M Littman, W A Moore. Reinforcement Learning: A survey[J]. Journal of Artificial Intelligence Research, 1996, 4: 237-285.
  • 7R Bellman.Dynamic Programming: deterministic and stochastic models[M].Prentice-Hall, Englewood Cliffs, NJ, 1957.
  • 8R A Howard. Dynamic Programming and Markov progress[M].Springer-Verlag, 1960.
  • 9C J C H Watkons and P Dayan. Q-leanign[J]. Machine Learning, 1992, 8(3): 279-292.
  • 10Nicolescu M, Mataric M J. Learning and interacting in human-robot domains [J].Socially Intelligent Agents, 2001, 31(5):419-430.

共引文献8

同被引文献115

引证文献11

二级引证文献92

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部