期刊文献+

可重入生产系统的递阶增强型学习调度 被引量:2

HIERARCHICAL REINFORCEMENT LEARNING SCHEDULE FOR RE-ENTRANT MANUFACTURING SYSTEM
下载PDF
导出
摘要 对平均报酬型马氏决策过程 ,本文研究了一种递阶增强型学习算法 ;并将算法应用于一个两台机器组成的闭环可重入生产系统 ,计算机仿真结果表明 ,调度结果优于熟知的两种启发式调度策略 . In this paper, a hierarchical reinforcement learning algorithm is investigated for Markov Decision Process with average reward. And it is applied to a close re entrant manufacturing system composed of two machines. Computer simulation demonstrates that the algorithm outperforms two well known heuristic scheduling policies.
出处 《信息与控制》 CSCD 北大核心 2001年第3期199-203,共5页 Information and Control
基金 国家重点基础研究发展规划项目!G19980 2 0 3 0 2 西安交通大学机械制造系统工程国家重点实验室资助课题
关键词 超大规模集成电路 可重入生产系统 递阶增强型学习算法 启发式调速 markov decision process,hierarchical,reinforcement learning, schedule
  • 相关文献

参考文献4

  • 1郑应平 赵丽娜.离散事件与混杂系统的调度控制[J].控制理论与应用,1999,16:82-86.
  • 2郑应平,控制理论与应用,1999年,16卷,增刊,82页
  • 3Jin H,Math Oper Res,1997年,22卷,4期,886页
  • 4Dean T,Decomposition Techniques for Planningin Stochastic Domains Proceedings of the14 th Int Joint Confere,1121页

共引文献3

同被引文献25

  • 1Kumar P R.Re-entrant lines[J].Special Issue on Queueing Networks,1993,13(May):87-110.
  • 2Harrison J M,Wein L M.Scheduling of queues:heavy traffic analysis of a two-station closed network[J].Operation Research,1990,38:1052-1064.
  • 3Lu S C H,Ramaswamy D,Kumar P R.Efficient scheduling policies to reduce mean and variance of cycle-time in semiconductor manufacturing plants[J].IEEE Trans.Semiconductor Manufacturing,1994,7:374-385.
  • 4Lu S C H,Kumar P R.Fluctuation smoothing schedbilee workshop on computing and intelligent systems[M].India:Bangalore,1993.
  • 5Johnson S M.Optimal two-and three-stage production schedules with set-up times included[J].Nav.Res.Logistic.Quart,1954,1:61-68.
  • 6Garey M R,Johnson D S.Computers and intertractability:a guide to the theory of NP-completeness[M].San Francisco,California:Freeman W H,1979.
  • 7Bertsekas D P,Tsitsiklis J N.Nero-dynamic programming[M].Athena Scientific,1996.
  • 8Lippman S.Applying a new device in the optimization of exponential queueing systems[J].Operation research,1975,23:687-710.
  • 9MIYASHITA K.Learning scheduling control knowledge through reinforcements[J].International Transactions in Operational Research,2000,7(2):125-138.
  • 10PINEDO M.Scheduling:theory,algorithms,and systems[M].2nd ed.Upper Saddle River,N.J.,USA:Prentice Hall,2002.

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部