期刊文献+

APPLICATION OF HIERARCHICAL REINFORCEMENT LEARNING IN ENGINEERING DOMAIN 被引量:3

APPLICATION OF HIERARCHICAL REINFORCEMENT LEARNING IN ENGINEERING DOMAIN
原文传递
导出
出处 《Journal of Systems Science and Systems Engineering》 SCIE EI CSCD 2005年第2期207-217,共11页 系统科学与系统工程学报(英文版)
基金 ThisworkwassupportedpartlybytheNationalNaturalScienceFoundationofChinaunderGrantNo.69975013
关键词 Engineering domain knowledge CONTROLLER reinforcement learning elevator group control Engineering domain knowledge, controller, reinforcement learning, elevator,group control
  • 相关文献

参考文献11

  • 1[1]Bao, G., C. G. Cassandras, T. E. Djaferis,A.D. Gandhi, and D. P. Looze, "Elevator dispatchers for down peak traffic", ECE Department Technical Report, University of Massachusetts, 1994.
  • 2[2]Barto, A. G., S. Mahadevan, "Recent advances in hierarchical reinforcement learning", Discrete Event Dynamic Systems:Theory and Applications, Vol. 13, pp41-77,2003.
  • 3[3]Bradtke, S. J. and M. O. Duff,"Reinforcement learning methods for continuous-time Markov decision problems", Advances in Neural Information Processing Systems 7,Cambridge, MA, 1995.
  • 4[4]Crites, R. H. and A. G. Barto, "Improving elevator performance using reinforcement learning", Advances in Neural Information Processing Systems 8, pp1017-1023, 1996.
  • 5[5]Mahadevan, S., M. Nicholas, D. Tapas. and G. Abhijit, "Self-Improving factory simulation using continuous-time average-reward reinforcement learning",Proceedings of the 14th International Conference on Machine Learning (IMLC ′97), Nashville, TN, 1997.
  • 6[6]Mataric, M., "Reinforcement learning in the multi-robot domain", Autonomous Robots, Vol. 4, No. 1, pp73-83, 1997.
  • 7[7]Parr, R., "Hierarchical control and learning for markov decision processes", Ph.D.dissertation, University of California,Berkeley, CA, 1998.
  • 8[8]Rajbala, M., M. Sridhar, and G.Mohammad, "Hierarchical multi-agent reinforcement learning", Proceedings of the fifth International Conference on Autonomous Agents, pp246-253, 2001.
  • 9[9]Sutton, R.S. and A.G. Barto, Reinforcement Learning: An Introduction, Cambridge,MA: MIT Press, 1998.
  • 10[10]Szepesvari, C. and M. L. Littman, "A unified analysis of value-function-based reinforcement learning algorithms", Neuro Computing, Vol. 11, pp2017-2060, 1999.

同被引文献46

  • 1苏畅,高阳,陈世福,陈兆乾.基于SMDP环境的自主生成options算法的研究[J].模式识别与人工智能,2005,18(6):679-684. 被引量:9
  • 2彭志平,彭宏,郑启伦.一种双边多议题自治协商模型的研究[J].电子与信息学报,2007,29(3):733-738. 被引量:12
  • 3高阳,周如益,王皓,曹志新.平均奖赏强化学习算法研究[J].计算机学报,2007,30(8):1372-1378. 被引量:38
  • 4FISCHER F, ROVATSOS M, WEISS G. Hierarchical reinforcement learning in communication-mediated multiagent coordination [ C ]// Proc of the 3rd International Conference on Autonomous Agents and Muhiagent Systems. New York: ACM Press, 2004.
  • 5HENGST B. Discovering hierarchy in reinforcement learning [ D ]. Sydney: University of New South Wales, 2003.
  • 6SKELLY M M. Hierarchical reinforcement learning with function approximation for adaptive control [ D ]. Ohio : Case Western Reserve University, 2004.
  • 7UTHER W T B. Tree based hierarchical reinforcement learning[ D]. Pittsburgh: Carnegie Mellon University, 2002.
  • 8BELLMAN R E, DREYFUS S E. Applied dynamic programming [ M ]. New Jersey : Princeton University Press, 1962.
  • 9WATKINS C, DAYAN P. Q-learning[J]. Machine Learning, 1992,8(3 ) :279-292.
  • 10PARR R. Hierarchical control and learning for Markov decision processes [ D ]. Berkeley, Califomia: University of California, 1998.

引证文献3

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部