期刊文献+

强化学习中资格迹的作用 被引量:1

The Function of Eligibility Traces in Reinforcement Learning
下载PDF
导出
摘要 强化学习一词来自行为心理学该学科把学习看作反复试验的过程,强化学习系统中的资格迹用来解决时间信度分配问题,文章介绍,了资格迹的基本原理和实现方法。 The word, reinforcement learning, comes from behavior psychology. This subject takes learning as trial and error process so as to map world state to the actions. The eligibility traces of reinforcement learning system are used to solve temporal credit assignment problems. In this paper, the basic principle and implementation methods of eligibility traces are presented. ;;
出处 《计算机工程》 CAS CSCD 北大核心 2002年第5期128-129,198,共3页 Computer Engineering
基金 黑龙江省自然科学基金资助项目()F9911
关键词 资格迹 强化学习 机器学习 智能系统 Eligibility traces Reinforcement learning Machine learning
  • 相关文献

参考文献5

  • 1[1]Sutton R S. Temporal Credit Assignment in Reintorcement Learning. [PhD Thesis],U niversity of Massachusetts, Amherst. M A, 1984
  • 2[2]Sutton R S. Learning to Predict by the Methods of Temporal Difference. Machine Learning,1988(3):9-44
  • 3[3]Dayan P,The Convergence of TD for General Machine Learning, 1992(8):341-362
  • 4[4]Singh S P. Reinforcement Learning with Replacing Eligibility Trace. Machine learning, 1996(22): 123-158
  • 5[5]Watkins J C H,Dayan P. Q-learning. Machine Learning, 1992(8 ):279

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部