Abstract
The term reinforcement learning comes from behavioral psychology, which treats learning as a trial-and-error process that maps world states to actions. Eligibility traces in a reinforcement learning system are used to solve the temporal credit assignment problem. This paper presents the basic principles and implementation methods of eligibility traces.
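As an illustration of how eligibility traces address temporal credit assignment, the following is a minimal sketch of tabular TD(λ) with accumulating traces, the standard textbook formulation. It is not taken from the paper, whose own implementation is not reproduced in this record. The function name `td_lambda_episode`, the `(state, reward, next_state)` transition format, and the parameter defaults are assumptions made for this example.

```python
def td_lambda_episode(values, transitions, alpha=0.1, gamma=1.0, lam=0.9):
    """Run one episode of tabular TD(lambda) with accumulating traces.

    values      -- list of state-value estimates, updated in place
    transitions -- list of (state, reward, next_state) tuples;
                   next_state is None when the episode terminates
    alpha, gamma, lam -- step size, discount factor, trace-decay rate
                   (illustrative defaults, not values from the paper)
    """
    traces = [0.0] * len(values)  # one eligibility trace per state
    for state, reward, next_state in transitions:
        next_value = 0.0 if next_state is None else values[next_state]
        # One-step temporal-difference error for the current transition.
        td_error = reward + gamma * next_value - values[state]
        # Accumulating trace: mark the visited state as eligible.
        traces[state] += 1.0
        for s in range(len(values)):
            # Each state receives credit in proportion to its trace,
            # then its trace decays by gamma * lam.
            values[s] += alpha * td_error * traces[s]
            traces[s] *= gamma * lam
    return values


# Three-state chain 0 -> 1 -> 2 -> terminal, reward 1 on the last step.
chain = [(0, 0.0, 1), (1, 0.0, 2), (2, 1.0, None)]
with_traces = td_lambda_episode([0.0, 0.0, 0.0], chain, alpha=0.5, lam=0.5)
without_traces = td_lambda_episode([0.0, 0.0, 0.0], chain, alpha=0.5, lam=0.0)
```

With `lam=0.5` the final reward is credited to all three states in a single episode, with earlier states receiving less. With `lam=0.0` only the last state is updated, which is the one-step TD case.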
Source
《计算机工程》
CAS
CSCD
PKU Core (北大核心)
2002, No. 5, pp. 128-129, 198 (3 pages)
Computer Engineering
Funding
Supported by the Natural Science Foundation of Heilongjiang Province (F9911)
Keywords
Eligibility traces
Reinforcement learning
Machine learning
Intelligent systems