Control Task for Reinforcement Learning with Known Optimal Solution for Discrete and Continuous Actions

Abstract: The overall research in Reinforcement Learning (RL) concentrates on discrete sets of actions, but for certain real-world problems it is important to have methods which are able to find good strategies using actions drawn from continuous sets. This paper describes a simple control task called direction finder and its known optimal solution for both discrete and continuous actions. It allows for comparison of RL solution methods based on their value functions. In order to solve the control task for continuous actions, a simple idea for generalising them by means of feature vectors is presented. The resulting algorithm is applied using different choices of feature calculations. For comparing their performance a simple measure is introduced.

Authors: Michael C. ROTTGER, Andreas W. LIEHR

Affiliation: not specified

Source: Journal of Intelligent Learning Systems and Applications, 2009, No. 1, pp. 28-41 (14 pages)

Keywords: comparison; continuous actions; example problem; reinforcement learning; performance

Classification code: R73 [Medicine and Health: Oncology]
