期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Greedy feature replacement for online value function approximation
1
作者 feng-fei zhao Zheng QIN +2 位作者 Zhuo SHAO Jun FANG Bo-yan REN 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2014年第3期223-231,共9页
Reinforcement learning(RL) in real-world problems requires function approximations that depend on selecting the appropriate feature representations. Representational expansion techniques can make linear approximators ... Reinforcement learning(RL) in real-world problems requires function approximations that depend on selecting the appropriate feature representations. Representational expansion techniques can make linear approximators represent value functions more effectively; however, most of these techniques function well only for low dimensional problems. In this paper, we present the greedy feature replacement(GFR), a novel online expansion technique, for value-based RL algorithms that use binary features. Given a simple initial representation, the feature representation is expanded incrementally. New feature dependencies are added automatically to the current representation and conjunctive features are used to replace current features greedily. The virtual temporal difference(TD) error is recorded for each conjunctive feature to judge whether the replacement can improve the approximation. Correctness guarantees and computational complexity analysis are provided for GFR. Experimental results in two domains show that GFR achieves much faster learning and has the capability to handle large-scale problems. 展开更多
关键词 Reinforcement learning Function approximation Feature dependency Online expansion Feature replacement
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部