摘要
在HAMs框架中引入策略耦合SMDPs的观点,定义了HAM-可分解概念,并明确了HAM机、HAM-可分解及策略耦合SMDPs这三者之间的关系,证明了HAM框架适合解决策略耦合SMDPs问题.在此基础上,针对一类具有有向无环图形式的策略耦合SMDPs问题,提出一种层次分解方法,并给出一个判断层次分解有效性的条件.最后使用一个典型的实验来说明该方法的特点.
This paper introduces the concept of "policy-coupled" semi-Markov decision processes (SMDPs) into HAMs. It defines the concept of HAM-decomposable and makes the relations among the HAM machine, HAM-decomposable, and "policy-coupled" SMDPs clear. It also proves that HAMs is suitable for solving the "policy-coupled" SMDPs problem. Based on these, this paper gives a method for hierarchical decomposition on a class of "policy-coupled" SMDPs with a DAG call graph and presents a precondition that can be used for determining whether or not can generate a valid hierarchical decomposition. Lastly, a typical experiment is tested for illustrating the characteristics of this method.
出处
《小型微型计算机系统》
CSCD
北大核心
2008年第4期653-658,共6页
Journal of Chinese Computer Systems
基金
国家自然科学基金面上项目(60503048)资助
关键词
层次强化学习
层次抽象机
策略耦合SMDPs
hierarchical reinforcement learning
hierarchies of abstract machines
policy-coupled semi-Markov decision processes