期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Multi-agent reinforcement learning based on policies of global objective
1
作者 张化祥 黄上腾 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2005年第3期676-681,共6页
In general-sum games, taking all agent's collective rationality into account, we define agents' global objective, and propose a novel multi-agent reinforcement learning(RL) algorithm based on global policy. In eac... In general-sum games, taking all agent's collective rationality into account, we define agents' global objective, and propose a novel multi-agent reinforcement learning(RL) algorithm based on global policy. In each learning step, all agents commit to select the global policy to achieve the global goal. We prove this learning algorithm converges given certain restrictions on stage games of learned Q values, and show that it has quite lower computation time complexity than already developed multi-agent learning algorithms for general-sum games. An example is analyzed to show the algorithm' s merits. 展开更多
关键词 Markov games reinforcement learning collective rationality policy.
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部