Optimal Response Learning and Its Convergence in Multiagent Domains
Authors: 张化祥, 黄上腾, 乐嘉锦. Journal of Donghua University (English Edition), indexed in EI and CAS, 2005, No. 3, pp. 116-119 (4 pages).
In multiagent reinforcement learning, an agent adopts quite different learning rules, and achieves different learning performance, depending on what it assumes about its opponents’ policies. We prove that, in multiagent domains, convergence of the Q values is guaranteed only when an agent behaves optimally and its opponents’ strategies satisfy certain conditions, and that an agent achieves its best learning performance when it adopts the same learning algorithm as its opponents.
Keywords: MULTIAGENT LEARNING POLICY