期刊文献+

动态联盟收益值的再励学习 被引量:1

Reinforcement Learning for the Value of Dynamic Coalition
下载PDF
导出
摘要 联盟形成的收益值是模糊和不确定的,难于计算,而联盟收益值在成员变化的情况下的计算就更为复杂。Lerman等人实现了动态联盟Agent进出联盟的管理方法,Chalkiadakis则研究了不确定情况下联盟的再励学习,但没有涉及联盟成员变化情况下的收益值动态性。论文定义了带折扣率的估计核,给出一种再励学习算法来计算联盟成员变化后的收益值,深化了Chalkiadakis的工作。实验结果验证了该方法的可行性和正确性。 It is difficult to compute the value of dynamic coalition because of its fuzzy and uncertain character.It is even more difficult to compute the value while the number of coalition member changes,Lerman implements the management methods for agents joining and leaving coalition.Chalkiadakis investigates Bayesian reinforcement learning for coalition formation under uncertainty,but he has not investigated the value of dynamic coalition with the change of dynamic coalition membership.In this paper an estimate core using discount factor is defined.A reinforcement learning method is proposed to compute the value of dynamic coalition.It improves the work of Chalkiadakis.The experiment result demonstrates that it is feasible and correct.
作者 童向荣 张伟
出处 《计算机工程与应用》 CSCD 北大核心 2006年第6期85-87,共3页 Computer Engineering and Applications
基金 国家自然科学基金重大资助项目(编号:60496323) 山东省教育厅科技计划资助项目(编号:JSJ03J1)
关键词 多AGENT系统 动态联盟形成 再励学习 multi-agent system,dynamic coalition formation,reinforcement learning
  • 相关文献

参考文献9

  • 1Sandholm T,Larson K,anderson M et al.coalition Structure Genera tion with Worst Case Guarantees[J].Artificial Intelligence,1999;111(1-2):209~238
  • 2Shehory O,Kraus S.Methods for task allocation via Agent coalition formation[J].Artificial Intelligence,1998;101 (1-2):165~200
  • 3Mares M.Fuzzy cooperative games-Cooperation with vague expectations[M].Physical Verlag,2001
  • 4Sarit Kraus,Onn Shehory,Gilad Taase.Coalition Formation with Uncertain Heterogeneous Information[C].In:AAMAS'03,Melboume,Australia,2003:14~18
  • 5K Lerman,O Shehory.Coalition Formation for Large Scale Electronic Markets[C].In:Proceedings of the Fourth International conference on MultiAgent Systems ICMAS'2000,Boston,2000:216~222
  • 6Georgios Chalkiadakis,Craig Boutilier.Bayesian Reinforcement Learning for Coalition Formation under Uncertainty[C].In:AAMAS'04,New York City,USA,2004:19~23
  • 7Leslie Pack Kaelbling,Michael L Littman,Anthony R Cassandra.Planning and acting in partially observable stochastic domains[J].Artificial Intelligence,1998;101:99~134
  • 8Michael Bowling,Manuela Veloso.Multiagent Learning Using a Variable Learning Rate[J].Artificial Intelligence,2002;136:215~250
  • 9Michael Bowling,Manuela Veloso.Simultaneous Adversarial MultiRobot Learning[C].In:IJCAI-03,2003:699~704

同被引文献4

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部