1Xuan P,Lesser V.Multi-agent policies:From centralized ones to decentralized ones[A].Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems[C].New York,NY,USA:ACM,2002.1098 -1105.
2Goldman C V,Zilberstein S.Decentralized control of cooperative systems:Categorization and complexity analysis[J].Journal of Artificial Intelligence Research,2004,22:143-174.
3Hansen E A,Bernstein D S,Zilberstein S.Dynamic programming for partially observable stochastic games[A].Proceedings of the Nineteenth National Conference on Artificial Intelligence[C].Menlo Park,CA,USA:AAAI,2004.709 -715.
4Peshkin L,Kim K E,Meuleau N,et al.Learning to cooperate via policy search[A].Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence[C].USA:Morgan Kaufmann,2000.489-496.
5Pynadath D V,Tambe M.The communicative multiagent team decision problem:Analyzing teamwork theories and models[J].Journal of Artificial Intelligence Research,2002,16:389 -423.
6Nair R,Tambe M,Roth M,et al.Communication for improving policy computation in distributed POMDPs[A].Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems[C].New York,NY,USA:ACM,2004.1098-1105.
7Roth M,Simmons R,Veloso M.Decentralized communication strategies for coordinated multi-agent policies[A].Multi-Robot Systems:From Swarms to Intelligent Automata Vol.Ⅲ[C].Dordrecht,Netherlands:Springer,2005.93 -106.
8Littman M L,Cassandra A,Kaelbling L.Learning policies for partially observable environments:Scaling up[A].Proceedings of the 12th International Conference on Machine Learning[C].San Francisco,CA:Morgan Kaufmann Publishers,1995.362 -370.