期刊文献+

面向伙伴选择的模糊Markov博弈控制及仿真研究 被引量:1

Study on Fuzzy Markov Game Controller for Partner Selection Management Simulation
下载PDF
导出
摘要 针对不确定条件下的伙伴选择决策问题,把自适应模糊控制系统理论及神经网络理论引入到Markov博弈中,提出一种基于多智能体的伙伴选择模糊控制模型。该模型引入基于ANFIS和神经网络的模糊神经网络,实现了一种全新的进行值函数逼近的梯度下降Q学习的算法。并应用该模型对伙伴选择问题进行研究,对多影响因素进行FNN学习,将输出量作为标准Markov博弈模型的输入量,得到影响的策略,最后研究了一个应用实例,利用具体历史数据对建模方法和模型进行了验证和分析。 According to partner selection under uncertain conditions, a multi-agent fuzzy Markov game controller was proposed based on adaptive neuron-fuzzy inference system (ANFIS), neural network and Markov game. Fuzzy neural network was used as value function approximators. In this model, FNN was used to train the factors which influenced the partner selection and the results of FNN was taken as the input for the standard Markov game while the finial policy was taken as the output. A case was studied and the simulation model was validated by historic data.
出处 《系统仿真学报》 EI CAS CSCD 北大核心 2007年第15期3572-3576,共5页 Journal of System Simulation
基金 国家自然科学基金(70540005)
关键词 伙伴选择 多智能体 自适应模糊控制系统 神经网络 Markov博弈 Q学习 partner selection multi-agent ANFIS neural network Markov Game Q learning
  • 相关文献

参考文献17

  • 1杜春侠,高云,张文.多智能体系统中具有先验知识的Q学习算法[J].清华大学学报(自然科学版),2005,45(7):981-984. 被引量:21
  • 2Littman Michael L.Friend or foe Q-learning in General-sum Markov Games[C]//18th International Conference on Machine Learning.MA:MIT press,2001:322-328.
  • 3Watkins C.Technical note:Q-learning[J].Machine Learning (S0885-6125),1992,8:279-292.
  • 4Littman Michael L.Markov Game as a framework for multi-agent reinforcement learning[C]//11th International Conference on Machine Learning,San Francisco:Morgan Kaufman Publishers,1994:1023-1036.
  • 5Haddaid A Sundermeyer.KBDI agent architectures,0' Hare GMP[C]// Jennings Foundations of DA,New York:John Wiley & Sons,1996:169-185.
  • 6Abolpour B,Javan M,Karamouz M.Water allocation improvement in river basin using Adaptive Neural Fuzzy Reinforcement Learning Approach[J].Applied Soft Computing (S1568-4946),2005,6:21-31.
  • 7李蔚恒,王庆林,李彦志,杨承志.SUGENO型网络在空战CGF战术决策中的应用[J].系统仿真学报,2007,19(6):1274-1276. 被引量:1
  • 8Gavin A.Problem Solving with Reinforcement Learning[D].PHD thesis.Cambridge University Engineering Department,1995.
  • 9Zhang G,Peter Times series forecasting using a hybrid ARIMA and neural network model[J].Neural Computing (S0899-7667),2003,28(12):159-175.
  • 10徐昕,贺汉根.神经网络增强学习的梯度算法研究[J].计算机学报,2003,26(2):227-233. 被引量:21

二级参考文献60

共引文献128

同被引文献11

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部