期刊文献+

基于影响图的多智能体学习算法 被引量:1

Learning algorithm of multi-agents system based on influence diagrams
下载PDF
导出
摘要 提出一种多智能体学习算法.用影响图作为 agent 表示工具,给定 agent 的一个初始模型和它的历史行为,在能力、信念和优先学习的基础上来构建新的模型.学习方法是把其它 agent 的历史行为作为训练集,利用神经网络以及决策知识和专家知识来修改影响图中各结点的连接关系.针对与 agent 历史行为不一致的情况,本文把它看成效用函数发生了随机偏差,通过 Markov chain-Monte Carlo 技术进行模拟,实现效用函数的调整.最后利用多机编队协同空战作为例子说明算法的实用性. This paper proposes a learning algorithm of multi-agents system. The influence diagram is used asa modeling representation tool. Given an initial model of an Agent and its history behavior based on capabilities, preferences and beliefs of the Agent, a new model is constructed. Using observed behavior history of other Agent as training set, the learning method is to modify the connection relation between the nodes in the diagram by using neural network, decision knowledge and expert knowlege approach. The inconsistent behavior is interpreted as random deviations from an underlying utility function in this paper, and the utility function is modified by the Markov chain-Monte Carlo technique. An example of team cooperative air combat shows that this algorithm is effective.
出处 《系统工程学报》 CSCD 北大核心 2008年第3期377-380,共4页 Journal of Systems Engineering
基金 空军国防预研课题资助项目(4020501)
关键词 多智能体 影响图 学习 神经网络 multi-agent systems influence diagram learning neural network
  • 相关文献

参考文献8

  • 1Sahin F, Bay J S. Learning from experience using a decision-theoretic intelligent Agent in multi-Agent systems [ A ]. In: Mountain Workshop on Soft Computing in Industrial Applications[ C]. San Diego: IEEE, 2001. 109-114.
  • 2张润梅,王浩.一种基于影响图学习其他Agent模型方法[J].辽宁工程技术大学学报(自然科学版),2005,24(4):577-579. 被引量:2
  • 3Carmel D, Markovitch S. Learning models of intelligent Agents [ J ]. International Journal of Expert Systems, 1996, 14 (1): 62-67.
  • 4Chajewska U, Koller D, Ormoneit D. Learning an Agent's utility function by observing behavior[ A]. In: Proceedings of the Eighteenth International Conference on Machine Learning[ C ]. Kanagawa, Japan: IEEE, 2001. 35-42.
  • 5Nielsen T D, Jensen F V. Sensitivity analysis in influence diagrams [ J ]. IEEE Trans. on Systems, Man and Cybernetics, Part A, Systems and Humans, 2003, 33 (2) : 223-234.
  • 6Applegate D, Kannan R. Sampling and integration of near log-concave functions[ A]. In: Proceedings of the 23rd Annual ACM Symposium on Theory and Computing[ C]. New Orleans: ACM Press, 1991. 156-163.
  • 7刘金星,佟明安.双机编队协同战术的实现[J].系统工程与电子技术,2003,25(5):540-542. 被引量:11
  • 8董彦非,申洋,张恒喜.空战机动决策中的影响图方法[J].电光与控制,2001,8(1):49-53. 被引量:16

二级参考文献15

  • 1于睿箭,冯允成.影响图的基础理论和发展[J].北京航空航天大学学报,1994,20(4):429-435. 被引量:11
  • 2詹原瑞,何娟.树与影响图[J].系统工程理论与实践,1997,17(4):1-8. 被引量:8
  • 3[4]Goodrich K. H. ,McManus J.W. Development of a tactical guidance research and evaluation system (TIGRES). AIAA Paper 89~3312,Aug. 1989.
  • 4[5]Katz A. Tree lookahead in air combat. Journal of Aircraft, 1994,31 (4): 970~973.
  • 5[6]汪应洛.系统工程(第二版)[M].北京:机械工业出版社,1999.
  • 6[7]Kai Virtanen,Tuomas Raivio,Raimo P. Decision theoretical approach to pilot simulation. Journal of Aircraft, 1999,36(4) : 632~ 641.
  • 7Sen S. Evolution and learning in multi-agent systems[J]. International Journal of Human-Computer Studies, 1998,48(1):1-7.
  • 8Noh Sand Gmytrasiewicz P J. Implementation and evaluation of rational communicative behavior in coordinated defense[A].Oren Etzioni,J(o)rg P.Müller and Jeffrey M.Bradshaw. Proceedings of the third annual conference on Autonomous Agents [C]. New York, USA:ACM Press, 1999.123-130.
  • 9Noh S and Gmytrasiewicz P J. Agent modeling in antiair defense[A]. A.Jameson C.Paris and C. Tasso. Proceedings of the Sixth International Conference on User Modeling[C]. Sardinia, Italy: Springer-Verlag Telos, 1997.389-400.
  • 10Howard R A. and Matheson J E. Readings on the principles and applications of decision analysis(Vol 2) [M]. CA: Strategic Decisions Group, Menlo Park. 1984.719-762.

共引文献26

同被引文献18

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部