多Agent交互式动态影响图的建模方法被引量：2

The Exploration on Modeling Methods for Interactive Multi-agent Dynamic Influence Diagrams

下载PDF

导出

摘要交互式动态影响图是一种以动态影响图为基础,利用有向图构造Agent之间交互作用的决策概率模型,目前只能解决2个Agent的问题.根据概率图模型理论、交互式部分可观测马尔可夫决策过程性质、最大奖励期望值原理等以3个Agent为例建立多Agent交互式动态影响图(I-MADIDs)模型,探讨除建模Agent之外,其他非建模Agent之间存在稳定关系时,如何简化I-MADIDs模型.最后对老虎问题进行建模,利用HUGIN7.0对其进行求解,分别讨论了建模A-gent和其他Agent的决策情况,对比了精确方法和简化模型中贝叶斯参数学习近似方法中Agent的决策情况,证明了近似方法的有效性. Interactive dynamic influence diagrams （I-DIDs） are a kind of probability graph models based on dynamic influence dia grams,using directed graph to construct decision-making models about interaction between agents. I-DIDs can only solve 2 agents＇ problems. Take 3 Agents for example,the paper tries to model interactive multi-agent dynamic influence diagrams （I-MADIDs） by means of probabilistic graph model theory,interactive partially observable Markov decision process nature and the principle of maxi- mum reward expectations,and explores how to simply I-MADIDs when there is the stable relationship between non modeling agents. Finally,we model the tiger problem, solve models using HUGIN7. 0, and discuss separately various decision-making cases for the modeling agent and other agents. Examples prove the validity of the approximate method based on Bayesian parameter learning through comparing the exact and approximate methods.

作者潘颖慧罗键曾一锋

机构地区厦门大学信息科学与技术学院

出处《厦门大学学报（自然科学版）》 CAS CSCD 北大核心 2012年第6期985-990,共6页 Journal of Xiamen University：Natural Science

基金国家自然科学基金项目(60975052) 江西省教育厅科技重点项目(GJJ10695)

关键词交互式动态影响图多AGENT建模概率图模型 interaetive dynamic influence diagrams multi-agent modeling probabilistic graph model

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1Polich K, Gmytrasiewicz P. Interactive dynamic influence diagrams[C]//Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Sys- tems. New York, USA : ACM, 2006 : 147-149.
2Prashant D, Zeng Y F, Chen Q Y. Graphical models for interactive POMDPs: representations and solutions [J]. Autonomous Agents and Multi-agent Systems, 2009, 18 (3) :376-416.
3Zeng Y,Chen Y, Doshi P. Approximating behavioral e- quivalence of models using top-K policy paths[C]//Inter- national Joint Conference on Autonomous Agents and Multi-Agent Systems. Richland, USA: ACM, 2011 : 1229-1230.
4Zeng Y,Doshi P,Pan Y,et al. Utilizing partial policies for identifying equivalence of behavioral models [C]//Pro- ceedings of the Conference on Association for the Ad- vancement of Artificial Intelligence.[s. 1. ] : AAAI, 2011 : 1083-1088.
5Zeng Y,Doshi P,Chen Q. Approximate solutions of inter- active dynamic influence diagrams using model clustering [C]//Proeeedings of the 22th international Conference on Association for the Advancement of Artificial Intelli- gence. Vancouver, Canada : AAAI, 2007 : 782-787.
6Doshi P, Chandrasekaran M,Zeng Y. Epsilon-subjeetive e- quivalence of models for interactive dynamic influence dia-grams [C]//IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. Toronto, Canada .. IEEE, 2010 : 165-172.
7Doshi P, Zeng Y. Improved approximation of interactive dynamic influence diagrams using discriminative model updates[C]//Proceedings of 8th International Conference on Autonomous Agents and Multiagent Systems. Rich-land, USA : 2009 : 907-914.
8Cooper G F, Herskovits E. A bayesian method for the in- duction of probabilistic networks from data[J]. Machine Learning,1992,9(4) :309 347.
9Gmytrasiewicz P J,Prashant D. A framework for sequen- tial planning in multi-agent settings[J]. J Artif Int Res, 2005,24:49-79.

同被引文献18

1钟伟才,刘静,刘芳焦,李成.组合优化多智能体进化算法[J].计算机学报,2004,27(10):1341-1353. 被引量：34
2范波,潘泉,张洪才.基于Markov对策的多智能体协调方法及其在Robot Soccer中的应用[J].机器人,2005,27(1):46-51. 被引量：5
3刘海涛,洪炳镕,乔立民,朴松昊.多智能体机器人系统分散式通信决策研究[J].机器人,2007,29(6):540-545. 被引量：5
4刘春阳,谭应清,柳长安,马莹巍.多智能体强化学习在足球机器人中的研究与应用[J].电子学报,2010,38(8):1958-1962. 被引量：19
5张迎晓,杨涛,胡波,陈光梦.基于Dec-POMDP的认知无线电网络频谱接入算法[J].信息与电子工程,2010,8(6):720-725. 被引量：3
6李波,曹浪财,庄进发.交互式动态影响图及其精确求解算法[J].解放军理工大学学报（自然科学版）,2011,12(2):119-124. 被引量：1
7姜鑫,刘新建,陈超.基于多主体影响图及博弈论的军事决策建模[J].系统工程与电子技术,2011,33(7):1565-1569. 被引量：3
8李晓,杨洪勇.复杂网络特性与多智能体的一致性[J].复杂系统与复杂性科学,2011,8(3):38-43. 被引量：4
9朱曼玲,金芝.一种服务Agent的可信性评估方法[J].软件学报,2011,22(11):2593-2609. 被引量：8
10马广富,梅杰.有向网络下非线性多智能体系统的协调跟踪[J].控制与决策,2011,26(12):1861-1864. 被引量：8

引证文献2

1潘颖慧,曾一锋.交互式动态影响图研究及其最优K模型解法[J].计算机学报,2018,41(1):28-46. 被引量：3
2安敬民,李冠宇,张冬青,蒋伟.面向序贯决策中异常情景下交互问题处理方法[J].计算机集成制造系统,2020,26(12):3274-3282.

二级引证文献3

1安敬民,李冠宇,张冬青,蒋伟.面向序贯决策中异常情景下交互问题处理方法[J].计算机集成制造系统,2020,26(12):3274-3282.
2宋伟中,王行业,王宁.一种面向无人机区域协同覆盖的感知任务分配方法[J].计算机应用与软件,2021,38(5):75-81. 被引量：3
3李壮阔,常凯旋.合作博弈的连续蚁群算法求解[J].计算机工程与应用,2021,57(24):198-204. 被引量：2

1王俊欢.MADIDS模型在Snort入侵检测系统中的应用研究[J].微计算机信息,2009,25(15):60-62. 被引量：1
2田乐,罗键,曹浪财.多Agent交互动态影响图的近似行为等价算法[J].华中科技大学学报（自然科学版）,2014,42(4):60-63. 被引量：2
3董兴陆,惠晓滨,杨仕美,杜继永,曹中红.基于Multi-Agent的远程测试故障诊断系统的建模[J].火力与指挥控制,2012,37(10):50-53. 被引量：2
4罗键,武鹤.基于交互式动态影响图的对手建模[J].控制与决策,2016,31(4):635-639. 被引量：4
5谭晓辉.计算机数据库入侵检测技术分析[J].当代教育实践与教学研究（电子版）,2016,0(2X):79-79. 被引量：4
6何琨,赵勇.网格资源管理与调度研究综述[J].武汉理工大学学报（信息与管理工程版）,2005,27(4):1-5. 被引量：11
7刘双印,徐龙琴,徐兵,李振坤.基于Mobile Agent分布式入侵检测系统研究[J].电脑开发与应用,2006,19(8):39-40.
8赵凯.基于移动Agent的分布式入侵检测[J].硅谷,2008,1(22):80-80.
9刘风华,丁贺龙,林果园.一个基于移动Agent的分布式入侵检测模型[J].电子技术应用,2005,31(2):1-4. 被引量：2
10季秀兰.基于移动Agent的分布式入侵检测系统设计与实现[J].甘肃联合大学学报（自然科学版）,2011,25(5):52-56.

厦门大学学报（自然科学版）

2012年第6期

浏览历史

内容加载中请稍等...

多Agent交互式动态影响图的建模方法被引量：2

参考文献9

同被引文献18

引证文献2

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

多Agent交互式动态影响图的建模方法 被引量：2

参考文献9

同被引文献18

引证文献2

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

多Agent交互式动态影响图的建模方法被引量：2