多Agent动态影响图的近似计算方法被引量：4

Approximate Computation of Multi-Agent Dynamic Influence Diagrams

下载PDF

导出

摘要由于复杂系统具有高维性和不确定性常难以表示处理,因而知识表示和计算方法是复杂系统研究中的公开难题.当前,多Agent影响图不能建模动态环境和多Agent,马尔可夫决策过程难以表示A-gents之间结构关系的问题,因而提出一种用局部概率因式表示动态环境中多Agent之间关系的新决策模型——多Agent动态影响图(MADIDs).针对MADIDs模型的联合概率分布和联合效用函数在计算上的高维问题,研究该模型的近似计算方法.给出MADIDs概率结构部分的一种分层分解的分布近似方法,并通过对该近似方法的误差和复杂性的分析,给出一个可对近似分布的精度和复杂性进行均衡的函数δ(k);给出一种BP神经网络通过局部效用的学习来近似计算MADIDs的联合效用.在模型实例上的实验结果显示了MADIDs模型近似计算方法的有效性. Due to high dimension and uncertainty of the complex system, the complexity system is often hard to represent and process, and the knowledge representation and computation methods of complex systems are open hard problems in complex system research. At present, MAIDs can not model dynamic environment and it is difficult for multi-agent MDPs to represent structural relations among agents; so a multi-agent dynamic influence diagrams （MADIDs） model is given to representation relations among multiagents in dynamic environment by local factor probability form. The computation of joint probability distribution and joint utility function of MADIDs are a high dimension problem, so the approximate computation methods are researched. A distribution approximation method of hierarchical decomposition of probability structural MADIDs is studied; based on analysis of the complexity and the error of the distribution approximation method, a function δ（k） is introduced to establish equilibrium between precision and complexity of approximate distribution. Then a BP neural network is given to approximately compute utility structural MADIDs by learning local utility. Finally, given model instances, the experiment results show the validity of the approximation computation method of the MADIDs model.

作者姚宏亮王浩汪荣贵李俊照

机构地区合肥工业大学计算机科学与技术系

出处《计算机研究与发展》 EI CSCD 北大核心 2008年第3期487-495,共9页 Journal of Computer Research and Development

基金国家自然科学基金项目(60575023) 安徽省自然科学基金项目(070412054 070412064)

关键词影响图多AGENT动态影响图 KL差分联合树 EBK算法 influence diagram MADIDs KL-divergence junction tree EBK algorithm

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献16

1吴志勇,蔡莲红.基于动态贝叶斯网络的音视频双模态说话人识别[J].计算机研究与发展,2006,43(3):470-475. 被引量：11
2C Boutilier, D Poole. Computing optimal policies for partially observable decision processes using compact representations [ C]. AAAI-96, Protland, USA, 1996.
3C Boutilier. Sequential optimality and coordination in multiagent systems [C]. San Francisco: Morgan Kaufmann, IJCAI- 99, 1999. 478-485.
4A G Barto, Mahadevan. Recent advances in hierarchical reinforcement learning (discrete event dynamic systems)[J]. Theory and Applications, 2003, 13(1/2): 41-77.
5D Koller, B Milch. Multi-agent influence diagrams for representing and .solving games [C]. IJCAI, Seattle, USA, 2001.
6Y Gal, A Pfeffer. A language for modeling agents decision making processes in games [C]. AAMAS-2nd, Nekbiyrbe, Ausevier, 2003.
7A Dielmann, S Renals. Dynamic Bayesian networks for meeting structuring [C]. IEEE Int'l Conf on Acoustics, Speech, and Signal Processing, ICASSP' 04, Res Edinborgh University, 2004.
8M Frick, M Groiie. Deciding first-order properties of locally tree-decomposable graphs [J]. Journal of the ACM, 2001, 48 (6): 1184-1206.
9S Kirshner, P Smyth. Conditional Chow-Liu tree structures for modeling discrete-valued vector time series [R]. School of Information and Computer Science, University of California, Tech Rep: 04-04, 2004.
10F R Bach, M I Jordan. Tin junction trees [C]. Advances in Neural Information Processing Systems, Vancouver, Canada, 2002.

二级参考文献9

1C. C. Chibelushi, F. Deravi, J. S. D. Mason. A review of speech-based bimodal recognition, IEEE Trans. Multimedia,2002, 4(1): 23-37.
2S. Dupont, J. Luettin. Audiovisual .speech modeling for continuous speech reeognition, IEEE Trans. Multimedia, 2000, 2(3): 141-151.
3A. Nefian, Luhong Liang, Xiaobo Pi, et al. A coupled HMM for audio visual speech recognition. In: Int'l Conf. Acoustics, Speech and Signal Processing (ICASSP2002) . Piscataway, N J: IEEE Press, 2002. 2013-2016.
4A. Nefian, Luhong Liang, Tieyan Fu, et al. A Bayesian approach to audlo-visual speaker identification. Inz Proe. 4th Int'l Conf. Audio-and Video-based Biometrie Person Authentication(AVBPA2003). Berlin: Springer, 2003. 761-769.
5G. G, Zweig, Speech recognition with dynamic Bayesian networks: [Ph. D, dissertation]. Berkeley: U, C. Berkeley,1998.
6J. N. Gowdy, A. Subramanya, C. Bartels, et al. DBN based multi-stream models for audio visual speech recognition. In: Int'l Conf. Acoustics, Speech and Signal Processing (ICASSP2004).Piscataway, NJ: IEEE Press, 2004. 993-996.
7T. Chen, Audiovisual speech processing. IEEE Trans. Signal Processing, 2001, 18 ( 1 ) : 9-21.
8K. Murphy. The Bayes net toolbox for Matlab. http://www. ai. mit. edu/-- murphyk/Scftware/BNT/bnt, html, 2004-11 -22.
9王志明,蔡莲红,艾海舟.基于支持向量回归的唇动参数预测[J].计算机研究与发展,2003,40(11):1561-1565. 被引量：7

共引文献10

1苗夺谦,王睿智,冉巍.基于动态贝叶斯网络的连续语音识别框架及其Token传递模型[J].计算机研究与发展,2008,45(11):1882-1891.
2张润梅,王浩,张佑生,姚宏亮,方长胜.基于内部结构MPoMDP模型的策略梯度学习算法[J].计算机工程与应用,2009,45(7):20-23. 被引量：1
3宋培岩,蒋冬梅,王风娜.基于发音特征的音/视频双流语音识别模型[J].计算机应用研究,2009,26(7):2481-2483. 被引量：1
4黄建明,方娇莉,王心平.大学课程贝叶斯网络模型研究[J].贵州大学学报（自然科学版）,2009,26(2):81-84.
5冷翠平,王双成,王辉.动态贝叶斯网络结构学习的依赖分析方法研究[J].计算机工程与应用,2011,47(3):51-53. 被引量：3
6赵欢,王纲金,胡炼,彭秀娟.车载环境下基于样本熵的语音端点检测方法[J].计算机研究与发展,2011,48(3):471-476. 被引量：7
7王双成,裴瑱,毕玉江.经济周期转折点预测的动态贝叶斯网络分类器模型[J].管理工程学报,2011,25(2):173-177. 被引量：3
8冯璐,王路露,张磊,张华东.车载环境下的语音端点检测方法[J].测控技术,2016,35(3):39-41. 被引量：2
9李国法,陈耀昱,吕辰,陶达,曹东璞,成波.智能汽车决策中的驾驶行为语义解析关键技术[J].汽车安全与节能学报,2019,10(4):391-412. 被引量：6
10陈湟康,陈莹.基于具有深度门的多模态长短期记忆网络的说话人识别[J].激光与光电子学进展,2019,56(3):130-136. 被引量：11

同被引文献61

1王红卫,李琛,刘会新.马尔可夫决策过程复杂性的熵测度[J].控制与决策,2004,19(9):983-987. 被引量：10
2王浩.基于影响图的多Agent决策问题研究[J].合肥工业大学学报（自然科学版）,2005,28(9):1112-1116. 被引量：5
3刘金兰,韩文秀,李光泉.影响图中的分离相互作用模型[J].管理工程学报,1996,10(4):229-233. 被引量：1
4邹国辉,敬忠良,胡洪涛.基于优化组合重采样的粒子滤波算法[J].上海交通大学学报,2006,40(7):1135-1139. 被引量：43
5Smith J Q. Influence diagrams for statistical modeling[J]. Annals of Statistics, 1989,17(2) :654-672.
6Shachter R D. Probabilistic inference and influence diagrams[J]. Operations Research, 1988,36 (4) : 724-741.
7Shachter R D. Evaluating influence diagrams[J]. Operations Research, 1986,34(6) :871-882.
8Howard R. Knowledge maps[J]. Management Science, 1989,35?:903-922.
9Howard R, Matheson J. Influence diagrams[J]. Readings on the Principles and Applications of Decision Analysis, 1984,2:721-762.
10安殉.面向顾客满意度改进决策的结构方程和影响图结合研究[D].天津:天津大学,2006.

引证文献4

1罗键,李波,潘颖慧,尹华一,吴长庆.基于多Agent的交互式动态影响图研究、应用与展望[J].厦门大学学报（自然科学版）,2011,50(2):253-260. 被引量：1
2姚宏亮,王秀芳,胡大伟,王浩,茆美琴.多Agent动态影响图的一种混合近似推理算法[J].计算机研究与发展,2011,48(4):584-591. 被引量：2
3李波,罗键,尹华一,田乐.一种交互式动态影响图的改进算法[J].模式识别与人工智能,2011,24(4):506-513.
4李波,罗键,庄进发,尹华一.交互式动态影响图的一种近似求解算法[J].华中科技大学学报（自然科学版）,2011,39(10):64-68. 被引量：3

二级引证文献6

1秦之凡,杨伟龙.基于粒子滤波的隐式对手策略匹配方法[J].装甲兵学报,2022(5):86-92.
2刘辉舟,夏维,付超,姚宗信.对抗场景划分识别与多agent群体对抗策略选择研究[J].计算机应用研究,2011,28(12):4572-4575.
3田乐,罗键,曹浪财.多Agent交互动态影响图的近似行为等价算法[J].华中科技大学学报（自然科学版）,2014,42(4):60-63. 被引量：2
4吴小志,米军,燕锦华.基于接受阈值的CSR舆论传播模型研究[J].合肥师范学院学报,2014,32(3):41-46.
5罗键,武鹤,曹浪财.多智能体对手建模及其真实模型的确定[J].华中科技大学学报（自然科学版）,2015,43(10):48-52. 被引量：1
6鲁桂芳.基于交互式动态影响图的决策模型及算法分析[J].科技经济导刊,2016(3):3-4. 被引量：1

1姚宏亮,王浩,张佑生,汪荣贵.多Agent动态影响图及其一种近似推理算法研究[J].计算机学报,2008,31(2):236-244. 被引量：14
2姚宏亮,王浩,张佑生,俞奎.多Agent动态影响图及其概率分布的近似方法[J].模式识别与人工智能,2007,20(4):525-532. 被引量：2
3姚宏亮,王浩,张佑生,汪荣贵,方宝富.基于多Agent动态影响图的协作实现[J].系统仿真学报,2007,19(14):3270-3275. 被引量：1
4姚宏亮,王秀芳,胡大伟,王浩,茆美琴.多Agent动态影响图的一种混合近似推理算法[J].计算机研究与发展,2011,48(4):584-591. 被引量：2
5王俊欢.MADIDS模型在Snort入侵检测系统中的应用研究[J].微计算机信息,2009,25(15):60-62. 被引量：1
6刘双印,徐龙琴,徐兵,李振坤.基于Mobile Agent分布式入侵检测系统研究[J].电脑开发与应用,2006,19(8):39-40.
7赵凯.基于移动Agent的分布式入侵检测[J].硅谷,2008,1(22):80-80.
8刘风华,丁贺龙,林果园.一个基于移动Agent的分布式入侵检测模型[J].电子技术应用,2005,31(2):1-4. 被引量：2
9季秀兰.基于移动Agent的分布式入侵检测系统设计与实现[J].甘肃联合大学学报（自然科学版）,2011,25(5):52-56.
10徐庚保,曾莲芝.基于仿真的复杂系统研究[J].计算机仿真,2013,30(2):1-4. 被引量：6

计算机研究与发展

2008年第3期

浏览历史

内容加载中请稍等...

多Agent动态影响图的近似计算方法被引量：4

参考文献16

二级参考文献9

共引文献10

同被引文献61

引证文献4

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

多Agent动态影响图的近似计算方法 被引量：4

参考文献16

二级参考文献9

共引文献10

同被引文献61

引证文献4

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

多Agent动态影响图的近似计算方法被引量：4