期刊文献+

智能博弈对抗中的对手建模方法及其应用综述 被引量:6

Survey of Opponent Modeling Methods and Applications in Intelligent Game Confrontation
下载PDF
导出
摘要 智能博弈对抗一直是人工智能研究的热点。在博弈对抗环境中,通过对对手进行建模,可以推测敌对智能体动作、目标、策略等相关属性,为博弈策略制定提供关键信息。对手建模方法在竞技类游戏和作战仿真推演等领域的应用前景广阔,博弈策略的制定必须以博弈各方的行动策略为前提,因此建立一个准确的对手行为模型对于预测其意图尤其重要。从内涵、方法、应用三个方面,阐述了对手建模的必要性,对现有建模方式进行了分类;对基于强化学习的预测方法、基于心智理论的推理方法和基于贝叶斯的优化方法进行了梳理与总结;以序贯博弈(德州扑克)、即时策略博弈(星际争霸)和元博弈为典型应用场景,分析了智能博弈对抗过程中的对手建模的作用;从有限理性、策略欺骗性和可解释性三个方面进行了对手建模技术发展的展望。 Intelligent game confrontation has always been the focus of artificial intelligence research.In the game confrontation environment,the actions,goals,strategies,and other related attributes of agent can be inferred by opponent modeling,which provides key information for game strategy formulation.The application of opponent modeling method in competitive games and combat simulation is promising,and the formulation of game strategy must be premised on the action strategy of all parties in the game,so it is especially important to establish an accurate model of opponent behavior to predict its intention.From three dimensions of connotation,method,and application,the necessity of opponent modeling is expounded and the existing modeling methods are classified.The prediction method based on reinforcement learning,reasoning method based on theory of mind,and optimization method based on Bayesian are summarized.Taking the sequential game(Texas Hold’em),real-time strategy game(StarCraft),and meta-game as typical application scenarios,the role of opponent modeling in intelligent game confrontation is analyzed.Finally,the development of adversary modeling technology prospects from three aspects of bounded rationality,deception strategy and interpretability.
作者 魏婷婷 袁唯淋 罗俊仁 张万鹏 WEI Tingting;YUAN Weilin;LUO Junren;ZHANG Wanpeng(College of Intelligence Science and Technology,National University of Defense Technology,Changsha 410073,China)
出处 《计算机工程与应用》 CSCD 北大核心 2022年第9期19-29,共11页 Computer Engineering and Applications
基金 国家自然科学基金(61702528,61806212) 湖南省研究生科研创新项目(CX20210011)。
关键词 对手建模 不完美信息 行为预测 深度强化学习 递归推理 元博弈 opponent modeling imperfect information behavior prediction deep reinforcement learning recursive reasoning meta-game
  • 相关文献

参考文献3

二级参考文献4

共引文献23

同被引文献130

引证文献6

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部