期刊文献+

面向智能博弈的决策Transformer方法综述 被引量:1

On Decision-making Transformer Methods for Intelligent Gaming
下载PDF
导出
摘要 智能博弈是认知决策智能领域的挑战性问题,是辅助联合作战筹划与智能任务规划的关键支撑.从协作式团队博弈、竞争式零和博弈和混合式一般和博弈共3个角度梳理了智能博弈模型,从认知角度出发定义了运筹型博弈(完全/有限理性)、不确定型博弈(经验/知识)、涌现探索型博弈(直觉+灵感)、群体交互型博弈(协同演化)共4类智能博弈认知模型,从问题可信任解、策略训练平台、问题求解范式共3个视角给出智能博弈求解方案.基于Transformer架构重点梳理了架构增强(表示学习、网络组合、模型扩展)与序列建模(离线预训练、在线适变、模型扩展)共2大类6小类决策Transformer方法,相关研究为开展“离线预训练+在线适变”范式下满足多主体、多任务、多模态及虚实迁移等应用场景的决策预训练模型构建提供了初始参考.为智能博弈领域的决策基石模型相关研究提供可行借鉴. Intelligent gaming is a challenging problem in the field of cognitive decision-making intelligence,and it is the key support for assisting joint combat planning and intelligent mission planning.The intelligent gaming model is sorted out from three perspectives:collaborative team game,competitive zero-sum game and mixed general-sum game,four kinds of cognitive models of intelligent gaming are defined from the perspective of cognition:operational game(complete or bounded rationality),uncertain game(experience/knowledge),emerging exploratory game(intuition and inspiration),and population interactive game(co-evolution).Solutions of intelligent gaming are given from three perspectives:trustworthy solution of problems,benchmark learning method,and strategy training platform.Secondly,based on Transformer framework,the decision-making Transformer methods are analyzed from architecture enhancement(presentation learning,network combination,model extension)and sequence modeling(offline pre-training,online adaptation,model extension).Relevant research provides an initial reference for the construction of decision-making pre-trained model in multi-agent,multi-task,multi-mode and sim-to-real transfer application scenarios under the paradigm of"offline pre-training+online adaptation".It is expected to provide feasible reference for the research on the decision-making foundation model in the field of intelligent gaming.
作者 罗俊仁 张万鹏 苏炯铭 王尧 陈璟 LUO Junren;ZHANG Wanpeng;SU Jiongming;WANG Yao;CHEN Jing(College of Intelligence Science and Technology,National University of Defense Technology,Changsha Hunan 410073,China)
出处 《指挥与控制学报》 CSCD 2023年第1期9-22,共14页 Journal of Command and Control
基金 国家自然科学基金(61806212) 湖南省研究生创新项目(CX20210011)资助。
关键词 智能博弈 智能规划与决策 认知建模 离线预训练 在线适变 决策基石模型 intelligent gaming intelligent planning and decision-making cognitive modeling offline pre-training online adaptation decision-making foundation model
分类号 E91 [军事]
  • 相关文献

参考文献1

二级参考文献5

共引文献26

同被引文献38

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部