
Multi-agent Cooperation Algorithm Based on Individual Gap Emotion in Sparse Reward Scenarios
Abstract To address the sparse reward problem that reinforcement learning faces in multi-agent environments, and drawing on the role of emotion in human learning and decision making, a multi-agent cooperation algorithm based on individual gap emotion is proposed. The approximate joint action-value function is optimized end to end to train the individual policies, and each agent's individual action-value function serves as its evaluation of events. The gap between this predicted evaluation and the actual outcome produces a gap emotion. The gap emotion model acts as an intrinsic motivation mechanism that generates an intrinsic emotional reward for each agent, which supplements the extrinsic reward and thereby alleviates its sparsity. Because the intrinsic emotional reward is independent of the specific task, it has a degree of generality. The effectiveness and robustness of the proposed algorithm are verified in multi-agent pursuit scenarios with different degrees of reward sparsity.
Authors WANG Hao; WANG Jing; FANG Baofu (School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601; Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei 230601)
Source Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), indexed in EI, CSCD, and the Peking University Core Journals list; 2022, Issue 5, pp. 451-460 (10 pages)
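Funding Supported by the National Natural Science Foundation of China (No. 61872327) and the Open Fund Project of the Key Laboratory of Civil Aviation Flight Technology and Flight Safety (No. FZ2020KF07).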
Keywords Sparse Reward; Multi-agent Cooperation; Reinforcement Learning; Individual Gap Emotion; Intrinsic Emotional Reward
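
The mechanism summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything specific here is an assumption made for illustration: the gap is taken to be a one-step temporal-difference style difference between the realized outcome and the agent's predicted individual action value, and the names `gap_emotion`, `shaped_reward`, `gamma`, and `beta` are hypothetical. It is not the authors' implementation.

```python
from typing import Sequence


def gap_emotion(
    q_predicted: float,
    reward_ext: float,
    next_q_values: Sequence[float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Hypothetical gap emotion for one agent.

    ASSUMPTION: the "actual situation" is approximated by a one-step
    bootstrapped return, and the gap is that return minus the agent's
    predicted individual action value. A positive gap means the outcome
    was better than the agent expected; a negative gap means it was worse.
    """
    # No bootstrapping from the next state once the episode has ended.
    bootstrap = 0.0 if done else gamma * max(next_q_values)
    return (reward_ext + bootstrap) - q_predicted


def shaped_reward(reward_ext: float, gap: float, beta: float = 0.1) -> float:
    """Extrinsic reward supplemented by a scaled intrinsic emotional reward.

    ASSUMPTION: the intrinsic reward is the gap itself, weighted by a
    hypothetical coefficient `beta`.
    """
    return reward_ext + beta * gap


if __name__ == "__main__":
    # One agent, one transition with no extrinsic reward (the sparse case).
    gap = gap_emotion(q_predicted=1.0, reward_ext=0.0,
                      next_q_values=[2.0, 3.0], gamma=0.5)
    print(gap, shaped_reward(0.0, gap, beta=0.1))
```

Under these assumptions, an agent still receives a nonzero learning signal on transitions where the extrinsic reward is zero, which is the sense in which the intrinsic reward supplements a sparse extrinsic one. Nothing in the computation refers to the pursuit task itself, which matches the abstract's claim that the intrinsic reward is task-independent.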