期刊文献+

一种基于多Agent强化学习的多星协同任务规划算法 被引量:21

An Algorithm of Cooperative Multiple Satellites Mission Planning Based on Multi-agent Reinforcement Learning
下载PDF
导出
摘要 在分析任务特点和卫星约束的基础上给出了多星协同任务规划问题的数学模型。引入约束惩罚算子和多星联合惩罚算子对卫星Agent原始的效用值增益函数进行改进,在此基础上提出了一种多卫星Agent强化学习算法以求解多星协同任务分配策略,设计了基于黑板结构的多星交互方式以降低学习交互过程中的通信代价。通过仿真实验及分析证明该方法能够有效解决多星协同任务规划问题。 A multi-satellite cooperative planning problem model was given considering the characteristics of the task requests and satellite constraints.Then the original performance function of each satellite agent was modified by introducing both the constraint punishing operator and the multi-satellite joint punishing operator.Next,a multi-satellite reinforcement learning algorithm(MUSARLA) was proposed to derive the coordinated task allocation strategy.Furthermore,the interaction among multiple satellites was designed based on blackboard architecture to reduce the communication cost while learning.Finally,simulated experiments are carried out which verified the effectiveness of the proposed algorithm.
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2011年第1期53-58,共6页 Journal of National University of Defense Technology
基金 国家自然科学基金资助项目(60604035) 国家863高技术资助项目(2007AA12020203)
关键词 卫星任务规划 协同规划 多智能体强化学习 黑板结构 satellite mission planning cooperative planning multi-agent reinforcement learning blackboard architecture
  • 相关文献

参考文献10

  • 1Khatib L, Frank J. Interleaved Observation Execution and Rescheduling on Earth Observing Systems[C]//Proceedings of the 13^th International Conference on Automated Planning and Scheduling, Trento, Italy, 2003.
  • 2Schetter T, Campbell M, Surka D. Multiple Agent-based Autonomy for Satellite Constellatioas[J]. Artificial Intelligence, 2003 (145): 147- 180.
  • 3Cesta A, Ocon J, Rasconi R, et al. Simulating On-board Autonomy in a Multi-agent System with Planning and Sdaeduling[C]//Proceedings of 20^th International Conference on Planning and Scheduling, Toronto, Canada, 2010.
  • 4陈浩,景宁,李军,唐宇.基于外包合同网的自治电磁探测卫星群任务规划[J].宇航学报,2009,30(6):2285-2291. 被引量:11
  • 5Smith R G, Davis R. Frameworks for Cooperation in Distributed Problem Solving [ J ]. IEEE Trans. On Systems, Man, and Cybernetics, 1981, 11 (1): 61-70.
  • 6Modi P J, Shen W, Tambe M, Yokoo M. An Asynchronous Complete Method for Distributed Constraint Optimization[C]//Proceedings of 2^nd Autonomous Agent and Multi-agent System, Melbourne, Australia, 2003.
  • 7Tan M.Multi-agent Reinforcement Learning:Independent vs. Cooperative Agents [ C ]//Proceedings of 10^th International Conference on Machine Learning, Amherst, MA, 1993: 330-337.
  • 8Busoniu L, Schutter B D, Babuska R. learning and Coordination in Dynamic Multiagent Systems[R], Technical Report 05-019, Delft Center for Systems and Control, Delft University of Technology, The Netherlands, 2005.
  • 9Busoniu L, Schutter B D. A Comprehensive Survey of Multiagent Reinforcement Learning[J]. IEEE Trans. Syst. Man, Cyber., 2008, 38(2) : 156- 172.
  • 10Hu J,Wellman M P.Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm [C]//Proceedings of 15^th Interntional Conference on Machine Learning, Madison, WI, 1998:242 -250.

二级参考文献9

  • 1肖正,吴承荣,张世永.多Agent系统合作与协调机制研究综述[J].计算机科学,2007,34(5):139-143. 被引量:16
  • 2张正强,谭跃进,王军民.基于MAS的分布式卫星系统任务规划研究[J].系统仿真学报,2007,19(12):2868-2871. 被引量:12
  • 3Scott C, Spencer D. Optimal reconfiguration of satellites in formation [J]. Journal of Spacecraft and Revokers, 2007, 44(1): 230- 239.
  • 4Verthillie G, Lenkaitre M. Tutorial on planning activities for earth watching and observation satellites and constellations: from off-line ground planning to on-line on-board planning [ C ]. Proceedings of ICAPS-06, Cambria, UK, 2006.
  • 5Khatib L, Frank J, et al. Interleaved observation execution and rescheduling on earth observing systems[ C]//the Proceedings of the 13th International Conference on Automated Planning and Scheduling, Trento, Italy, 2003.
  • 6Damiani S, Yerfaillie G, et al. An earth watching satellite constellation : how to manage a team of watching agents with limited communications[ C]//the Proceedings of the 4th International Joint Conference on Autonomous Agents and Multi - Agent Systems, Utrecht, Netherlands, 2005.
  • 7Das S, W Curt, Truszkowski W. Distributed intelligent planning and scheduling for enhanced spacecraft autonomy [ C ]//the Proceedings of the AAAI 2001 Spring Symposium Series, California, USA, 2001.
  • 8Schetter T, Campbell M, Surka D. Multiple agent-based autonomy for satellite constellations [ J ]. Artificial Intelligence, 2003 ( 145 ) : 147- 180.
  • 9Smith G, Davis R. Frameworks for cooperation in distributed problem solving[ J ]. IEEE Transactions on Systems, Man and Cybernetics, 1981, 11(1): 61-70.

共引文献10

同被引文献352

引证文献21

二级引证文献183

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部