RoboCup is a particularly good domain for studying multi-agent systems. A wide variety of MAS issues can be studied in robotic soccer, in which the theory, algorithm and architecture of agent system can be evaluated. ...RoboCup is a particularly good domain for studying multi-agent systems. A wide variety of MAS issues can be studied in robotic soccer, in which the theory, algorithm and architecture of agent system can be evaluated. Because of the inherent complexity of MAS, there are many interests in using machine learning techniques to handle it. This paper investigates and discusses the machine-learning techniques used in RoboCup. The background is firstly presented and the application of machine learning in RoboCup is lately demonstrated with some top simulation teams. The machine-learning system in NDSocTeam is also introduced. Finally some open issues in this field are pointed out.展开更多
模拟机器人足球比赛(Robot World Cup,RoboCup)作为多Agent系统的一个理想的实验平台,已经成为人工智能的研究热点。传统的Q学习已被有效地应用于处理RoboCup中传球策略问题,但是它仅能简单地离散化连续的状态、动作空间。提出将神经网...模拟机器人足球比赛(Robot World Cup,RoboCup)作为多Agent系统的一个理想的实验平台,已经成为人工智能的研究热点。传统的Q学习已被有效地应用于处理RoboCup中传球策略问题,但是它仅能简单地离散化连续的状态、动作空间。提出将神经网络应用于Q学习,系统只需学习部分状态-动作的Q值即可获得近似连续的Q值,就可以有效地提高泛化能力。然后将改进的Q学习应用于优化传球策略,最后在RobCup中实现测试了该算法,实验结果表明改进的Q学习在RoboCup传球策略中的应用,可以有效提高传球的成功率。展开更多
文摘RoboCup is a particularly good domain for studying multi-agent systems. A wide variety of MAS issues can be studied in robotic soccer, in which the theory, algorithm and architecture of agent system can be evaluated. Because of the inherent complexity of MAS, there are many interests in using machine learning techniques to handle it. This paper investigates and discusses the machine-learning techniques used in RoboCup. The background is firstly presented and the application of machine learning in RoboCup is lately demonstrated with some top simulation teams. The machine-learning system in NDSocTeam is also introduced. Finally some open issues in this field are pointed out.
文摘模拟机器人足球比赛(Robot World Cup,RoboCup)作为多Agent系统的一个理想的实验平台,已经成为人工智能的研究热点。传统的Q学习已被有效地应用于处理RoboCup中传球策略问题,但是它仅能简单地离散化连续的状态、动作空间。提出将神经网络应用于Q学习,系统只需学习部分状态-动作的Q值即可获得近似连续的Q值,就可以有效地提高泛化能力。然后将改进的Q学习应用于优化传球策略,最后在RobCup中实现测试了该算法,实验结果表明改进的Q学习在RoboCup传球策略中的应用,可以有效提高传球的成功率。