基于启发式强化学习的多智能体覆盖问题研究

Multi-Agent Coverage Based on Heuristically Accelerated Reinforcement Learning

下载PDF

导出

摘要针对多智能体覆盖问题存在的计算量大、收敛速度慢等问题,提出一种基于启发式强化学习的多智能体覆盖算法。利用智能体收集到的环境信息作为先验知识,对强化学习中智能体的行动选择进行引导。仿真实验表明,该算法在不影响覆盖效果的情况下有效提高覆盖问题的学习收敛速度。 Since multi-agent coverage problem requires large amount of computation and can be very time consuming, proposes a multi-agent coverage algorithm based on heuristically accelerated reinforcement learning. Agents extract the environmental structure information as priori knowledge to guide the choices of actions in reinforcement learning. The simulation results show that the algorithm can effectively speed up the learning convergence of the coverage problem without affect the coverage result.

作者贺荟霖 HE Hui-lin(School of Electrical Engineering, Southwest Jiaotong University, Chengdu 611756)

机构地区西南交通大学电气工程学院

出处《现代计算机（中旬刊）》 2018年第5期8-11,共4页 Modern Computer

关键词多智能体启发式强化学习覆盖问题 Multi-Agent Heuristic Reinforcement Leanfing Coverage

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献1

1蔡自兴,崔益安.多机器人覆盖技术研究进展[J].控制与决策,2008,23(5):481-486. 被引量：19

二级参考文献47

1Yoav Gabriely, Elon Rimon. Spiral-STC: An on-line coverage algorithm of grid environments by a mobile robot[C]. Proc of the IEEE Int Conf on Robotics and Automation. Washington, 2002: 954-960.
2Zelinsky A, Jarvis R A, Byrne J C, et al. Planning paths of complete coverage of an unstructured environment by a mobile robots [C]. Int Conf on Advanced Robotics. Tokyo, 1993: 533-538.
3Ercan Umut Acar. Complete sensorbased coverage of unknown spaces: Incremental construction of cellular decompositions [D]. Pennsylvania: Carnegie Mellon University, 2002.
4Iwan R Ulrich, Francesco Mondada, Nicoud J D. Autonomous vacuum cleaner [ J ]. Robotics and Autonomous Systems, 1997,19 (3/4): 233-245.
5Gerkey B P, MataricM J. A formal analysis and taxonomy of task allocation in multi-robot systems[J]. Int J of Robotics Research, 2004, 23(9): 939-954.
6Min T W, Yin H K. A decentralized approach for cooperative sweeping by multiple mobile robots[C]. Proc of IEEE Int Conf on Intelligent Robots and Systems. Victoria, 1998: 380-385.
7Wagner I A, Lindenbaum M, Bruckstein A M. MAC vs PC: Determinism and randomness as complementary approaches to robotic exploration of continuous unknown domains [J]. Int J of Robotics Research, 2000, 19(1): 12-31.
8Payton D, Daily M, Estkowski R, et al. Pheromone robotics[J]. Autonomous Robots, 2001, 11(3): 319-324.
9Duckett T, Nehmzow U. Mobile robot self-localization and measurement of performance in middle-scale environments[J]. Robotics and Autonomous Systems, 1998, 24(1): 57-69.
10Nehmzow U. Quantitative analysis of robot-environment interaction on the difference between simulations and the real thing[C]. Proc of Eurobot. Lund, 2001: 171-178.

共引文献18

1常宝娴,丁洁,朱俊武,章永龙.未知环境下机器人Q学习覆盖算法[J].南京理工大学学报,2013,37(6):792-798. 被引量：2
2高同跃,夏晓玲,饶进军,龚振邦,罗均.多无人机协作监测污染气团的研究现状[J].环境监测管理与技术,2010,22(1):12-15. 被引量：2
3张国有,曾建潮.基于黄蜂群算法的群机器人全区域覆盖算法[J].模式识别与人工智能,2011,24(3):431-437. 被引量：10
4徐益群,左敏,高彦平,涂序彦.机器人和软件人协同智能仿真方法与技术研究[J].计算机仿真,2011,28(7):37-39. 被引量：1
5左敏,曾广平,涂序彦,魏伟.机器人和软件人协同智能仿真平台研究[J].计算机仿真,2011,28(7):40-42. 被引量：4
6赵慧南,刘淑华,吴富章,程宇.基于二分搜索的牛耕式全覆盖规划算法研究[J].计算机工程与应用,2011,47(23):51-53. 被引量：10
7周东健,张兴国,李成浩.多机器人系统协同作业技术发展近况与前景[J].机电技术,2013,36(6):146-150. 被引量：9
8李晔,姜言清,张国成,黄蜀玲,李一鸣,陈鹏云.一种基于电子海图的欠驱动AUV区域搜索方案[J].机器人,2014,36(5):609-618. 被引量：7
9蔡标,戴学丰,姜来浩.基于蚁群算法的多机器人环境探索[J].齐齐哈尔大学学报（自然科学版）,2014,30(6):10-13. 被引量：1
10徐博,徐旻,陈立平,谭彧.智能机械全覆盖路径规划算法综述[J].计算机测量与控制,2016,24(10):1-5. 被引量：16

1左家亮,杨任农,张滢,李中林,邬蒙.基于启发式强化学习的空战机动智能决策[J].航空学报,2017,38(10):212-225. 被引量：51
2寇峰,王树明,刘艳,李小琳,阎伟林.基于实数编码的遗传神经网络水淹层综合判别[J].吉林大学学报（信息科学版）,2001,19(3):58-62. 被引量：1
3薛俊伟,罗红,刘雨濛.随行电缆皮飞站覆盖电梯的LTE网络系统设计[J].信息技术与信息化,2018(5):124-125.
4李建威,叶伟明.关于如何改善调频覆盖效果的探讨[J].电声技术,2018,42(1):74-75.
5王锋,石子君,施健.基于峭度特征提取的模拟电路故障诊断[J].计量与测试技术,2018,45(1):95-98. 被引量：1
6刘成颖,吴昊,王立平,张智.基于PSO优化LS-SVM的刀具磨损状态识别[J].清华大学学报（自然科学版）,2017,57(9):975-979. 被引量：22
7姚培,黄培煌,郭龙坤.最小化移动传感器最大迁移距离的快速栅栏覆盖算法[J].小型微型计算机系统,2018,39(6):1260-1265. 被引量：1
8王震,钱炜,刘旭燕.地下粮食运输系统的构建及优化[J].农业装备与车辆工程,2018,56(7):95-97.
9张新春,曹应平,韩春雨,白云灿.基于图像处理的输电线路导线表面损伤特征研究[J].图学学报,2018,39(3):440-447. 被引量：9
10梁松柏,陈锋,宋海平,李文生.基于高斯分布的基站天线最佳方向角定位算法[J].电信科学,2018,34(7):128-134. 被引量：6

现代计算机（中旬刊）

2018年第5期

浏览历史

内容加载中请稍等...

基于启发式强化学习的多智能体覆盖问题研究

参考文献1

二级参考文献47

共引文献18

相关作者

相关机构

相关主题

浏览历史