Research on Deep Reinforcement Learning for Intelligent Obstacle Avoidance Scenarios

Cited by: 2
Abstract: This paper studies the design of algorithm models for obstacle avoidance scenarios based on deep reinforcement learning. An improved Deep Q-learning Network (DQN) algorithm is adopted to overcome a limitation of tabular Q-learning, which runs out of memory when the state space is continuous. Because sparse rewards during learning make it hard to obtain good results, the reward mechanism is improved by adding real-time rewards and penalties as a supplement, which addresses long learning times and unstable training. The state is described by relative angle, position, and distance information, which lets the agent avoid obstacles more effectively than absolute coordinate information does. Unlike traditional hand-crafted obstacle avoidance algorithms such as the grid method and the visibility graph method, the deep reinforcement learning algorithm DQN can make decisions autonomously without prior knowledge, so it is more widely applicable. The technique can be applied in real-world scenarios such as warehouse unmanned vehicles, inspection robots, and drones.
Authors: LIU Qing-jie (刘庆杰); LIN You-yong (林友勇); LI Shao-li (李少利) (CETHIK Research Institute, Hangzhou 310012, China)
Source: Technology of IoT & AI (《智能物联技术》), 2018, No. 2, pp. 18-22 (5 pages)
Keywords: deep reinforcement learning; DQN; autonomous decision-making; obstacle avoidance
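The abstract names two design choices without giving formulas: observations built from relative angle and distance rather than absolute coordinates, and a sparse terminal reward supplemented by real-time rewards and penalties. The sketch below is one plausible reading of those ideas, not the authors' implementation. The function names `relative_observation` and `shaped_reward`, the radii, and every reward constant are assumptions chosen for illustration.

```python
import math


def relative_observation(robot_xy, robot_heading, target_xy):
    """Return (distance, bearing) of a target as seen from the robot.

    bearing is the angle from the robot's heading to the target direction,
    wrapped to [-pi, pi). Both values are unchanged if the robot and the
    target are translated together, unlike absolute coordinates.
    """
    dx = target_xy[0] - robot_xy[0]
    dy = target_xy[1] - robot_xy[1]
    distance = math.hypot(dx, dy)
    bearing = math.atan2(dy, dx) - robot_heading
    bearing = (bearing + math.pi) % (2 * math.pi) - math.pi
    return distance, bearing


def shaped_reward(prev_goal_dist, goal_dist, obstacle_dist, *,
                  goal_radius=0.2, collision_radius=0.3, safe_radius=1.0):
    """Return (reward, done) for one step. All constants are hypothetical."""
    # Sparse terminal signals: collision or reaching the goal ends the episode.
    if obstacle_dist <= collision_radius:
        return -10.0, True
    if goal_dist <= goal_radius:
        return 10.0, True
    # Real-time reward: progress made toward the goal during this step.
    reward = prev_goal_dist - goal_dist
    # Real-time penalty: grows as the robot enters the safety margin.
    if obstacle_dist < safe_radius:
        reward -= 0.5 * (safe_radius - obstacle_dist)
    # Small per-step cost to discourage wandering.
    reward -= 0.01
    return reward, False
```

In a DQN training loop, the pairs returned by `relative_observation` for the goal and the nearest obstacle could be concatenated into the network input, and `shaped_reward` would supply a learning signal on every step instead of only at episode end.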