Research on UAV navigation method based on improved CB-HAQL algorithm
Abstract: The case-based heuristically accelerated Q-learning (CB-HAQL) algorithm can fail to converge to a good policy because its performance depends on the quality of the case base. To address this problem, this paper proposes an improved CB-HAQL algorithm built on an effective trigger mechanism. First, a triggered case-base update mechanism is set according to the number of iterations: the case base is generated or updated only when a threshold is reached, which protects its quality. Second, a dynamic parameter adjusts the influence of cases on action selection, so that the agent sets the strength of the heuristic according to how well it has mastered the environment. Finally, experience-biased exploratory actions are added to speed up learning. Experiments show that the improved algorithm raises both policy quality and training speed, and the UAV's completion of the navigation task demonstrates the effectiveness of the learned policy.
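The three mechanisms named in the abstract can be pictured with a minimal Python sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration: the class `TriggeredCaseBase`, the functions `heuristic_weight` and `select_action`, the parameters `trigger_every`, `xi_max`, `k`, `eta`, and `bias`, the choice to let the heuristic weight grow with a state's visit count, and the heuristic bonus itself, which follows the general heuristically accelerated Q-learning form rather than anything stated in this record.

```python
import random


class TriggeredCaseBase:
    """Case base rebuilt only when the episode count hits a threshold (assumed rule)."""

    def __init__(self, trigger_every):
        self.trigger_every = trigger_every
        self.cases = {}  # state -> suggested action

    def maybe_update(self, episode, q, actions):
        # Skip the update unless the episode index is a positive multiple of the threshold.
        if episode == 0 or episode % self.trigger_every != 0:
            return False
        states = {s for (s, _) in q}
        # Assumed case content: the greedy action of the current Q-table.
        self.cases = {
            s: max(actions, key=lambda a: q.get((s, a), 0.0)) for s in states
        }
        return True


def heuristic_weight(visit_count, xi_max=1.0, k=5.0):
    """Assumed schedule: heuristic influence rises with how often a state was visited."""
    return xi_max * visit_count / (visit_count + k)


def select_action(q, case_base, state, actions, xi, epsilon, rng, eta=1.0, bias=0.5):
    """Epsilon-greedy choice over Q plus a weighted heuristic bonus (assumed form)."""
    suggested = case_base.cases.get(state)
    if rng.random() < epsilon:
        # Assumed reading of experience-biased exploration:
        # with probability `bias`, explore by following the stored case.
        if suggested is not None and rng.random() < bias:
            return suggested
        return rng.choice(actions)
    best_q = max(q.get((state, a), 0.0) for a in actions)

    def score(a):
        q_sa = q.get((state, a), 0.0)
        # Only the case-suggested action receives a bonus.
        h = best_q - q_sa + eta if a == suggested else 0.0
        return q_sa + xi * h

    return max(actions, key=score)


if __name__ == "__main__":
    actions = ["left", "right"]
    q = {("s0", "left"): 1.0, ("s0", "right"): 0.5}
    case_base = TriggeredCaseBase(trigger_every=10)
    rng = random.Random(0)
    for episode in (5, 10):
        updated = case_base.maybe_update(episode, q, actions)
        xi = heuristic_weight(visit_count=episode)
        action = select_action(q, case_base, "s0", actions, xi, 0.0, rng)
        print(episode, updated, action)
```

In this sketch the case base stays empty until the threshold episode, so early and possibly poor Q-values never become cases, and the weight `xi` keeps the heuristic from dominating action selection in states the agent has rarely visited.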
Authors: Hu Dandan (胡丹丹); Mo Yushuai (莫宇帅) (Robotics Institute, Civil Aviation University of China, Tianjin 300300, China)
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list; 2020, No. 7, pp. 2068-2071 (4 pages)
Keywords: UAV; obstacle avoidance; autonomous navigation; case-based heuristically accelerated Q-learning (CB-HAQL); trigger mechanism
