期刊文献+

改进遗传算法进化的演员网络种群强化学习算法

Evolving Actor Network Population Algorithm by Improved Genetic in Reinforcement Learning
下载PDF
导出
摘要 深度强化学习算法已成功应用于一系列具有挑战性的任务,然而这些方法通常会遇到奖励稀疏的时间信用分配、缺乏有效的探索以及探索经验不足等问题。演化算法是一类受自然进化启发的黑盒优化技术,算法提出了改进的混沌遗传算法以及量子遗传算法分别与强化学习算法结合,首先创建用于进化计算演员网络的总体,并使用梯度下降来更新网络参数,进化种群中的网络,直至算法收敛。算法的适应度度量整合强化学习中事件的回报,一定程度上解决了稀疏奖励条件下的时间信用分配问题;利用种群的方法来生成各种经验训练RL智能体,提高了鲁棒性。在离散和连续的强化学习环境中做了对比实验和消融实验,实验证明本文的算法能收敛到更高的奖励值,且能提高收敛速度。Deep reinforcement learning algorithms have been successfully applied to a range of challenging tasks;however, these methods often encounter problems such as sparse reward time credit allocation, lack of effective exploration, and insufficient exploration experience. Evolutionary algorithm is a type of black box optimization technique inspired by natural evolution. Improved chaotic genetic algorithm and quantum genetic algorithm are proposed to be combined with reinforcement learning algorithm. The algorithm first creates a population for evolutionary computation of actor networks and uses gradient descent to update network parameters, evolving the network in the population until the algorithm converges. The fitness measurement of the algorithm integrates the reward of events in reinforcement learning, which to some extent solves the problem of time credit allocation under sparse reward conditions;The use of population methods to generate various experience trained RL agents has improved robustness. Comparative experiments and ablation experiments were conducted in both discrete and continuous reinforcement learning environments, demonstrating that our algorithm can converge to higher reward values and improve convergence speed.
出处 《计算机科学与应用》 2024年第10期102-109,共8页 Computer Science and Application
  • 相关文献

参考文献6

二级参考文献68

  • 1崔乃刚,王平,郭继峰,程兴.空间在轨服务技术发展综述[J].宇航学报,2007,28(4):805-811. 被引量:166
  • 2周传华,钱锋.改进量子遗传算法及其应用[J].计算机应用,2008,28(2):286-288. 被引量:33
  • 3Narayanan A,MOORE M.Quantum-inspired genetic algorithm[C]//Proc of IEEE Internation on Conference on Congress onEvolutionaryComputation.1996:61-66.
  • 4Han K H,Kim J H.Genetic quantum algorithm and its applicationto combinatorial optimization problem[C]// Proc of IEEECongress on Evolutionary Computation,2000: 1354-1360.
  • 5Gao Lin,Gu Xingsheng.A Novel Real-coded Quantum-inspiredGenetic Algorithm and Its Application in Data Reconciliation[J].International Journal of computational intelligence systems,2012,5(3):413-420.
  • 6Sun Y,Xiong H G. Real Coded Quantum Genetic Algorithm and itsApplication[J].Journul of Engineering Science and TechnologyReview,2013,6(5):25-32.
  • 7Liu J,Wang H,Sun Y.Real-Coded Quantum-Inspired GeneticAlgorithm-Based BP Neural Network Algorithm[J]. MathematicalProblems in Engineering,2015,(1): 1-10.
  • 8Lei G,Yin X,Shi W.Research on Network Congestion ControlBased on Quantum Genetic Algorithm[J].Applied Mechanics &Materials,2014,513(2):845-849.
  • 9Lv H.A novel Quantum Genetic Algorithm in TSP[J].AppliedMechanics & Materials, 2014,519(8):759-763.
  • 10Mousa A A,Elattar E E.Best Compromise Alternative to EELDProblem using Hybrid Multiobjective Quantum Genetic Algorithm[J].Applied mathematics & information sciences,2014,8(6):2889-2902.

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部