期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
A stable actor-critic algorithm for solving robotic tasks with multiple constraints
1
作者 Peiyao ZHAO Fei ZHU +1 位作者 Quan LIU xinghong ling 《Frontiers of Computer Science》 SCIE EI CSCD 2023年第4期233-235,共3页
1 Introduction.Deep reinforcement learning has achieved great success especially in game[1]and control areas.Unfortunately,for real world environments that involve more than one objectives.For example,an autonomous ca... 1 Introduction.Deep reinforcement learning has achieved great success especially in game[1]and control areas.Unfortunately,for real world environments that involve more than one objectives.For example,an autonomous car should consider constraints such as the driving speed,energy efficiency,comfort and safety of the passengers[2,3].To solve the problems,Constrained Markov Decision Process(CMDP)was proposed to model tasks with constraints[4,5]. 展开更多
关键词 CONSTRAINTS DEEP TASKS
原文传递
Improving deep reinforcement learning by safety guarding model via hazardous experience planning
2
作者 Pai Peng Fei Zhu +2 位作者 xinghong ling Peiyao Zhao Quan Liu 《Frontiers of Computer Science》 SCIE EI CSCD 2022年第4期223-225,共3页
1Introduction and main contributions Deep reinforcement learning that considers the advantages of both deep learning and reinforcement learning has achieved success in many fields[1],However,during the learning proces... 1Introduction and main contributions Deep reinforcement learning that considers the advantages of both deep learning and reinforcement learning has achieved success in many fields[1],However,during the learning process,a possibility still exists that the agent fails in the task because of falling into hazardous states due to taking improper actions. 展开更多
关键词 GUARD learning HAS
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部