摘要
1Introduction and main contributions Deep reinforcement learning that considers the advantages of both deep learning and reinforcement learning has achieved success in many fields[1],However,during the learning process,a possibility still exists that the agent fails in the task because of falling into hazardous states due to taking improper actions.
基金
supported by the National Natural Science Foundation of China(Grant No.61303108)
Natural Science Foundation of Jiangsu Province(BK20211102)
Suzhou Key,Industries Technological Innovation-Prospective_Applied Research Project(SYG201804)
A Project Funded by the Priority Academic Program Development of JiangsuHigher Education Institutions.