期刊文献+

Improving deep reinforcement learning by safety guarding model via hazardous experience planning

原文传递
导出
摘要 1Introduction and main contributions Deep reinforcement learning that considers the advantages of both deep learning and reinforcement learning has achieved success in many fields[1],However,during the learning process,a possibility still exists that the agent fails in the task because of falling into hazardous states due to taking improper actions.
出处 《Frontiers of Computer Science》 SCIE EI CSCD 2022年第4期223-225,共3页 中国计算机科学前沿(英文版)
基金 supported by the National Natural Science Foundation of China(Grant No.61303108) Natural Science Foundation of Jiangsu Province(BK20211102) Suzhou Key,Industries Technological Innovation-Prospective_Applied Research Project(SYG201804) A Project Funded by the Priority Academic Program Development of JiangsuHigher Education Institutions.
关键词 GUARD learning HAS
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部