Improving deep reinforcement learning by safety guarding model via hazardous experience planning


Abstract: 1 Introduction and main contributions. Deep reinforcement learning, which combines the advantages of deep learning and reinforcement learning, has achieved success in many fields [1]. However, during the learning process, the agent may still fail the task by taking improper actions that lead it into hazardous states.

Authors: Pai Peng, Fei Zhu, Xinghong Ling, Peiyao Zhao, Quan Liu

Affiliation: School of Computer Science and Technology

Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2022, Issue 4, pp. 223-225, 3 pages in total. Chinese title: 中国计算机科学前沿（英文版）

Funding: Supported by the National Natural Science Foundation of China (Grant No. 61303108), the Natural Science Foundation of Jiangsu Province (BK20211102), the Suzhou Key Industries Technological Innovation-Prospective Applied Research Project (SYG201804), and a project funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions.

Keywords: GUARD, learning, HAS

Classification: TP39 [Automation and Computer Technology: Computer Application Technology]

