Robustness Assessment of Asynchronous Advantage Actor-Critic Based on Dynamic Skewness and Sparseness Computation: A Parallel Computing View
Authors: Tong Chen, Ji-Qiang Liu, He Li, Shuo-Ru Wang, Wen-Jia Niu, En-Dong Tong, Liang Chang, Qi Alfred Chen, Gang Li. Journal of Computer Science & Technology (SCIE, EI, CSCD), 2021, Issue 5, pp. 1002-1021 (20 pages).
Abstract: Reinforcement learning as autonomous learning is greatly driving artificial intelligence (AI) development to practical applications. Having demonstrated the potential to significantly improve synchronously parallel learning, the parallel computing based asynchronous advantage actor-critic (A3C) opens a new door for reinforcement learning. Unfortunately, the acceleration's influence on A3C robustness has been largely overlooked. In this paper, we perform the first robustness assessment of A3C based on parallel computing. By perceiving the policy's action, we construct a global matrix of action probability deviation and define two novel measures of skewness and sparseness to form an integral robustness measure. Based on such static assessment, we then develop a dynamic robustness assessing algorithm through situational whole-space state sampling of changing episodes. Extensive experiments with different combinations of agent number and learning rate are implemented on an A3C-based pathfinding application, demonstrating that our proposed robustness assessment can effectively measure the robustness of A3C, which can achieve an accuracy of 83.3%.
Keywords: robustness assessment; skewness; sparseness; asynchronous advantage actor-critic; reinforcement learning
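
The abstract describes building a matrix of action probability deviations and summarizing it with skewness and sparseness measures, but it does not give the paper's definitions. The sketch below is only an illustration of that general idea under stated assumptions: the deviation is taken as an element-wise absolute difference between two (states x actions) probability tables, skewness is the standard Fisher-Pearson sample skewness, and sparseness is the Hoyer measure. All function names and the toy data are hypothetical, and none of this should be read as the authors' actual method.

```python
import math

def deviation_matrix(reference, observed):
    """Element-wise absolute deviation between two (states x actions) tables.
    ASSUMPTION: the paper's deviation matrix may be defined differently."""
    return [[abs(o - r) for r, o in zip(ref_row, obs_row)]
            for ref_row, obs_row in zip(reference, observed)]

def skewness(values):
    """Fisher-Pearson sample skewness g1 = m3 / m2**1.5 (stand-in measure).
    Returns 0.0 when the variance is zero."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return 0.0 if m2 == 0 else m3 / m2 ** 1.5

def hoyer_sparseness(values):
    """Hoyer sparseness (stand-in measure): 1 when only one entry is nonzero,
    0 when all entries are equal. By convention here, an all-zero or
    single-element input returns 0.0."""
    n = len(values)
    l1 = sum(abs(v) for v in values)
    l2 = math.sqrt(sum(v * v for v in values))
    if l2 == 0 or n < 2:
        return 0.0
    return (math.sqrt(n) - l1 / l2) / (math.sqrt(n) - 1)

# Toy data (hypothetical): two states, four actions each.
reference = [[0.7, 0.1, 0.1, 0.1], [0.25, 0.25, 0.25, 0.25]]
observed = [[0.1, 0.7, 0.1, 0.1], [0.25, 0.25, 0.25, 0.25]]

deviations = deviation_matrix(reference, observed)
flat = [d for row in deviations for d in row]
skew_value = skewness(flat)
sparse_value = hoyer_sparseness(flat)
print(skew_value, sparse_value)
```

In this toy case the policy changes in only one state, so the deviations are concentrated in a few entries; both stand-in measures respond to that concentration. How the paper combines its two measures into an integral robustness score is not stated in the abstract and is not modeled here.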