期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
关于GAK-空间与(q)-性质
1
作者 罗成 王刚 于亚璇 《内蒙古大学学报(自然科学版)》 CAS CSCD 北大核心 2002年第6期603-605,共3页
文献〔2〕引入Banach空间的(q)-性质与GAK空间的概念,主要证明了Banach空间X具有(q)-性质的充要条件为lp[X]是GAK-空间.
关键词 GAK-空间 (q)-性质 BANACH空间 充要条件 弱序列完备性
下载PDF
A novel policy iteration based deterministic Q-learning for discrete-time nonlinear systems 被引量:8
2
作者 WEI QingLai LIU DeRong 《Science China Chemistry》 SCIE EI CAS CSCD 2015年第12期143-157,共15页
In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic no... In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic nonlinear systems. The idea is to use an iterative adaptive dynamic programming(ADP) technique to construct the iterative control law which optimizes the iterative Q function. When the optimal Q function is obtained, the optimal control law can be achieved by directly minimizing the optimal Q function, where the mathematical model of the system is not necessary. Convergence property is analyzed to show that the iterative Q function is monotonically non-increasing and converges to the solution of the optimality equation. It is also proven that any of the iterative control laws is a stable control law. Neural networks are employed to implement the policy iteration based deterministic Q-learning algorithm, by approximating the iterative Q function and the iterative control law, respectively. Finally, two simulation examples are presented to illustrate the performance of the developed algorithm. 展开更多
关键词 adaptive critic designs adaptive dynamic programming approximate dynamic programming q-LEARNING policy iteration neural networks nonlinear systems optimal control
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部