期刊文献+

Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

下载PDF
导出
摘要 Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy.In the policy improvement process,the policy gradient based method is employed.
出处 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第4期1060-1062,共3页 自动化学报(英文版)
基金 the National Natural Science Foundation of China(61922063,62273255,62150026) in part by the Shanghai International Science and Technology Cooperation Project(21550760900,22510712000) the Shanghai Municipal Science and Technology Major Project(2021SHZDZX0100) the Fundamental Research Funds for the Central Universities。
关键词 POLICY GRADIENT OPTIMAL
  • 相关文献

参考文献1

二级参考文献3

共引文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部