Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

下载PDF

导出

摘要 Dear Editor,In this letter,the multi-objective optimal control problem of nonlinear discrete-time systems is investigated.A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy.In the policy improvement process,the policy gradient based method is employed.

作者 Hao Zhang Yan Li Zhuping Wang Yi Ding Huaicheng Yan

机构地区 the Department of Control Science and Engineering the Key Laboratory of Advanced Control and Optimization for Chemical Processes of Ministry of Education

出处《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第4期1060-1062,共3页 自动化学报（英文版）

基金 the National Natural Science Foundation of China(61922063,62273255,62150026) in part by the Shanghai International Science and Technology Cooperation Project(21550760900,22510712000) the Shanghai Municipal Science and Technology Major Project(2021SHZDZX0100) the Fundamental Research Funds for the Central Universities。

关键词 POLICY GRADIENT OPTIMAL

分类号 O232 [理学—运筹学与控制论]

引文网络
相关文献

参考文献1

1Jie Chen,Jian Sun,Gang Wang.From Unmanned Systems to Autonomous Intelligent Systems[J].Engineering,2022,8(5):16-19. 被引量：19

二级参考文献3

1孙健,邓方,陈杰.陆用运动体控制系统发展现状与趋势[J].自动化学报,2018,44(11):1985-1999. 被引量：10
2Kexin GUO,Xiuxian LI,Lihua XIE.Simultaneous cooperative relative localization and distributed formation control for multiple UAVs[J].Science China(Information Sciences),2020,63(1):234-236. 被引量：5
3Lele XI,Zhihong PENG,Lei JIAO,Ben M.CHEN.Smooth quadrotor trajectory generation for tracking a moving target in cluttered environments[J].Science China(Information Sciences),2021,64(7):128-143. 被引量：3

共引文献18

1Xia Jiang,Xianlin Zeng,Jian Sun,Jie Chen,Yue Wei.A Fully Distributed Hybrid Control Framework For Non-Differentiable Multi-Agent Optimization[J].IEEE/CAA Journal of Automatica Sinica,2022,9(10):1792-1800.
2段海滨,何杭轩,赵彦杰,王寅,霍梦真,牛轶峰,范彦铭,朱纪洪,袁莞迈,邓亦敏,李轩,罗德林.2022年无人机热点回眸[J].科技导报,2023,41(1):215-229. 被引量：4
3孙浩亮,孙建军.自主无人平台作战运用探要[J].国防科技,2023,44(2):128-135.
4侯娜,潘婧,于艳丽.武装无人机袭击对全球冲突的影响[J].国防科技,2023,44(3):111-120. 被引量：1
5胡景博,蒋平,杨克巍.无人智能装备应用分析[J].军民两用技术与产品,2023(6):10-13.
6沈博,武文亮,杨刚,周兴社.基于群体OODA的无人集群系统智能评价模型及方法[J].航空学报,2023,44(14):258-273. 被引量：4
7王志远,慈芳慧,尚俊颖.自主智能无人系统在体育场馆中的系统设计和应用分析——以全民健身为例[J].当代体育科技,2023,13(24):77-82. 被引量：4
8Yunzhe Men,Jian Sun,Jie Chen.Control of 2-D Semi-Markov Jump Systems:A View from Mode Generation Mechanism[J].IEEE/CAA Journal of Automatica Sinica,2024,11(1):258-260.
9Wenbo Li,Baoling Ning.Autonomous Recommendation of Fault Detection Algorithms for Spacecraft[J].IEEE/CAA Journal of Automatica Sinica,2024,11(1):273-275. 被引量：1
10MEN Yunzhe,SUN Jian.Composite Anti-Disturbance Control of Hidden Semi-Markov Jump Systems via Disturbance Observer[J].Journal of Systems Science & Complexity,2023,36(6):2255-2273.

1Chen AN,Jiaxi ZHOU,Kai WANG.Adaptive state-constrained/model-free iterative sliding mode control for aerial robot trajectory tracking[J].Applied Mathematics and Mechanics(English Edition),2024,45(4):603-618. 被引量：1
2ZHANG RuiXian,YANG JiaNan,LIANG Ye,LU ShengAo,DONG YiFei,YANG BaoQing,ZHANG LiXian.Navigation for autonomous vehicles via fast-stable and smooth reinforcement learning[J].Science China(Technological Sciences),2024,67(2):423-434.
3Shanshan ZHENG,Shuai LIU,Licheng WANG.Event-triggered distributed optimization for model-free multi-agent systems[J].Frontiers of Information Technology & Electronic Engineering,2024,25(2):214-224.
4HOU Tan,LI Yuanlong,LIN Zongli.An Improved Method for Approximating the Infinite-Horizon Value Function of the Discrete-Time Switched LQR Problem[J].Journal of Systems Science & Complexity,2024,37(1):22-39.
5Chun LI,Jinliang DING,Frank LLEWIS,Tianyou CHAI.Error-based adaptive optimal tracking control of nonlinear discrete-time systems[J].Science China(Information Sciences),2024,67(1):150-163. 被引量：1
6ZHANG RuiXian,HAN YiNing,SU Man,LIN ZeFeng,LI HaoWei,ZHANG LiXian.Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots[J].Science China(Technological Sciences),2024,67(1):172-182. 被引量：1
7Qi Lü,Bowen Ma.Time-inconsistent stochastic linear-quadratic control problem with indefinite control weight costs[J].Science China Mathematics,2024,67(1):211-236. 被引量：1
8Xiaoxuan Zhong,Yizhi Liang,Xiaoyu Wang,Haoying Lan,Xue Bai,Long Jin,Bai-Ou Guan.Free-moving-state microscopic imaging of cerebral oxygenation and hemodynamics with a photoacoustic fiberscope[J].Light(Science & Applications),2024,13(1):33-46. 被引量：5
9Guangchen WANG,Heng ZHANG.Value iteration algorithm for continuous-time linear quadratic stochastic optimal control problems[J].Science China(Information Sciences),2024,67(2):166-176.
10胡存刚,尹政,芮涛,陆格野,曹文平,唐曦.计及采样扰动抑制的电压源逆变器三矢量无模型预测电流控制方法[J].中国电机工程学报,2024,44(6):2408-2417. 被引量：1

IEEE/CAA Journal of Automatica Sinica

2024年第4期

浏览历史

内容加载中请稍等...

Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

参考文献1

二级参考文献3

共引文献18

相关作者

相关机构

相关主题

浏览历史