摘要
采用伴随 - BP技术 ,将微分对策的两点边值求解问题转化为两个神经网络的学习问题 ,训练后的两个神经网络分别作为对策双方的最优控制器在线使用 ,避免了直接求解复杂的两点边值问题。对追逃微分对策问题的仿真结果表明 ,该方法对初始条件和噪声具有较好的鲁棒性。
Based on the adjoint-backpropagation technique,two point boundary value problem(TPBVP) of differential games is solved using two neural networks.The neural networks automatically adjust their weights to minimize and maximize the cost function of differential game system respectively.The converged neural networks can be used as the feedback optimal differential games controllers on-line,avoiding solving the complex two point boundary value problem directly. The simulation results for pursing-escaping differential games prove that the neural networks controllers present good robustness with respect to initial conditions and measuring noise.
出处
《控制与决策》
EI
CSCD
北大核心
2003年第1期123-125,共3页
Control and Decision
基金
国家自然科学基金资助项目 (6 990 40 02 )
国防预研基金资助项目 (0 0 J1.1.3HK010 2 )
关键词
神经网络
微分对策
控制器
设计
最优控制
学习问题
Differential games
Neural networks
Optimal control
Two point boundary value