Reinforcement Learning(RL)has emerged as a promising data-driven solution for wargaming decision-making.However,two domain challenges still exist:(1)dealing with discrete-continuous hybrid wargaming control and(2)acce...Reinforcement Learning(RL)has emerged as a promising data-driven solution for wargaming decision-making.However,two domain challenges still exist:(1)dealing with discrete-continuous hybrid wargaming control and(2)accelerating RL deployment with rich offline data.Existing RL methods fail to handle these two issues simultaneously,thereby we propose a novel offline RL method targeting hybrid action space.A new constrained action representation technique is developed to build a bidirectional mapping between the original hybrid action space and a latent space in a semantically consistent way.This allows learning a continuous latent policy with offline RL with better exploration feasibility and scalability and reconstructing it back to a needed hybrid policy.Critically,a novel offline RL optimization objective with adaptively adjusted constraints is designed to balance the alleviation and generalization of out-of-distribution actions.Our method demonstrates superior performance and generality across different tasks,particularly in typical realistic wargaming scenarios.展开更多
This paper presents the complex dynamics synthesis of the combat dy-namics series called tensor-centric warfare (TCW;for the first three parts of the series, see [1] [2] [3]), which includes tensor generalization of c...This paper presents the complex dynamics synthesis of the combat dy-namics series called tensor-centric warfare (TCW;for the first three parts of the series, see [1] [2] [3]), which includes tensor generalization of classical Lanchester-type combat equations, entropic Lie-dragging and commutators for modeling warfare uncertainty and symmetry, and various delta-strikes and missiles (both deterministic and random). The present paper gives a unique synthesis of the Red vs. Blue vectorfields into a single complex battle-vectorfield, using dynamics on Kähler manifolds as a rigorous framework for extending the TCW concept. The global Kähler dynamics framework, with its rigorous underpinning called the Kähler-Ricci flow, provides not only a new insight into the “geometry of warfare”, but also into the “physics of warfare”, in terms of Lagrangian and Hamiltonian structures of the battlefields. It also provides a convenient and efficient computational framework for entropic wargaming.展开更多
文摘Reinforcement Learning(RL)has emerged as a promising data-driven solution for wargaming decision-making.However,two domain challenges still exist:(1)dealing with discrete-continuous hybrid wargaming control and(2)accelerating RL deployment with rich offline data.Existing RL methods fail to handle these two issues simultaneously,thereby we propose a novel offline RL method targeting hybrid action space.A new constrained action representation technique is developed to build a bidirectional mapping between the original hybrid action space and a latent space in a semantically consistent way.This allows learning a continuous latent policy with offline RL with better exploration feasibility and scalability and reconstructing it back to a needed hybrid policy.Critically,a novel offline RL optimization objective with adaptively adjusted constraints is designed to balance the alleviation and generalization of out-of-distribution actions.Our method demonstrates superior performance and generality across different tasks,particularly in typical realistic wargaming scenarios.
文摘This paper presents the complex dynamics synthesis of the combat dy-namics series called tensor-centric warfare (TCW;for the first three parts of the series, see [1] [2] [3]), which includes tensor generalization of classical Lanchester-type combat equations, entropic Lie-dragging and commutators for modeling warfare uncertainty and symmetry, and various delta-strikes and missiles (both deterministic and random). The present paper gives a unique synthesis of the Red vs. Blue vectorfields into a single complex battle-vectorfield, using dynamics on Kähler manifolds as a rigorous framework for extending the TCW concept. The global Kähler dynamics framework, with its rigorous underpinning called the Kähler-Ricci flow, provides not only a new insight into the “geometry of warfare”, but also into the “physics of warfare”, in terms of Lagrangian and Hamiltonian structures of the battlefields. It also provides a convenient and efficient computational framework for entropic wargaming.