三支决策将不确定样本划分至边界域进行延迟决策,但需基于损失函数确定阈值,以划分边界域,然而,损失函数通常需要先验知识,具有一定的主观性,因此对边界域划分能力不足。针对这种问题,构建一种多目标三支决策边界域求解方法,从而更好地...三支决策将不确定样本划分至边界域进行延迟决策,但需基于损失函数确定阈值,以划分边界域,然而,损失函数通常需要先验知识,具有一定的主观性,因此对边界域划分能力不足。针对这种问题,构建一种多目标三支决策边界域求解方法,从而更好地划分边界域,提升分类性能。采用贝叶斯规则获取样本的条件概率;设定3个目标,包括降低边界域的不确定性、缩小边界域的大小以及减小整个决策区域的错误分类率,通过融入熵权法的TOPSIS(technique for order preference by similarity to an ideal solution)方法求取最优阈值,该方法采用熵权法计算这3个目标所占的权重,得到最优阈值,获得边界域,进行延迟决策;结合不同分类器对边界域进行分类。通过UCI数据集进行对比实验,根据分类准确率和F1值,表明该方法学习到的阈值能合理地划分边界域,建立的模型能取得更好的分类性能。展开更多
The pursuit-evasion game models the strategic interaction among players, attracting attention in many realistic scenarios, such as missile guidance, unmanned aerial vehicles, and target defense. Existing studies mainl...The pursuit-evasion game models the strategic interaction among players, attracting attention in many realistic scenarios, such as missile guidance, unmanned aerial vehicles, and target defense. Existing studies mainly concentrate on the cooperative pursuit of multiple players in two-dimensional pursuit-evasion games. However, these approaches can hardly be applied to practical situations where players usually move in three-dimensional space with a three-degree-of-freedom control. In this paper,we make the first attempt to investigate the equilibrium strategy of the realistic pursuit-evasion game, in which the pursuer follows a three-degree-of-freedom control, and the evader moves freely. First, we describe the pursuer's three-degree-of-freedom control and the evader's relative coordinate. We then rigorously derive the equilibrium strategy by solving the retrogressive path equation according to the Hamilton-Jacobi-Bellman-Isaacs(HJBI) method, which divides the pursuit-evasion process into the navigation and acceleration phases. Besides, we analyze the maximum allowable speed for the pursuer to capture the evader successfully and provide the strategy with which the evader can escape when the pursuer's speed exceeds the threshold. We further conduct comparison tests with various unilateral deviations to verify that the proposed strategy forms a Nash equilibrium.展开更多
文摘三支决策将不确定样本划分至边界域进行延迟决策,但需基于损失函数确定阈值,以划分边界域,然而,损失函数通常需要先验知识,具有一定的主观性,因此对边界域划分能力不足。针对这种问题,构建一种多目标三支决策边界域求解方法,从而更好地划分边界域,提升分类性能。采用贝叶斯规则获取样本的条件概率;设定3个目标,包括降低边界域的不确定性、缩小边界域的大小以及减小整个决策区域的错误分类率,通过融入熵权法的TOPSIS(technique for order preference by similarity to an ideal solution)方法求取最优阈值,该方法采用熵权法计算这3个目标所占的权重,得到最优阈值,获得边界域,进行延迟决策;结合不同分类器对边界域进行分类。通过UCI数据集进行对比实验,根据分类准确率和F1值,表明该方法学习到的阈值能合理地划分边界域,建立的模型能取得更好的分类性能。
基金supported in part by the Strategic Priority Research Program of Chinese Academy of Sciences(XDA27030100)National Natural Science Foundation of China(72293575, 11832001)。
文摘The pursuit-evasion game models the strategic interaction among players, attracting attention in many realistic scenarios, such as missile guidance, unmanned aerial vehicles, and target defense. Existing studies mainly concentrate on the cooperative pursuit of multiple players in two-dimensional pursuit-evasion games. However, these approaches can hardly be applied to practical situations where players usually move in three-dimensional space with a three-degree-of-freedom control. In this paper,we make the first attempt to investigate the equilibrium strategy of the realistic pursuit-evasion game, in which the pursuer follows a three-degree-of-freedom control, and the evader moves freely. First, we describe the pursuer's three-degree-of-freedom control and the evader's relative coordinate. We then rigorously derive the equilibrium strategy by solving the retrogressive path equation according to the Hamilton-Jacobi-Bellman-Isaacs(HJBI) method, which divides the pursuit-evasion process into the navigation and acceleration phases. Besides, we analyze the maximum allowable speed for the pursuer to capture the evader successfully and provide the strategy with which the evader can escape when the pursuer's speed exceeds the threshold. We further conduct comparison tests with various unilateral deviations to verify that the proposed strategy forms a Nash equilibrium.