In the process of performing a task,autonomous unmanned systems face the problem of scene changing,which requires the ability of real-time decision-making under dynamically changing scenes.Therefore,taking the unmanne...In the process of performing a task,autonomous unmanned systems face the problem of scene changing,which requires the ability of real-time decision-making under dynamically changing scenes.Therefore,taking the unmanned system coordinative region control operation as an example,this paper combines knowledge representation with probabilistic decisionmaking and proposes a role-based Bayesian decision model for autonomous unmanned systems that integrates scene cognition and individual preferences.Firstly,according to utility value decision theory,the role-based utility value decision model is proposed to realize task coordination according to the preference of the role that individual is assigned.Then,multi-entity Bayesian network is introduced for situation assessment,by which scenes and their uncertainty related to the operation are semantically described,so that the unmanned systems can conduct situation awareness in a set of scenes with uncertainty.Finally,the effectiveness of the proposed method is verified in a virtual task scenario.This research has important reference value for realizing scene cognition,improving cooperative decision-making ability under dynamic scenes,and achieving swarm level autonomy of unmanned systems.展开更多
To reach a higher level of autonomy for unmanned combat aerial vehicle(UCAV) in air combat games, this paper builds an autonomous maneuver decision system. In this system,the air combat game is regarded as a Markov pr...To reach a higher level of autonomy for unmanned combat aerial vehicle(UCAV) in air combat games, this paper builds an autonomous maneuver decision system. In this system,the air combat game is regarded as a Markov process, so that the air combat situation can be effectively calculated via Bayesian inference theory. According to the situation assessment result,adaptively adjusts the weights of maneuver decision factors, which makes the objective function more reasonable and ensures the superiority situation for UCAV. As the air combat game is characterized by highly dynamic and a significant amount of uncertainty,to enhance the robustness and effectiveness of maneuver decision results, fuzzy logic is used to build the functions of four maneuver decision factors. Accuracy prediction of opponent aircraft is also essential to ensure making a good decision; therefore, a prediction model of opponent aircraft is designed based on the elementary maneuver method. Finally, the moving horizon optimization strategy is used to effectively model the whole air combat maneuver decision process. Various simulations are performed on typical scenario test and close-in dogfight, the results sufficiently demonstrate the superiority of the designed maneuver decision method.展开更多
The distributed cooperative decision problems of missiles autonomous formation with network packet loss are investigated by using the potential game based on formation principles.In particular,a dynamic target allocat...The distributed cooperative decision problems of missiles autonomous formation with network packet loss are investigated by using the potential game based on formation principles.In particular,a dynamic target allocation method for missiles formation is provided based on the potential game and formation principles,after the introduction of cooperative guidance and control system of the missiles formation.Then we seek the optimization of a global utility function through autonomous missiles that are capable of making individually rational decisions to optimize their own utility functions.The first important aspect of the problem is to design an individual utility function considering the characteristics of the missiles formation,with which the objective of the missiles are localized to each missile yet aligned with the global utility function.The second is to equip the missiles with an appropriate coordination mechanism with each missile pursuing the optimization of its own utility function.We present the design procedure for the utility,and present a coordination mechanism based on spatial adaptive play and then introduce the idea of“cyclical selected spatial adaptive play”and“negotiation based on time division multiple address(TDMA)protocol formation support network”.Finally,we present simulations for the distributed dynamic target allocation on the comprehensive digital simulation system,and the results illustrate the effectiveness and engineering applicability of the method.展开更多
This paper proposes an autonomous maneuver decision method using transfer learning pigeon-inspired optimization(TLPIO)for unmanned combat aerial vehicles(UCAVs)in dogfight engagements.Firstly,a nonlinear F-16 aircraft...This paper proposes an autonomous maneuver decision method using transfer learning pigeon-inspired optimization(TLPIO)for unmanned combat aerial vehicles(UCAVs)in dogfight engagements.Firstly,a nonlinear F-16 aircraft model and automatic control system are constructed by a MATLAB/Simulink platform.Secondly,a 3-degrees-of-freedom(3-DOF)aircraft model is used as a maneuvering command generator,and the expanded elemental maneuver library is designed,so that the aircraft state reachable set can be obtained.Then,the game matrix is composed with the air combat situation evaluation function calculated according to the angle and range threats.Finally,a key point is that the objective function to be optimized is designed using the game mixed strategy,and the optimal mixed strategy is obtained by TLPIO.Significantly,the proposed TLPIO does not initialize the population randomly,but adopts the transfer learning method based on Kullback-Leibler(KL)divergence to initialize the population,which improves the search accuracy of the optimization algorithm.Besides,the convergence and time complexity of TLPIO are discussed.Comparison analysis with other classical optimization algorithms highlights the advantage of TLPIO.In the simulation of air combat,three initial scenarios are set,namely,opposite,offensive and defensive conditions.The effectiveness performance of the proposed autonomous maneuver decision method is verified by simulation results.展开更多
The decision-making under complex urban environment become one of the key issues that restricts the rapid development of the autonomous vehicles. The difficulty in making timely and accurate decisions like human being...The decision-making under complex urban environment become one of the key issues that restricts the rapid development of the autonomous vehicles. The difficulty in making timely and accurate decisions like human beings under highly dynamic traffic environment is a major challenge for autonomous driving. Car-following has been regarded as the simplest but essential driving behavior among driving tasks and has received extensive attention from researchers around the world. This work addresses this problem and proposes a novel method RSAN(rough-set artificial neural network) to learn the decisions from excellent human drivers. A virtual urban traffic environment was built by Pre Scan and driving simulation was conducted to obtain a broad set of relevant data such as experienced drivers' behavior data and surrounding vehicles' motion data. Then, rough set was used to preprocess these data to extract the key influential factors on decision and reduce the impact of uncertain data and noise data. And the car-following decision was learned by neural network in which key factor was the input and acceleration was the output. The result shows the better convergence speed and the better decision accuracy of RSAN than ANN. Findings of this work contributes to the empirical understanding of driver's decision-making process and it provides a theoretical basis for the study of car-following decision-making under complex and dynamic environment.展开更多
自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取...自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取自车、道路、干扰车辆等共64维度状态空间信息作为输入数据集对决策模型进行训练,决策模型输出合理的驾驶行为以及控制量,为解决训练测试中的奖励和控制量突变问题,改进DDPG决策模型对决策控制效果进行优化,并在TORCS(the open racing car simulator)平台进行仿真实验验证。结果表明:所提出的决策模型可以根据车辆和环境实时状态信息输出合理的驾驶行为以及控制量,与DDPG模型相比,改进的模型具有更好的控制精度,且车辆横向速度显著减小,车辆舒适性以及车辆稳定性明显改善。展开更多
基金the Military Science Postgraduate Project of PLA(JY2020B006).
文摘In the process of performing a task,autonomous unmanned systems face the problem of scene changing,which requires the ability of real-time decision-making under dynamically changing scenes.Therefore,taking the unmanned system coordinative region control operation as an example,this paper combines knowledge representation with probabilistic decisionmaking and proposes a role-based Bayesian decision model for autonomous unmanned systems that integrates scene cognition and individual preferences.Firstly,according to utility value decision theory,the role-based utility value decision model is proposed to realize task coordination according to the preference of the role that individual is assigned.Then,multi-entity Bayesian network is introduced for situation assessment,by which scenes and their uncertainty related to the operation are semantically described,so that the unmanned systems can conduct situation awareness in a set of scenes with uncertainty.Finally,the effectiveness of the proposed method is verified in a virtual task scenario.This research has important reference value for realizing scene cognition,improving cooperative decision-making ability under dynamic scenes,and achieving swarm level autonomy of unmanned systems.
基金supported by the National Natural Science Foundation of China(61601505)the Aeronautical Science Foundation of China(20155196022)the Shaanxi Natural Science Foundation of China(2016JQ6050)
文摘To reach a higher level of autonomy for unmanned combat aerial vehicle(UCAV) in air combat games, this paper builds an autonomous maneuver decision system. In this system,the air combat game is regarded as a Markov process, so that the air combat situation can be effectively calculated via Bayesian inference theory. According to the situation assessment result,adaptively adjusts the weights of maneuver decision factors, which makes the objective function more reasonable and ensures the superiority situation for UCAV. As the air combat game is characterized by highly dynamic and a significant amount of uncertainty,to enhance the robustness and effectiveness of maneuver decision results, fuzzy logic is used to build the functions of four maneuver decision factors. Accuracy prediction of opponent aircraft is also essential to ensure making a good decision; therefore, a prediction model of opponent aircraft is designed based on the elementary maneuver method. Finally, the moving horizon optimization strategy is used to effectively model the whole air combat maneuver decision process. Various simulations are performed on typical scenario test and close-in dogfight, the results sufficiently demonstrate the superiority of the designed maneuver decision method.
基金supported by the Industrial Technology Development Program(B1120131046)
文摘The distributed cooperative decision problems of missiles autonomous formation with network packet loss are investigated by using the potential game based on formation principles.In particular,a dynamic target allocation method for missiles formation is provided based on the potential game and formation principles,after the introduction of cooperative guidance and control system of the missiles formation.Then we seek the optimization of a global utility function through autonomous missiles that are capable of making individually rational decisions to optimize their own utility functions.The first important aspect of the problem is to design an individual utility function considering the characteristics of the missiles formation,with which the objective of the missiles are localized to each missile yet aligned with the global utility function.The second is to equip the missiles with an appropriate coordination mechanism with each missile pursuing the optimization of its own utility function.We present the design procedure for the utility,and present a coordination mechanism based on spatial adaptive play and then introduce the idea of“cyclical selected spatial adaptive play”and“negotiation based on time division multiple address(TDMA)protocol formation support network”.Finally,we present simulations for the distributed dynamic target allocation on the comprehensive digital simulation system,and the results illustrate the effectiveness and engineering applicability of the method.
基金the Science and Technology Innovation 2030-Key Project of“New Generation Artificial Intelligence”(2018AAA0100803)the National Natural Science Foundation of China(U20B2071,91948204,T2121003,U1913602)。
文摘This paper proposes an autonomous maneuver decision method using transfer learning pigeon-inspired optimization(TLPIO)for unmanned combat aerial vehicles(UCAVs)in dogfight engagements.Firstly,a nonlinear F-16 aircraft model and automatic control system are constructed by a MATLAB/Simulink platform.Secondly,a 3-degrees-of-freedom(3-DOF)aircraft model is used as a maneuvering command generator,and the expanded elemental maneuver library is designed,so that the aircraft state reachable set can be obtained.Then,the game matrix is composed with the air combat situation evaluation function calculated according to the angle and range threats.Finally,a key point is that the objective function to be optimized is designed using the game mixed strategy,and the optimal mixed strategy is obtained by TLPIO.Significantly,the proposed TLPIO does not initialize the population randomly,but adopts the transfer learning method based on Kullback-Leibler(KL)divergence to initialize the population,which improves the search accuracy of the optimization algorithm.Besides,the convergence and time complexity of TLPIO are discussed.Comparison analysis with other classical optimization algorithms highlights the advantage of TLPIO.In the simulation of air combat,three initial scenarios are set,namely,opposite,offensive and defensive conditions.The effectiveness performance of the proposed autonomous maneuver decision method is verified by simulation results.
基金Project(9142020013)support by the National Natural Science Foundation of China
文摘The decision-making under complex urban environment become one of the key issues that restricts the rapid development of the autonomous vehicles. The difficulty in making timely and accurate decisions like human beings under highly dynamic traffic environment is a major challenge for autonomous driving. Car-following has been regarded as the simplest but essential driving behavior among driving tasks and has received extensive attention from researchers around the world. This work addresses this problem and proposes a novel method RSAN(rough-set artificial neural network) to learn the decisions from excellent human drivers. A virtual urban traffic environment was built by Pre Scan and driving simulation was conducted to obtain a broad set of relevant data such as experienced drivers' behavior data and surrounding vehicles' motion data. Then, rough set was used to preprocess these data to extract the key influential factors on decision and reduce the impact of uncertain data and noise data. And the car-following decision was learned by neural network in which key factor was the input and acceleration was the output. The result shows the better convergence speed and the better decision accuracy of RSAN than ANN. Findings of this work contributes to the empirical understanding of driver's decision-making process and it provides a theoretical basis for the study of car-following decision-making under complex and dynamic environment.
文摘自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取自车、道路、干扰车辆等共64维度状态空间信息作为输入数据集对决策模型进行训练,决策模型输出合理的驾驶行为以及控制量,为解决训练测试中的奖励和控制量突变问题,改进DDPG决策模型对决策控制效果进行优化,并在TORCS(the open racing car simulator)平台进行仿真实验验证。结果表明:所提出的决策模型可以根据车辆和环境实时状态信息输出合理的驾驶行为以及控制量,与DDPG模型相比,改进的模型具有更好的控制精度,且车辆横向速度显著减小,车辆舒适性以及车辆稳定性明显改善。