Funding: Supported in part by the National Key Laboratory of Air-based Information Perception and Fusion and the Aeronautical Science Foundation of China (Grant No. 20220001068001); the National Natural Science Foundation of China (Grant No. 61673327); the Natural Science Basic Research Plan in Shaanxi Province, China (Grant No. 2023-JC-QN-0733); and the China Industry-University-Research Innovation Foundation (Grant No. 2022IT188).
Abstract: To address multi-UAV pursuit-evasion confrontation, a UAV cooperative maneuver method based on improved multi-agent deep reinforcement learning (MADRL) is proposed. In this method, an improved CommNet network based on a communication mechanism is introduced into a deep reinforcement learning algorithm to solve the multi-agent problem. A gated recurrent unit (GRU) layer is added to the actor network to remember historical environmental states. Another GRU is then designed as a communication channel in the CommNet core network layer to refine the information exchanged between UAVs. Finally, simulation results for the algorithm in two sets of scenarios are given, and they indicate that the method is effective and applicable in both.
Funding: The National Natural Science Foundation of China (No. 61271207, 61372104); the National Science and Technology Major Project (No. 2010ZX0300600201); the Specialized Development Foundation for the Achievement Transformation of Jiangsu Province (No. BA2010023).
Abstract: To achieve higher spectrum efficiency in cognitive radio (CR) systems, a closed-form expression of the optimal decision threshold for soft-decision cooperative spectrum sensing is derived under the minimum total error probability criterion. Using this analytical expression, the impact of different sensing parameters on the threshold value is studied. Theoretical analyses show that the optimal threshold achieves an efficient trade-off between the missed detection probability and the false alarm probability. Simulation results illustrate that the average signal-to-noise ratio (SNR) and the soft combination scheme have a great influence on the optimal threshold value, whereas the number of samples has only a weak impact. Furthermore, for the maximal ratio combining (MRC) and the modified deflection coefficient (MDC) schemes, the optimal decision threshold increases and approaches a corresponding individual limit value as the number of CR users increases. For the equal gain combining (EGC) scheme, however, the number of CR users has only a weak influence on the optimal decision threshold.
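The trade-off described in this abstract can be illustrated numerically. The sketch below does not reproduce the paper's closed-form expression, which is not given in the abstract. It assumes, purely for illustration, that the combined test statistic is Gaussian under both hypotheses and that the total error is the unweighted sum of the false alarm and missed detection probabilities. The function names and the parameters `mu0`, `sigma0`, `mu1`, `sigma1` are hypothetical.

```python
import math


def q_function(x: float) -> float:
    """Gaussian tail probability Q(x) = P(Z > x) for a standard normal Z."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def total_error(threshold: float, mu0: float, sigma0: float,
                mu1: float, sigma1: float) -> float:
    """Unweighted total error P_f + P_m (an assumption, see lead-in).

    The statistic is assumed N(mu0, sigma0^2) when the channel is idle
    and N(mu1, sigma1^2) when a primary user is present.
    """
    p_false_alarm = q_function((threshold - mu0) / sigma0)
    p_missed = 1.0 - q_function((threshold - mu1) / sigma1)
    return p_false_alarm + p_missed


def best_threshold(mu0: float, sigma0: float, mu1: float, sigma1: float,
                   steps: int = 10_000) -> float:
    """Grid search for the threshold with the smallest total error.

    The search is restricted to [mu0, mu1], which assumes mu1 > mu0 and
    that the minimum lies between the two means.
    """
    candidates = (mu0 + (mu1 - mu0) * i / steps for i in range(steps + 1))
    return min(candidates,
               key=lambda t: total_error(t, mu0, sigma0, mu1, sigma1))


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the paper.
    t = best_threshold(mu0=1.0, sigma0=0.1, mu1=1.5, sigma1=0.15)
    print(f"threshold = {t:.4f}, "
          f"total error = {total_error(t, 1.0, 0.1, 1.5, 0.15):.4f}")
```

Raising the threshold lowers the false alarm probability and raises the missed detection probability, so the minimum sits where the two effects balance. A grid search is used here only because it is easy to verify; it is not a substitute for an analytical solution.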
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61933010 and 61903301) and the Shaanxi Aerospace Flight Vehicle Design Key Laboratory.
Abstract: Cooperative autonomous air combat of multiple unmanned aerial vehicles (UAVs) is one of the main combat modes in future air warfare, and it becomes even more complicated when the situation changes rapidly and information about the opponents is uncertain. As such, this paper presents a cooperative decision-making method based on an incomplete information dynamic game to generate maneuver strategies for multiple UAVs in air combat. Firstly, a cooperative situation assessment model is presented to measure the overall combat situation. Secondly, an incomplete information dynamic game model is proposed to model the dynamic process of air combat, and a dynamic Bayesian network is designed to infer the tactical intention of the opponent. Then a reinforcement learning framework based on multi-agent deep deterministic policy gradient is established to obtain the perfect Bayes-Nash equilibrium solution of the air combat game model. Finally, a series of simulations is conducted to verify the effectiveness of the proposed method, and the simulation results show effective synergies and cooperative tactics.
Abstract: Wireless sensor networks (WSNs) combined with cognitive radio have been developed to address the limited availability of frequency spectrum. In this paper, we present different types of spectrum sensing, whose decisions depend on the probabilities applied at the fusion center, and show how these probability-based techniques help to improve the energy consumption of WSNs. We likewise examine the importance of designing a balanced distribution between the wireless sensor networks and their sinks. This research also provides an overview of security issues in CR-WSN, especially Spectrum Sensing Data Falsification (SSDF) attacks, which have harmful effects on spectrum sensing and spectrum sharing. We apply the OR rule across four types of CRSN sensing protocol in a greenhouse application, using the Matlab and Netsim simulators. Our results show that a balanced design of wireless sensors and their sinks in greenhouses significantly reduces energy consumption, which is otherwise driven up by traffic congestion in the sink range area. Furthermore, applying the OR rule improves energy consumption and extends the sensor network lifetime compared with a cognitive radio network.
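The OR rule mentioned in this abstract is a standard hard-decision fusion rule: the fusion center declares the channel occupied if at least one sensor reports a detection. The sketch below shows the rule and the resulting global probabilities. It assumes that the sensors decide independently, which the abstract does not state, and all function and parameter names are hypothetical.

```python
from math import prod
from typing import Iterable, Sequence, Tuple


def or_rule_fusion(local_decisions: Iterable[bool]) -> bool:
    """Global decision: occupied if any sensor reports a detection."""
    return any(local_decisions)


def or_rule_probabilities(p_detect: Sequence[float],
                          p_false_alarm: Sequence[float]) -> Tuple[float, float]:
    """Global detection and false alarm probabilities under the OR rule.

    Assumes independent sensors, so the global event "no sensor fires"
    has probability equal to the product of the per-sensor
    "does not fire" probabilities.
    """
    q_detect = 1.0 - prod(1.0 - p for p in p_detect)
    q_false_alarm = 1.0 - prod(1.0 - p for p in p_false_alarm)
    return q_detect, q_false_alarm


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the paper.
    print(or_rule_fusion([False, True, False]))
    print(or_rule_probabilities([0.5, 0.5], [0.1, 0.1]))
```

The rule raises the global detection probability as sensors are added, at the cost of a higher global false alarm probability, which is why it is usually paired with conservative local thresholds.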
Abstract: This paper is based on a resource-constrained active network project. The constraint on the local resource and the time constraint on the cooperation resource are considered simultaneously, as are the respective benefits of the manager and the cooperation partners. A cooperation planning model based on bilevel multi-objective programming is designed according to the due time and total cost, and an extended CNP based on the permitted range for resource and time requests is presented. Within a scheduling cycle, a larger task set is permitted for cooperation resource and time requests, and the task manager itself may be permitted to bid for tasks. As a result, the optimization space for cooperation planning is enlarged. Consequently, not every task put out to bid is won by an invitee, and the task manager itself takes on some of them. Finally, a genetic algorithm is given, and the validity and feasibility of the model are demonstrated by a case study.
Funding: Supported by the National Natural Science Foundation of China (71673139); the Humanities and Social Science Youth Foundation of the Ministry of Education of China (19YJC790186); and the Project of Philosophy and Social Science in the Colleges and Universities of the Jiangsu Province, China (2018SJA0133).
Abstract: Purchasing agricultural insurance and joining agricultural cooperatives are two prevalent instruments used by farmers in China for dealing with agricultural risks. Data from 443 swine farmers in Jiangsu and Henan provinces of China were collected. Factors affecting the farmers’ decisions to purchase agricultural insurance and to join agricultural cooperatives were assessed. The possibility of simultaneous use of both instruments and the potential correlation between these two decisions were considered as well. Results showed that the farmers’ decisions to use agricultural insurance and cooperatives were positively correlated, indicating that farmers who purchased agricultural insurance, which is mainly used to mitigate production risks, were more likely to join agricultural cooperatives, which are used more to share market risks, and vice versa. Farmers’ knowledge of swine insurance and trust in the local government positively impacted the purchase of agricultural insurance, while education, years involved in swine production, and scale of swine production positively impacted farmers joining agricultural cooperatives.
Abstract: The paper presents our research efforts motivated by the apparent need to combine conventional, preexisting computing functions with novel knowledge-based functions. This has been likened to what occurred in the evolution of primates, where the 'new brain' (the cortex) was added to, layered upon, and given control over the 'old brain' common to the less complex animals.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61425008, 61333004 and 61273054), the Top-Notch Young Talents Program of China, and the Aeronautical Foundation of China (Grant No. 20135851042).
Abstract: As one of the major contributions of biology to competitive decision making, evolutionary game theory provides a useful tool for studying the evolution of cooperation. To achieve the optimal solution for unmanned aerial vehicles (UAVs) that are carrying out a sensing task, this paper presents a Markov decision evolutionary game (MDEG) based learning algorithm. Each individual in the algorithm follows a Markov decision strategy to maximize its payoff against the well-known Tit-for-Tat strategy. Simulation results demonstrate that the MDEG-based approach effectively improves the collective payoff of the roam. The proposed algorithm can obtain not only the best action sequence but also a sub-optimal Markov policy that is independent of the game duration. Furthermore, the paper studies the emergence of cooperation in the evolution of self-regarded UAVs. The results show that the emergence of cooperation should be attributed both to the adaptive ability of the MDEG-based approach and to the perfect balance between revenge and forgiveness in the Tit-for-Tat strategy.
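For readers unfamiliar with the Tit-for-Tat strategy named in this abstract, the sketch below shows it in an iterated prisoner's dilemma: it cooperates on the first move and then repeats the opponent's previous move, which is the "revenge and forgiveness" balance the abstract refers to. The payoff values are the conventional textbook ones and are an assumption, as are all function names; the paper's MDEG learning algorithm is not reproduced here.

```python
from typing import Callable, List, Tuple

# Conventional prisoner's dilemma payoffs (assumed, not from the paper):
# "C" = cooperate, "D" = defect; entries are (payoff_a, payoff_b).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

Strategy = Callable[[List[str]], str]


def tit_for_tat(opponent_history: List[str]) -> str:
    """Cooperate first, then copy the opponent's most recent move."""
    return "C" if not opponent_history else opponent_history[-1]


def always_defect(opponent_history: List[str]) -> str:
    """Baseline opponent that never cooperates."""
    return "D"


def play(strategy_a: Strategy, strategy_b: Strategy,
         rounds: int) -> Tuple[int, int]:
    """Play a repeated game and return the cumulative payoffs."""
    history_a: List[str] = []
    history_b: List[str] = []
    score_a = score_b = 0
    for _ in range(rounds):
        # Each strategy sees only the other player's past moves.
        move_a = strategy_a(history_b)
        move_b = strategy_b(history_a)
        payoff_a, payoff_b = PAYOFFS[(move_a, move_b)]
        score_a += payoff_a
        score_b += payoff_b
        history_a.append(move_a)
        history_b.append(move_b)
    return score_a, score_b


if __name__ == "__main__":
    print(play(tit_for_tat, tit_for_tat, 10))
    print(play(tit_for_tat, always_defect, 5))
```

Against itself, Tit-for-Tat sustains mutual cooperation for the whole game; against a constant defector it is exploited only in the first round and then retaliates.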
Funding: Supported by the National Natural Science Foundation of China, under the project “Study on parallel intelligent optimization simulation with combination of qualitative and quantitative method” (61004089), and by the Graduate Student Innovation Practice Foundation of Beihang University in China (YCSJ-01-201205), under the project “Research of an efficient and intelligent optimization method and application in aircraft shape design”.
Abstract: To achieve the optimal attack outcome in air combat under beyond-visual-range (BVR) conditions, the most crucial task for cooperative multiple target attack (CMTA) is the decision-making (DM) problem of properly assigning friendly fighters to hostile fighters. In this paper, a heuristic quantum genetic algorithm (HQGA) is proposed to solve the DM problem. The originality of our work lies in the following aspects: (1) the HQGA assigns all hostile fighters to every missile rather than to fighters, so that it can encode chromosomes with quantum bits (Q-bits); (2) the relative successful sequence probability (RSSP) is defined, based on which the priority attack vector is constructed; (3) the HQGA can heuristically modify quantum chromosomes according to the modification technique proposed in this paper; (4) last but not least, under some special conditions, the HQGA drops the constraint imposed by other algorithms in order to obtain a better result. At the end of this paper, two examples show that the HQGA has advantages over other algorithms when dealing with the DM problem in the context of CMTA.
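Abstract: Dynamic obstacles have long been a key factor hindering the development of autonomous agent navigation, and avoiding obstacles and clearing obstacles are two effective ways of dealing with them. In recent years, multi-agent dynamic obstacle avoidance has attracted wide attention from researchers, and many strong multi-agent obstacle avoidance algorithms have emerged. However, multi-agent dynamic obstacle clearance has received little attention, and corresponding multi-agent obstacle clearance algorithms are very few. To solve the multi-agent obstacle clearance problem, this paper proposes a Multi-Agent Cooperative Algorithm for Obstacle Clearance Based on Deep Deterministic Policy Gradient and Attention Critic (MACOC). First, the first environment model for multi-agent cooperative obstacle clearance is created, the kinematic models of the agents and the dynamic obstacles are defined, and four simulation environments are constructed according to the numbers of agents and dynamic obstacles. Second, the multi-agent cooperative obstacle clearance process is formulated as a Markov Decision Process (MDP), and the state space, action space, and reward function of the agents are constructed. Finally, a multi-agent cooperative obstacle clearance algorithm based on deep deterministic policy gradient and an attention critic is proposed and compared with classical multi-agent reinforcement learning algorithms in the multi-agent cooperative obstacle clearance simulation environments. Experiments show that, compared with the baseline algorithms, the proposed MACOC algorithm clears obstacles with a higher success rate and at a faster speed, and adapts better to complex environments.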