This paper tackles the formation-containment control problem of fixed-wing unmanned aerial vehicle(UAV)swarm with model uncertainties for dynamic target tracking in three-dimensional space in the faulty case of UAVs’...This paper tackles the formation-containment control problem of fixed-wing unmanned aerial vehicle(UAV)swarm with model uncertainties for dynamic target tracking in three-dimensional space in the faulty case of UAVs’actuator and sensor.The fixed-wing UAV swarm under consideration is organized as a“multi-leader-multi-follower”structure,in which only several leaders can obtain the dynamic target information while others only receive the neighbors’information through the communication network.To simultaneously realize the formation,containment,and dynamic target tracking,a two-layer control framework is adopted to decouple the problem into two subproblems:reference trajectory generation and trajectory tracking.In the upper layer,a distributed finite-time estimator(DFTE)is proposed to generate each UAV’s reference trajectory in accordance with the control objective.Subsequently,a distributed composite robust fault-tolerant trajectory tracking controller is developed in the lower layer,where a novel adaptive extended super-twisting(AESTW)algorithm with a finite-time extended state observer(FTESO)is involved in solving the robust trajectory tracking control problem under model uncertainties,actuator,and sensor faults.The proposed controller simultaneously guarantees rapidness and enhances the system’s robustness with fewer chattering effects.Finally,corresponding simulations are carried out to demonstrate the effectiveness and competitiveness of the proposed two-layer fault-tolerant cooperative control scheme.展开更多
It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mecha...It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mechanism is presented for the realization of joint spectrum and power(JSAP)resource allocation based on deep Q-learning networks(DQNs).Each UAV to UAV(U2U)link is regarded as an agent that is capable of identifying the optimal spectrum and power to communicate with one another.The convolutional neural network,target network,and experience replay are adopted while training.The findings of the simulation indicate that the proposed method has the potential to improve both communication capacity and probability of successful data transmission when compared with random centralized assignment and multichannel access methods.展开更多
The source location based on the hybrid time difference of arrival(TDOA)/frequency difference of arrival(FDOA) is a basic problem in wireless sensor networks, and the layout of sensors in the hybrid TDOA/FDOA position...The source location based on the hybrid time difference of arrival(TDOA)/frequency difference of arrival(FDOA) is a basic problem in wireless sensor networks, and the layout of sensors in the hybrid TDOA/FDOA positioning will greatly affect the accuracy of positioning. Using unmanned aerial vehicle(UAV) as base stations, by optimizing the trajectory of the UAV swarm, an optimal positioning configuration is formed to improve the accuracy of the target position and velocity estimation. In this paper, a hybrid TDOA/FDOA positioning model is first established, and the positioning accuracy of the hybrid TDOA/FDOA under different positioning configurations and different measurement errors is simulated by the geometric dilution of precision(GDOP) factor. Second, the Cramer-Rao lower bound(CRLB) matrix of hybrid TDOA/FDOA location under different moving states of the target is derived theoretically, the objective function of the track optimization is obtained, and the track of the UAV swarm is optimized in real time. The simulation results show that the track optimization effectively improves the accuracy of the target position and velocity estimation.展开更多
This paper presents a path planning approach for rotary unmanned aerial vehicles(R-UAVs)in a known static rough terrain environment.This approach aims to find collision-free and feasible paths with minimum altitude,le...This paper presents a path planning approach for rotary unmanned aerial vehicles(R-UAVs)in a known static rough terrain environment.This approach aims to find collision-free and feasible paths with minimum altitude,length and angle variable rate.First,a three-dimensional(3D)modeling method is proposed to reduce the computation burden of the dynamic models of R-UAVs.Considering the length,height and tuning angle of a path,the path planning of R-UAVs is described as a tri-objective optimization problem.Then,an improved multi-objective particle swarm optimization algorithm is developed.To render the algorithm more effective in dealing with this problem,a vibration function is introduced into the collided solutions to improve the algorithm efficiency.Meanwhile,the selection of the global best position is taken into account by the reference point method.Finally,the experimental environment is built with the help of the Google map and the 3D terrain generator World Machine.Experimental results under two different rough terrains from Guilin and Lanzhou of China demonstrate the capabilities of the proposed algorithm in finding Pareto optimal paths.展开更多
This paper studies a special defense game using unmanned aerial vehicle(UAV)swarm against a fast intruder.The fast intruder applies an offensive strategy based on the artificial potential field method and Apollonius c...This paper studies a special defense game using unmanned aerial vehicle(UAV)swarm against a fast intruder.The fast intruder applies an offensive strategy based on the artificial potential field method and Apollonius circle to scout a certain destination.As defenders,the UAVs are arranged into three layers:the forward layer,the midfield layer and the back layer.The co-defense mechanism,including the role derivation method of UAV swarm and a guidance law based on the co-defense front point,is introduced for UAV swarm to co-detect the intruder.Besides,five formations are designed for comparative analysis when ten UAVs are applied.Through Monte Carlo experiments and ablation experiment,the effectiveness of the proposed co-defense method has been verified.展开更多
The unmanned aerial vehicle(UAV)swarm technology is one of the research hotspots in recent years.With the continuous improvement of autonomous intelligence of UAV,the swarm technology of UAV will become one of the mai...The unmanned aerial vehicle(UAV)swarm technology is one of the research hotspots in recent years.With the continuous improvement of autonomous intelligence of UAV,the swarm technology of UAV will become one of the main trends of UAV development in the future.This paper studies the behavior decision-making process of UAV swarm rendezvous task based on the double deep Q network(DDQN)algorithm.We design a guided reward function to effectively solve the problem of algorithm convergence caused by the sparse return problem in deep reinforcement learning(DRL)for the long period task.We also propose the concept of temporary storage area,optimizing the memory playback unit of the traditional DDQN algorithm,improving the convergence speed of the algorithm,and speeding up the training process of the algorithm.Different from traditional task environment,this paper establishes a continuous state-space task environment model to improve the authentication process of UAV task environment.Based on the DDQN algorithm,the collaborative tasks of UAV swarm in different task scenarios are trained.The experimental results validate that the DDQN algorithm is efficient in terms of training UAV swarm to complete the given collaborative tasks while meeting the requirements of UAV swarm for centralization and autonomy,and improving the intelligence of UAV swarm collaborative task execution.The simulation results show that after training,the proposed UAV swarm can carry out the rendezvous task well,and the success rate of the mission reaches 90%.展开更多
Cooperative search-attack is an important application of unmanned aerial vehicle(UAV)swarm in military field.The coupling between path planning and task allocation,the heterogeneity of UAVs,and the dynamic nature of t...Cooperative search-attack is an important application of unmanned aerial vehicle(UAV)swarm in military field.The coupling between path planning and task allocation,the heterogeneity of UAVs,and the dynamic nature of task environment greatly increase the complexity and difficulty of the UAV swarm cooperative search-attack mission planning problem.Inspired by the collaborative hunting behavior of wolf pack,a distributed selforganizing method for UAV swarm search-attack mission planning is proposed.First,to solve the multi-target search problem in unknown environments,a wolf scouting behavior-inspired cooperative search algorithm for UAV swarm is designed.Second,a distributed self-organizing task allocation algorithm for UAV swarm cooperative attacking of targets is proposed by analyzing the flexible labor division behavior of wolves.By abstracting the UAV as a simple artificial wolf agent,the flexible motion planning and group task coordinating for UAV swarm can be realized by self-organizing.The effectiveness of the proposed method is verified by a set of simulation experiments,the stability and scalability are evaluated,and the integrated solution for the coupled path planning and task allocation problems for the UAV swarm cooperative search-attack task can be well performed.展开更多
An ant colony optimization with artificial potential field(ACOAPF)algorithm is proposed to solve the cooperative search mission planning problem of unmanned aerial vehicle(UAV)swarm.This algorithm adopts a distributed...An ant colony optimization with artificial potential field(ACOAPF)algorithm is proposed to solve the cooperative search mission planning problem of unmanned aerial vehicle(UAV)swarm.This algorithm adopts a distributed architecture where each UAV is considered as an ant and makes decision autonomously.At each decision step,the ants choose the next gird according to the state transition rule and update its own artificial potential field and pheromone map based on the current search results.Through iterations of this process,the cooperative search of UAV swarm for mission area is realized.The state transition rule is divided into two types.If the artificial potential force is larger than a threshold,the deterministic transition rule is adopted,otherwise a heuristic transition rule is used.The deterministic transition rule can ensure UAVs to avoid the threat or approach the target quickly.And the heuristics transition rule considering the pheromone and heuristic information ensures the continuous search of area with the goal of covering more unknown area and finding more targets.Finally,simulations are carried out to verify the effectiveness of the proposed ACOAPF algorithm for cooperative search mission of UAV swarm.展开更多
A decentralized task planning algorithm is proposed for heterogeneous unmanned aerial vehicle(UAV)swarm with different capabilities.The algorithm extends the consensus-based bundle algorithm(CBBA)to account for a more...A decentralized task planning algorithm is proposed for heterogeneous unmanned aerial vehicle(UAV)swarm with different capabilities.The algorithm extends the consensus-based bundle algorithm(CBBA)to account for a more realistic and complex environment.The extension of the algorithm includes handling multi-agent task that requires multiple UAVs collaboratively completed in coordination,and consideration of avoiding obstacles in task scenarios.We propose a new consensus algorithm to solve the multi-agent task allocation problem and use the Dubins algorithm to design feasible paths for UAVs to avoid obstacles and consider motion constraints.Experimental results show that the CBBA extension algorithm can converge to a conflict-free and feasible solution for multi-agent task planning problems.展开更多
To solve the problem of time difference of arrival(TDOA)positioning and tracking of targets by the unmanned aerial vehicles(UAV)swarm in future air combat,this paper adopts the TDOA positioning method and uses time di...To solve the problem of time difference of arrival(TDOA)positioning and tracking of targets by the unmanned aerial vehicles(UAV)swarm in future air combat,this paper adopts the TDOA positioning method and uses time difference sensors of the UAV swarm to locate target radiation sources.Firstly,a TDOA model for the target is set up for the UAV swarm under the condition that the error variance varies with the received signal-to-noise ratio.The accuracy of the positioning error is analyzed by geometric dilution of precision(GDOP).The D-optimality criterion of the positioning model is theoretically derived.The target is positioned and settled,and the maximum value of the Fisher information matrix determinant is used as the optimization objective function to optimize the track of the UAV in real time.Simulation results show that the track optimization improves the positioning accuracy and stability of the UAV swarm to the target.展开更多
Projects on unmanned aerial vehicle(UAV) swarms have been initiated in a big way in the last few years, especially from 2015 to 2016. As a result, the number of related works on UAV swarms has been on the rise, with t...Projects on unmanned aerial vehicle(UAV) swarms have been initiated in a big way in the last few years, especially from 2015 to 2016. As a result, the number of related works on UAV swarms has been on the rise, with the rate of growth dramatically accelerating since 2017. This research conducts a bibliometric analysis of robotics swarms and UAV swarms to answer the following questions:(i) Disciplines mentioned in the UAV swarms research.(ii) The future development trends and hotspots in the UAV swarms research.(iii) Tracking related outcomes in the UAV swarms research.展开更多
The deep deterministic policy gradient(DDPG)algo-rithm is an off-policy method that combines two mainstream reinforcement learning methods based on value iteration and policy iteration.Using the DDPG algorithm,agents ...The deep deterministic policy gradient(DDPG)algo-rithm is an off-policy method that combines two mainstream reinforcement learning methods based on value iteration and policy iteration.Using the DDPG algorithm,agents can explore and summarize the environment to achieve autonomous deci-sions in the continuous state space and action space.In this paper,a cooperative defense with DDPG via swarms of unmanned aerial vehicle(UAV)is developed and validated,which has shown promising practical value in the effect of defending.We solve the sparse rewards problem of reinforcement learning pair in a long-term task by building the reward function of UAV swarms and optimizing the learning process of artificial neural network based on the DDPG algorithm to reduce the vibration in the learning process.The experimental results show that the DDPG algorithm can guide the UAVs swarm to perform the defense task efficiently,meeting the requirements of a UAV swarm for non-centralization,autonomy,and promoting the intelligent development of UAVs swarm as well as the decision-making process.展开更多
电离层中释放的金属蒸气产生人工等离子体云团,其可显著改变无线电波传播。本文利用几何绕射理论(geometrical theory of diffraction, GTD)和有限元法(finite element method, FEM)相结合的方法,给出了经由天线、人工等离子云团和无人...电离层中释放的金属蒸气产生人工等离子体云团,其可显著改变无线电波传播。本文利用几何绕射理论(geometrical theory of diffraction, GTD)和有限元法(finite element method, FEM)相结合的方法,给出了经由天线、人工等离子云团和无人机(unmanned aerial vehicle, UAV)群组成的传播链路中信号强度计算方法。利用30~70 MHz甚高频(very high frequency, VHF)信号研究人工等离子体云团与UAV群的复合散射特性,得出如下结论:接收功率随着信号频率增加呈下降趋势;当机群由N架UAV构成时,阵因子迭加使机群雷达散射截面(radar cross section, RCS)出现一定的起伏,同相迭加时,接收功率可比单个UAV高约20lg N dB;利用人工等离子体云团散射可实现VHF频段用于对米级尺度RCS目标进行超视距探测,有助于解决紧急情况下电离层扰动对高频探测的不利影响。展开更多
基金the National Natural Science Foundation of China(61933010)the Natural Science Basic Research Plan in Shaanxi Province of China(2023-JC-QN-0733).
文摘This paper tackles the formation-containment control problem of fixed-wing unmanned aerial vehicle(UAV)swarm with model uncertainties for dynamic target tracking in three-dimensional space in the faulty case of UAVs’actuator and sensor.The fixed-wing UAV swarm under consideration is organized as a“multi-leader-multi-follower”structure,in which only several leaders can obtain the dynamic target information while others only receive the neighbors’information through the communication network.To simultaneously realize the formation,containment,and dynamic target tracking,a two-layer control framework is adopted to decouple the problem into two subproblems:reference trajectory generation and trajectory tracking.In the upper layer,a distributed finite-time estimator(DFTE)is proposed to generate each UAV’s reference trajectory in accordance with the control objective.Subsequently,a distributed composite robust fault-tolerant trajectory tracking controller is developed in the lower layer,where a novel adaptive extended super-twisting(AESTW)algorithm with a finite-time extended state observer(FTESO)is involved in solving the robust trajectory tracking control problem under model uncertainties,actuator,and sensor faults.The proposed controller simultaneously guarantees rapidness and enhances the system’s robustness with fewer chattering effects.Finally,corresponding simulations are carried out to demonstrate the effectiveness and competitiveness of the proposed two-layer fault-tolerant cooperative control scheme.
基金supported by the National Natural Science Foundation of China(62031017,61971221).
文摘It is essential to maximize capacity while satisfying the transmission time delay of unmanned aerial vehicle(UAV)swarm communication system.In order to address this challenge,a dynamic decentralized optimization mechanism is presented for the realization of joint spectrum and power(JSAP)resource allocation based on deep Q-learning networks(DQNs).Each UAV to UAV(U2U)link is regarded as an agent that is capable of identifying the optimal spectrum and power to communicate with one another.The convolutional neural network,target network,and experience replay are adopted while training.The findings of the simulation indicate that the proposed method has the potential to improve both communication capacity and probability of successful data transmission when compared with random centralized assignment and multichannel access methods.
基金supported by the National Natural Science Foundation of China (61502522)Equipment Pre-Research Field Fund(JZX7Y20190253036101)+1 种基金Equipment Pre-Research Ministry of Education Joint Fund (6141A02033703)Hubei Provincial Natural Scie nce Foundation (2019CFC897)。
文摘The source location based on the hybrid time difference of arrival(TDOA)/frequency difference of arrival(FDOA) is a basic problem in wireless sensor networks, and the layout of sensors in the hybrid TDOA/FDOA positioning will greatly affect the accuracy of positioning. Using unmanned aerial vehicle(UAV) as base stations, by optimizing the trajectory of the UAV swarm, an optimal positioning configuration is formed to improve the accuracy of the target position and velocity estimation. In this paper, a hybrid TDOA/FDOA positioning model is first established, and the positioning accuracy of the hybrid TDOA/FDOA under different positioning configurations and different measurement errors is simulated by the geometric dilution of precision(GDOP) factor. Second, the Cramer-Rao lower bound(CRLB) matrix of hybrid TDOA/FDOA location under different moving states of the target is derived theoretically, the objective function of the track optimization is obtained, and the track of the UAV swarm is optimized in real time. The simulation results show that the track optimization effectively improves the accuracy of the target position and velocity estimation.
基金supported by the National Natural Science Foundation of China(6167321461673217+2 种基金61673219)the Natural Science Foundation of the Jiangsu Higher Education Institutions of China(18KJB120011)the Postgraduate Research and Practice Innovation Program of Jiangsu Province(KYCX19_0299)
文摘This paper presents a path planning approach for rotary unmanned aerial vehicles(R-UAVs)in a known static rough terrain environment.This approach aims to find collision-free and feasible paths with minimum altitude,length and angle variable rate.First,a three-dimensional(3D)modeling method is proposed to reduce the computation burden of the dynamic models of R-UAVs.Considering the length,height and tuning angle of a path,the path planning of R-UAVs is described as a tri-objective optimization problem.Then,an improved multi-objective particle swarm optimization algorithm is developed.To render the algorithm more effective in dealing with this problem,a vibration function is introduced into the collided solutions to improve the algorithm efficiency.Meanwhile,the selection of the global best position is taken into account by the reference point method.Finally,the experimental environment is built with the help of the Google map and the 3D terrain generator World Machine.Experimental results under two different rough terrains from Guilin and Lanzhou of China demonstrate the capabilities of the proposed algorithm in finding Pareto optimal paths.
基金the Aeronautical Science Foundation of China(2020Z023053001).
文摘This paper studies a special defense game using unmanned aerial vehicle(UAV)swarm against a fast intruder.The fast intruder applies an offensive strategy based on the artificial potential field method and Apollonius circle to scout a certain destination.As defenders,the UAVs are arranged into three layers:the forward layer,the midfield layer and the back layer.The co-defense mechanism,including the role derivation method of UAV swarm and a guidance law based on the co-defense front point,is introduced for UAV swarm to co-detect the intruder.Besides,five formations are designed for comparative analysis when ten UAVs are applied.Through Monte Carlo experiments and ablation experiment,the effectiveness of the proposed co-defense method has been verified.
基金supported by the Aeronautical Science Foundation(2017ZC53033).
文摘The unmanned aerial vehicle(UAV)swarm technology is one of the research hotspots in recent years.With the continuous improvement of autonomous intelligence of UAV,the swarm technology of UAV will become one of the main trends of UAV development in the future.This paper studies the behavior decision-making process of UAV swarm rendezvous task based on the double deep Q network(DDQN)algorithm.We design a guided reward function to effectively solve the problem of algorithm convergence caused by the sparse return problem in deep reinforcement learning(DRL)for the long period task.We also propose the concept of temporary storage area,optimizing the memory playback unit of the traditional DDQN algorithm,improving the convergence speed of the algorithm,and speeding up the training process of the algorithm.Different from traditional task environment,this paper establishes a continuous state-space task environment model to improve the authentication process of UAV task environment.Based on the DDQN algorithm,the collaborative tasks of UAV swarm in different task scenarios are trained.The experimental results validate that the DDQN algorithm is efficient in terms of training UAV swarm to complete the given collaborative tasks while meeting the requirements of UAV swarm for centralization and autonomy,and improving the intelligence of UAV swarm collaborative task execution.The simulation results show that after training,the proposed UAV swarm can carry out the rendezvous task well,and the success rate of the mission reaches 90%.
基金supported by the National Natural Science Foundation of China(61502534)the Shaanxi Provincial Natural Science Foundation(2020JQ-493)+2 种基金the Integrative Equipment Research Project of Armed Police Force(WJ20211A030018)the Military Science Project of the National Social Science Fund(WJ2019-SKJJ-C-092)the Theoretical Research Foundation of Armed Police Engineering University(WJY202148)。
文摘Cooperative search-attack is an important application of unmanned aerial vehicle(UAV)swarm in military field.The coupling between path planning and task allocation,the heterogeneity of UAVs,and the dynamic nature of task environment greatly increase the complexity and difficulty of the UAV swarm cooperative search-attack mission planning problem.Inspired by the collaborative hunting behavior of wolf pack,a distributed selforganizing method for UAV swarm search-attack mission planning is proposed.First,to solve the multi-target search problem in unknown environments,a wolf scouting behavior-inspired cooperative search algorithm for UAV swarm is designed.Second,a distributed self-organizing task allocation algorithm for UAV swarm cooperative attacking of targets is proposed by analyzing the flexible labor division behavior of wolves.By abstracting the UAV as a simple artificial wolf agent,the flexible motion planning and group task coordinating for UAV swarm can be realized by self-organizing.The effectiveness of the proposed method is verified by a set of simulation experiments,the stability and scalability are evaluated,and the integrated solution for the coupled path planning and task allocation problems for the UAV swarm cooperative search-attack task can be well performed.
基金supported by the National Natural Science Foundation of China (Nos.61973158, 61673209)the Aeronautical Science Foundation (No.2016ZA52009)
文摘An ant colony optimization with artificial potential field(ACOAPF)algorithm is proposed to solve the cooperative search mission planning problem of unmanned aerial vehicle(UAV)swarm.This algorithm adopts a distributed architecture where each UAV is considered as an ant and makes decision autonomously.At each decision step,the ants choose the next gird according to the state transition rule and update its own artificial potential field and pheromone map based on the current search results.Through iterations of this process,the cooperative search of UAV swarm for mission area is realized.The state transition rule is divided into two types.If the artificial potential force is larger than a threshold,the deterministic transition rule is adopted,otherwise a heuristic transition rule is used.The deterministic transition rule can ensure UAVs to avoid the threat or approach the target quickly.And the heuristics transition rule considering the pheromone and heuristic information ensures the continuous search of area with the goal of covering more unknown area and finding more targets.Finally,simulations are carried out to verify the effectiveness of the proposed ACOAPF algorithm for cooperative search mission of UAV swarm.
文摘A decentralized task planning algorithm is proposed for heterogeneous unmanned aerial vehicle(UAV)swarm with different capabilities.The algorithm extends the consensus-based bundle algorithm(CBBA)to account for a more realistic and complex environment.The extension of the algorithm includes handling multi-agent task that requires multiple UAVs collaboratively completed in coordination,and consideration of avoiding obstacles in task scenarios.We propose a new consensus algorithm to solve the multi-agent task allocation problem and use the Dubins algorithm to design feasible paths for UAVs to avoid obstacles and consider motion constraints.Experimental results show that the CBBA extension algorithm can converge to a conflict-free and feasible solution for multi-agent task planning problems.
基金This work was supported by the National Natural Science Foundation of China(61502522)the Equipment Pre-Research Field Fund(JZX7Y20190253036101)+1 种基金the Equipment Pre-Research Ministry of Education Joint Fund(6141A02033703)the Hubei Provincial Natural Science Foundation(2019CFC897).
文摘To solve the problem of time difference of arrival(TDOA)positioning and tracking of targets by the unmanned aerial vehicles(UAV)swarm in future air combat,this paper adopts the TDOA positioning method and uses time difference sensors of the UAV swarm to locate target radiation sources.Firstly,a TDOA model for the target is set up for the UAV swarm under the condition that the error variance varies with the received signal-to-noise ratio.The accuracy of the positioning error is analyzed by geometric dilution of precision(GDOP).The D-optimality criterion of the positioning model is theoretically derived.The target is positioned and settled,and the maximum value of the Fisher information matrix determinant is used as the optimization objective function to optimize the track of the UAV in real time.Simulation results show that the track optimization improves the positioning accuracy and stability of the UAV swarm to the target.
文摘Projects on unmanned aerial vehicle(UAV) swarms have been initiated in a big way in the last few years, especially from 2015 to 2016. As a result, the number of related works on UAV swarms has been on the rise, with the rate of growth dramatically accelerating since 2017. This research conducts a bibliometric analysis of robotics swarms and UAV swarms to answer the following questions:(i) Disciplines mentioned in the UAV swarms research.(ii) The future development trends and hotspots in the UAV swarms research.(iii) Tracking related outcomes in the UAV swarms research.
基金supported by the Key Research and Development Program of Shaanxi(2022GY-089)the Natural Science Basic Research Program of Shaanxi(2022JQ-593).
文摘The deep deterministic policy gradient(DDPG)algo-rithm is an off-policy method that combines two mainstream reinforcement learning methods based on value iteration and policy iteration.Using the DDPG algorithm,agents can explore and summarize the environment to achieve autonomous deci-sions in the continuous state space and action space.In this paper,a cooperative defense with DDPG via swarms of unmanned aerial vehicle(UAV)is developed and validated,which has shown promising practical value in the effect of defending.We solve the sparse rewards problem of reinforcement learning pair in a long-term task by building the reward function of UAV swarms and optimizing the learning process of artificial neural network based on the DDPG algorithm to reduce the vibration in the learning process.The experimental results show that the DDPG algorithm can guide the UAVs swarm to perform the defense task efficiently,meeting the requirements of a UAV swarm for non-centralization,autonomy,and promoting the intelligent development of UAVs swarm as well as the decision-making process.
文摘电离层中释放的金属蒸气产生人工等离子体云团,其可显著改变无线电波传播。本文利用几何绕射理论(geometrical theory of diffraction, GTD)和有限元法(finite element method, FEM)相结合的方法,给出了经由天线、人工等离子云团和无人机(unmanned aerial vehicle, UAV)群组成的传播链路中信号强度计算方法。利用30~70 MHz甚高频(very high frequency, VHF)信号研究人工等离子体云团与UAV群的复合散射特性,得出如下结论:接收功率随着信号频率增加呈下降趋势;当机群由N架UAV构成时,阵因子迭加使机群雷达散射截面(radar cross section, RCS)出现一定的起伏,同相迭加时,接收功率可比单个UAV高约20lg N dB;利用人工等离子体云团散射可实现VHF频段用于对米级尺度RCS目标进行超视距探测,有助于解决紧急情况下电离层扰动对高频探测的不利影响。