Funding: Supported by the National Natural Science Foundation of China (11672093).
Abstract: A conflict among three players (an attacker, a defender, and a target, each with bounded control) is studied using differential game theory, with the target and the defender using optimal pursuit strategies. The approach takes the miss distance as the outcome of the conflict. Different optimal guidance laws are investigated, and the conditions under which the attacker can accomplish its attacking task are analyzed. Under some conditions, the attacker cannot intercept the target using only a one-to-one optimal pursuit guidance law; a guidance law that lets the attacker reach a critical safe value is therefore investigated. Specifically, the guidance law is divided into two parts. Before the engagement time between the defender and the attacker, the attacker uses the derived guidance law to guarantee that its evasion distance from the defender is safe while the zero-effort-miss (ZEM) distance between the attacker and the target is minimized. After that engagement time, the attacker uses the optimal one-to-one guidance law to complete the pursuit. The advantages and limiting conditions of the derived guidance laws are also investigated through nonlinear simulations.
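As a rough illustration of the zero-effort-miss quantity mentioned in the abstract, the sketch below uses the textbook linearized planar form, in which ZEM is the lateral separation projected forward over the time-to-go with no further maneuvers. The function names, the navigation gain, and the acceleration bound are assumptions made for illustration; the paper's own three-player guidance laws are not reproduced here.

```python
def zero_effort_miss(rel_pos, rel_vel, t_go):
    """Lateral miss if neither side maneuvers for the remaining t_go seconds.

    rel_pos and rel_vel are the lateral separation and its rate, measured
    normal to the initial line of sight (linearized planar assumption).
    """
    return rel_pos + rel_vel * t_go


def bounded_command(zem, t_go, nav_gain=3.0, a_max=50.0):
    """ZEM-based acceleration command, clipped to the control bound.

    Uses the classic form a = N * ZEM / t_go**2; nav_gain and a_max are
    hypothetical values, not taken from the paper.
    """
    if t_go <= 0.0:
        return 0.0
    accel = nav_gain * zem / (t_go * t_go)
    return max(-a_max, min(a_max, accel))
```

The clipping step is what makes the control "bounded": when the required acceleration exceeds the limit, the ZEM can no longer be driven to zero in the remaining time, which is the kind of feasibility condition the abstract refers to.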
Funding: This work is supported by the Major Scientific and Technological Special Project of Guizhou Province (20183001); Research on the education mode for complicate skill students in new media with cross specialty integration (22150117092); and the Open Foundation of Guizhou Provincial Key Laboratory of Public Big Data (2018BDKFJJ014, 2018BDKFJJ019, 2018BDKFJJ022).
Abstract: In recent years, as the Internet of Vehicles (IoV) has become increasingly intelligent, privacy leakage in the IoV has become a prominent problem, and privacy protection for the IoV has become a focus of public attention. This paper analyzes the advantages and disadvantages of existing location privacy protection architectures and algorithms, proposes a privacy protection architecture based on an untrusted data collection server, and designs a vehicle location acquisition algorithm based on local differential privacy and a game model. The algorithm first divides the road network space into a grid. A dynamic game model is then introduced between the user's location privacy protection model and the attacker's location semantic inference model, minimizing the possibility of exposing the regional semantic privacy of the k-location set while maximizing service availability. On this basis, a statistical method is designed that satisfies local differential privacy for k-location sets and obtains an unbiased estimate of traffic density in different regions. Finally, the algorithm is verified on a data set of mobile vehicles in Shanghai. The experimental results show that the algorithm guarantees the user's location privacy and location semantic privacy while satisfying service quality requirements, and provides better privacy protection and service for IoV users.
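To make the idea of locally private reporting with unbiased density estimation concrete, here is a minimal sketch using k-ary (generalized) randomized response over grid cells. This is a standard local differential privacy mechanism substituted for illustration; it is not claimed to be the paper's algorithm, and the function names and parameters are assumptions.

```python
import math
import random
from collections import Counter


def grr_perturb(cell, k, epsilon, rng):
    """Report the true cell with probability p, otherwise a uniformly chosen other cell."""
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return cell
    other = rng.randrange(k - 1)  # index among the k - 1 remaining cells
    return other if other < cell else other + 1


def grr_estimate(reports, k, epsilon):
    """Unbiased estimate of how many users are truly in each of the k cells."""
    n = len(reports)
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    q = 1.0 / (math.exp(epsilon) + k - 1)
    counts = Counter(reports)
    # E[count_c] = n_c * p + (n - n_c) * q, so invert that linear relation.
    return [(counts.get(c, 0) - n * q) / (p - q) for c in range(k)]
```

Each vehicle perturbs its own cell before sending it, so the collection server never sees the raw location and does not need to be trusted. A smaller epsilon gives stronger privacy but noisier density estimates.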
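Abstract: For the interception problem in a pursuit-evasion game between a spacecraft and a non-cooperative target, formulated as a survival-type differential game, pursuit-evasion strategies are studied using reinforcement learning and an adaptive-augmented random search (A-ARS) algorithm is proposed. To address the sparse-reward difficulty in sequential decision-making, an exploration method based on perturbations in the policy parameter space is designed, which speeds up policy convergence. To counter premature convergence to local optima, a novelty function is designed to guide policy updates, which can improve data efficiency. Numerical simulations and comparisons with augmented random search (ARS), proximal policy optimization (PPO), and deep deterministic policy gradient (DDPG) verify the effectiveness and advantages of the method.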
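For orientation, the sketch below shows one update step of basic augmented random search (ARS), the baseline the abstract compares against. It perturbs the policy parameters directly, keeps the best-performing directions, and steps along a reward-weighted average of them. The adaptive and novelty-driven parts of A-ARS are not described in enough detail to reproduce, so they are omitted; the function name, argument names, and default values are assumptions.

```python
import random
import statistics


def ars_step(theta, reward_fn, rng, n_dirs=8, top_b=4, step_size=0.05, noise=0.1):
    """One basic ARS update of the parameter vector theta (a list of floats)."""
    dim = len(theta)
    rollouts = []
    for _ in range(n_dirs):
        # Perturb in parameter space and evaluate both signs of the direction.
        delta = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        r_plus = reward_fn([t + noise * d for t, d in zip(theta, delta)])
        r_minus = reward_fn([t - noise * d for t, d in zip(theta, delta)])
        rollouts.append((r_plus, r_minus, delta))
    # Keep the top_b directions ranked by their better reward.
    rollouts.sort(key=lambda r: max(r[0], r[1]), reverse=True)
    best = rollouts[:top_b]
    # Normalize the step by the spread of the rewards actually used.
    used_rewards = [r for r_plus, r_minus, _ in best for r in (r_plus, r_minus)]
    sigma = statistics.pstdev(used_rewards) or 1.0
    new_theta = list(theta)
    for r_plus, r_minus, delta in best:
        scale = step_size * (r_plus - r_minus) / (top_b * sigma)
        for i in range(dim):
            new_theta[i] += scale * delta[i]
    return new_theta
```

In a pursuit-evasion setting, `reward_fn` would run a full simulated engagement with the given policy parameters and return its total reward; here it can be any function from a parameter list to a float.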
Abstract: In a global environment that pursues resource regeneration and environmental protection, a growing volume of waste clothing needs to be dealt with. To make full use of waste clothing and save land and soil resources, the idea of recycling and remanufacturing waste clothing is put forward. Under this idea, a pricing game model between traditional and remanufactured clothing is established based on Stackelberg differential game theory. The model considers differences in consumers' willingness to pay as well as government subsidies. The government's optimal subsidy is obtained, which protects not only the interests of manufacturers but also environmental reputation and maximum social benefits. The study helps advance plans for recycling and remanufacturing waste clothing. It quantifies more precisely indices such as the government's subsidy and the benefits to manufacturers and society. Governments and manufacturers can refer to it when drawing up detailed cooperation plans.
Funding: National Natural Science Foundation of China (60874040)
Abstract: For intercepting modern highly maneuverable targets, a novel adaptive weighted differential game guidance law based on the game theory of mixed strategies is proposed. It combines two guidance laws derived from the perfect and imperfect information patterns, respectively. The weights vary according to the estimation error of the target's acceleration: when the estimation error is small, the guidance law is generated by directly using the estimate of the target's acceleration, and when the estimation error is large, a differential game guidance law with an adaptive penalty coefficient is used. The adaptive penalty coefficients are not constant and can be adjusted according to the current target maneuverability. The superior homing performance of the new guidance law is verified by computer simulations.
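The weighting idea in this abstract can be pictured as a convex blend of two acceleration commands whose weight depends on the estimation error. The sketch below is only an illustration: the Gaussian-shaped weight, the `error_scale` parameter, and the function names are assumptions, since the abstract does not give the paper's actual weighting function or guidance laws.

```python
import math


def blend_weight(est_error, error_scale=1.0):
    """Weight on the estimate-based law: near 1 for small error, near 0 for large error.

    The exponential shape and error_scale are hypothetical choices.
    """
    return math.exp(-(est_error / error_scale) ** 2)


def blended_command(u_estimate_based, u_differential_game, est_error, error_scale=1.0):
    """Convex combination of the two guidance commands."""
    w = blend_weight(abs(est_error), error_scale)
    return w * u_estimate_based + (1.0 - w) * u_differential_game
```

With a small estimation error the command follows the law that uses the estimated target acceleration directly; as the error grows, the command shifts smoothly toward the differential game law.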