This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the l...This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the learning process and adapt their policies sequentially.Our method removes the dependence of admissible initial policies,which is one of the main drawbacks of the PI-based frameworks.Furthermore,this algorithm enables the players to adapt their control policies without full knowledge of others’ system parameters or control laws.The efficacy of our method is illustrated by three examples.展开更多
Cooperative autonomous air combat of multiple unmanned aerial vehicles(UAVs)is one of the main combat modes in future air warfare,which becomes even more complicated with highly changeable situation and uncertain info...Cooperative autonomous air combat of multiple unmanned aerial vehicles(UAVs)is one of the main combat modes in future air warfare,which becomes even more complicated with highly changeable situation and uncertain information of the opponents.As such,this paper presents a cooperative decision-making method based on incomplete information dynamic game to generate maneuver strategies for multiple UAVs in air combat.Firstly,a cooperative situation assessment model is presented to measure the overall combat situation.Secondly,an incomplete information dynamic game model is proposed to model the dynamic process of air combat,and a dynamic Bayesian network is designed to infer the tactical intention of the opponent.Then a reinforcement learning framework based on multiagent deep deterministic policy gradient is established to obtain the perfect Bayes-Nash equilibrium solution of the air combat game model.Finally,a series of simulations are conducted to verify the effectiveness of the proposed method,and the simulation results show effective synergies and cooperative tactics.展开更多
This paper is concerned with Hepato-Cellular Carcinoma (HCC) patients treated naturopathic agents. Patients treated with ≥4 agents survived significantly longer than patients treated with ≤3 agents. The great effect...This paper is concerned with Hepato-Cellular Carcinoma (HCC) patients treated naturopathic agents. Patients treated with ≥4 agents survived significantly longer than patients treated with ≤3 agents. The great effect is seen in patients treated with at least 4 agents that include Cordyceps sinensis. This greater certainty of patient survival without toxic side effects is significant benefit comparing with the conventional therapy. Treatment of HCC with a regimen of ≥4 agents prepared from natural products is associated with greater certainty of patient survival in a substantial portion of patients. The information dynamic model for certainty of patient survival is derived based on fluid mechanics, where a series of approximate solutions of the flow between two parallel flat walls, one of which is at rest, the other is suddenly accelerated from the rest to a constant velocity are used. The kinetic energy of certainty of patient survival decreases with increasing time, while the potential energy increases with increasing time. Total mechanical energy of patients treated with 4 or more agents is smaller than that treated with 3 or fewer agents. The kinetic energy (potential energy) of patients treated with 4 or more agents decreases (increases) more slower than the kinetic energy (potential energy) of patients treated with 3 or fewer agents.展开更多
Nowadays, security defence of network uses the game theory, which mostly applies complete information game model or even the static game model. To get closer to the actual network and defend actively, we propose a net...Nowadays, security defence of network uses the game theory, which mostly applies complete information game model or even the static game model. To get closer to the actual network and defend actively, we propose a network attack-defence game model by using signalling game, which is modelled in the way of dynamic and incomplete information. We improve the traditional attack-defence strategies quantization method to meet the needs of the network signalling game model. Moreover, we give the calculation of the game equilibrium and analyse the optimal defence scheme. Finally, we analyse and verify effectiveness of the model and method through a simulation experiment.展开更多
Given the fragmentation of public opinion dissemination and the lag of network users’cognition,the paper analyzes public opinion dissemination with incomplete information,which can provide reference for us to control...Given the fragmentation of public opinion dissemination and the lag of network users’cognition,the paper analyzes public opinion dissemination with incomplete information,which can provide reference for us to control and guide the spread of public opinion.Based on the derivative and secondary radiation of public opinion dissemination with incomplete information,the Susceptible-Susceptible-Infected-Recovered-Recovered-Infected(SSIRR-I)model is proposed.Given the interaction between users,the Deffuant opinion dynamics model and evolutionary game theory are introduced to simulate the public opinion game between dissemination and immune nodes.Finally,the numerical simulation and results analysis are given.The results reveal that the rate of opinion convergence significantly affects disseminating public opinion,which is positively correlated with the promotion effect of the dissemination node and negatively correlated with the suppression effect of the immune node of public opinion dissemination.Derivative and secondary radiations have different effects on public opinion dissemination in the early stage,but promote public opinion dissemination in the later stage.The dominant immune nodes have an apparent inhibitory effect on the spread of public opinion;nevertheless,they cannot block the dissemination of public opinion.展开更多
In this paper, we proposed a general form of a multi-team Bertrand game. Then, we studied a two-team Bertrand game, each team consists of two firms, with heterogeneous strategies among teams and homogeneous strategies...In this paper, we proposed a general form of a multi-team Bertrand game. Then, we studied a two-team Bertrand game, each team consists of two firms, with heterogeneous strategies among teams and homogeneous strategies among players. We find the equilibrium solutions and the conditions of their local stability. Numerical simulations were used to illustrate the complex behaviour of the proposed model, such as period doubling bifurcation and chaos. Finally, we used the feedback control method to control the model.展开更多
基金supported by the Industry-University-Research Cooperation Fund Project of the Eighth Research Institute of China Aerospace Science and Technology Corporation (USCAST2022-11)Aeronautical Science Foundation of China (20220001057001)。
文摘This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the learning process and adapt their policies sequentially.Our method removes the dependence of admissible initial policies,which is one of the main drawbacks of the PI-based frameworks.Furthermore,this algorithm enables the players to adapt their control policies without full knowledge of others’ system parameters or control laws.The efficacy of our method is illustrated by three examples.
基金supported by the National Natural Science Foundation of China(Grant No.61933010 and 61903301)Shaanxi Aerospace Flight Vehicle Design Key Laboratory。
文摘Cooperative autonomous air combat of multiple unmanned aerial vehicles(UAVs)is one of the main combat modes in future air warfare,which becomes even more complicated with highly changeable situation and uncertain information of the opponents.As such,this paper presents a cooperative decision-making method based on incomplete information dynamic game to generate maneuver strategies for multiple UAVs in air combat.Firstly,a cooperative situation assessment model is presented to measure the overall combat situation.Secondly,an incomplete information dynamic game model is proposed to model the dynamic process of air combat,and a dynamic Bayesian network is designed to infer the tactical intention of the opponent.Then a reinforcement learning framework based on multiagent deep deterministic policy gradient is established to obtain the perfect Bayes-Nash equilibrium solution of the air combat game model.Finally,a series of simulations are conducted to verify the effectiveness of the proposed method,and the simulation results show effective synergies and cooperative tactics.
基金supported by Foundation for Innovative Research Groups of National Natural Science Foundation of China(NSFC)(61321002)National Science Fund for Distinguished Young Scholars(60925011)+2 种基金Projects of Major International(Regional)Joint Research Program NSFC(61120106010)Beijing Education Committee Cooperation Building Foundation Project,Program for Changjiang Scholars and Innovative Research Team in University(IRT1208)Chang Jiang Scholars Program and National Natural Science Foundation of China(61203078)
文摘This paper is concerned with Hepato-Cellular Carcinoma (HCC) patients treated naturopathic agents. Patients treated with ≥4 agents survived significantly longer than patients treated with ≤3 agents. The great effect is seen in patients treated with at least 4 agents that include Cordyceps sinensis. This greater certainty of patient survival without toxic side effects is significant benefit comparing with the conventional therapy. Treatment of HCC with a regimen of ≥4 agents prepared from natural products is associated with greater certainty of patient survival in a substantial portion of patients. The information dynamic model for certainty of patient survival is derived based on fluid mechanics, where a series of approximate solutions of the flow between two parallel flat walls, one of which is at rest, the other is suddenly accelerated from the rest to a constant velocity are used. The kinetic energy of certainty of patient survival decreases with increasing time, while the potential energy increases with increasing time. Total mechanical energy of patients treated with 4 or more agents is smaller than that treated with 3 or fewer agents. The kinetic energy (potential energy) of patients treated with 4 or more agents decreases (increases) more slower than the kinetic energy (potential energy) of patients treated with 3 or fewer agents.
基金supported by the National Natural Science Foundation of China under Grant No. 61303074 and No. 61309013the Henan Province Science and Technology Project Funds under Grant No. 12210231002
文摘Nowadays, security defence of network uses the game theory, which mostly applies complete information game model or even the static game model. To get closer to the actual network and defend actively, we propose a network attack-defence game model by using signalling game, which is modelled in the way of dynamic and incomplete information. We improve the traditional attack-defence strategies quantization method to meet the needs of the network signalling game model. Moreover, we give the calculation of the game equilibrium and analyse the optimal defence scheme. Finally, we analyse and verify effectiveness of the model and method through a simulation experiment.
基金supported by the National Social Science Foundation of China(No.20BGL025)and the Postgraduate Practice Innovation Program of Jiangsu Province(No.SJCX200316).
文摘Given the fragmentation of public opinion dissemination and the lag of network users’cognition,the paper analyzes public opinion dissemination with incomplete information,which can provide reference for us to control and guide the spread of public opinion.Based on the derivative and secondary radiation of public opinion dissemination with incomplete information,the Susceptible-Susceptible-Infected-Recovered-Recovered-Infected(SSIRR-I)model is proposed.Given the interaction between users,the Deffuant opinion dynamics model and evolutionary game theory are introduced to simulate the public opinion game between dissemination and immune nodes.Finally,the numerical simulation and results analysis are given.The results reveal that the rate of opinion convergence significantly affects disseminating public opinion,which is positively correlated with the promotion effect of the dissemination node and negatively correlated with the suppression effect of the immune node of public opinion dissemination.Derivative and secondary radiations have different effects on public opinion dissemination in the early stage,but promote public opinion dissemination in the later stage.The dominant immune nodes have an apparent inhibitory effect on the spread of public opinion;nevertheless,they cannot block the dissemination of public opinion.
文摘In this paper, we proposed a general form of a multi-team Bertrand game. Then, we studied a two-team Bertrand game, each team consists of two firms, with heterogeneous strategies among teams and homogeneous strategies among players. We find the equilibrium solutions and the conditions of their local stability. Numerical simulations were used to illustrate the complex behaviour of the proposed model, such as period doubling bifurcation and chaos. Finally, we used the feedback control method to control the model.