This paper studies the evolutionary process of cooperative behavior in a public goods game model with heterogeneous investment strategies in square lattices.In the proposed model,players are divided into defectors,coo...This paper studies the evolutionary process of cooperative behavior in a public goods game model with heterogeneous investment strategies in square lattices.In the proposed model,players are divided into defectors,cooperators and discreet investors.Among these,defectors do not participate in investing,discreet investors make heterogeneous investments based on the investment behavior and cooperation value of their neighbors,and cooperators invest equally in each neighbor.In real life,heterogeneous investment is often accompanied by time or economic costs.The discreet investors in this paper pay a certain price to obtain their neighbors'investment behavior and cooperation value,which quantifies the time and economic costs of the heterogeneous investment process.The results of Monte Carlo simulation experiments in this study show that discreet investors can effectively resist the invasion of the defectors,form a stable cooperative group and expand the cooperative advantage in evolution.However,when discreet investors pay too high a price,they lose their strategic advantage.The results in this paper help us understand the role of heterogeneous investment in promoting and maintaining human social cooperation.展开更多
The key advantage of unmanned swarm operation is its autonomous cooperation. How to improve the proportion of cooperators is one of the key issues of autonomous collaboration in unmanned swarm operations. This work pr...The key advantage of unmanned swarm operation is its autonomous cooperation. How to improve the proportion of cooperators is one of the key issues of autonomous collaboration in unmanned swarm operations. This work proposes a strategy dominance mechanism of autonomous collaboration in unmanned swarm within the framework of public goods game. It starts with the requirement analysis of autonomous collaboration in unmanned swarm;and an aspiration-driven multiplayer evolutionary game model is established based on the requirement. Then the average abundance function and strategy dominance condition of the model are constructed by theoretical derivation. Furthermore, the evolutionary mechanism of parameter adjustment in swarm cooperation is revealed via simulation,and the influences of the multiplication factor r, aspiration levelα, threshold m and other parameters on the strategy dominance conditions were simulated for both linear and threshold public goods games(PGGs) to determine the strategy dominance characteristics;Finally, deliberate proposals are suggested to provide a meaningful exploration in the actual control of unmanned swarm cooperation.展开更多
In this paper, we study the public goods games with punishment by adopting the well-known approximate best response dynamics. It shows that the evolution of cooperation is affected by two aspects when other parameters...In this paper, we study the public goods games with punishment by adopting the well-known approximate best response dynamics. It shows that the evolution of cooperation is affected by two aspects when other parameters are fixed. One is the punishment mechanism which can avoid the dilemma of lacking investment, and the other is the degree of rationality. Theoretical analysis and numerical results indicate that the existence of punishment mechanism and distribution of rationality are the keys to the enhancement of cooperation level. We also testify that they can heavily influence the payoffs of system as well. The findings in this paper may provide a deeper understanding of some social dilemmas.展开更多
We study the stochastic evolutionary public goods game with punishment in a finite size population. Two kinds of costly punishments are considered, i.e., first-order punishment in which only the defectors are punished...We study the stochastic evolutionary public goods game with punishment in a finite size population. Two kinds of costly punishments are considered, i.e., first-order punishment in which only the defectors are punished, and second-order punishment in which both the defectors and the cooperators who do not punish the defective behaviors are punished. We focus on the stochastic stable equilibrium of the system. In the population, the evolutionary process of strategies is described as a finite state Markov process. The evolutionary equilibrium of the system and its stochastic stability are analyzed by the limit distribution of the Markov process. By numerical experiments, our findings are as follows.(i) The first-order costly punishment can change the evolutionary dynamics and equilibrium of the public goods game, and it can promote cooperation only when both the intensity of punishment and the return on investment parameters are large enough.(ii)Under the first-order punishment, the further imposition of the second-order punishment cannot change the evolutionary dynamics of the system dramatically, but can only change the probability of the system to select the equilibrium points in the "C+P" states, which refer to the co-existence states of cooperation and punishment. The second-order punishment has limited roles in promoting cooperation, except for some critical combinations of parameters.(iii) When the system chooses"C+P" states with probability one, the increase of the punishment probability under second-order punishment will further increase the proportion of the "P" strategy in the "C+P" states.展开更多
In this work, the optional public goods games with punishment are studied. By adopting the approximate best response dynamics, a micro model is given to explain the evolutionary process. Simultaneously, the magnitude ...In this work, the optional public goods games with punishment are studied. By adopting the approximate best response dynamics, a micro model is given to explain the evolutionary process. Simultaneously, the magnitude of rationality is also considered. Under the condition of bounded rationality which provides a light to interpret phenomena in human society, the model leads to two types of equilibriums. One is the equilibrium without punishers and the other is the equilibrium including only punishers and cooperators. In addition, the effects of rationality on equilibriums are briefly investigated.展开更多
We investigate the evolution of cooperation with evolutionary public goods games based on finite populations, where four pure strategies: cooperators, defectors, punishers and loners who are unwilling to participate ...We investigate the evolution of cooperation with evolutionary public goods games based on finite populations, where four pure strategies: cooperators, defectors, punishers and loners who are unwilling to participate are considered. By adopting approximate best response dynamics, we show that the magnitude of rationality not only quantitatively explains the experiment results in [Nature (London) 425 (2003) 390], but also it will heavily influence the evolution of cooperation. Compared with previous results of infinite populations, which result in two equilibriums, we show that there merely exists a special equilibrium cooperation. In addition, we characterize that loner's and the relevant high value of bounded rationality will sustain payoff plays an active role in the maintenance of cooperation, which will only be warranted for the low and moderate values of loner's payoff. It thus indicates the effects of rationality and loner's payoff will influence the cooperation. Finally, we highlight the important result that the introduction of voluntary participation and punishment will facilitate cooperation greatly.展开更多
Payoff-driven strategy updating rule has always been adopted as a classic mechanism,but up to now,there have been a great many of researches on considering other forms of strategy updating rules,among which pursuing h...Payoff-driven strategy updating rule has always been adopted as a classic mechanism,but up to now,there have been a great many of researches on considering other forms of strategy updating rules,among which pursuing high fitness is one of the most direct and conventional motivations in the decision-making using game theory.But there are few or no researches on fitness from the perspective of others'evaluation.In view of this,we propose a new model in which the evaluation effect with fitness-driven strategy updating rule is taken into consideration,and introduce an evaluation coefficient to present the degree of others'evaluation on individual's behavior.The cooperative individuals can get positive evaluation,otherwise defective individuals get negative evaluation,and the degree of evaluation is related to the number of neighbors who have the same strategy of individual.Through numerical simulation,we find that the evaluation effect of others can enhance the network reciprocity,thus promoting the cooperation.For a strong dilemma,the higher evaluation coefficient can greatly weaken the cooperation dilemma;for a weak one,the higher evaluation coefficient can make cooperator clusters spread faster,however,there is no significant difference in the level of cooperation in the final stable state among different evaluation coefficients.The cooperation becomes more flourish as the number of fitness-driven individuals increases,when all individuals adopt fitness-driven strategy updating rule,the cooperators can quickly occupy the whole population.Besides,we demonstrate the robustness of the results on the WS small-world network,ER random network,and BA scalefree network.展开更多
We investigate the evolution of cooperation in public goods game based on individuals' historical payoffs. In particular, the fitness of individuals are characterized by two types of payoffs, which are obtained by...We investigate the evolution of cooperation in public goods game based on individuals' historical payoffs. In particular, the fitness of individuals are characterized by two types of payoffs, which are obtained by acting as cooperators and defectors, respectively. Both of payoffs are the linear combination of the current payoffs and the cumulative historical payoffs. The results show that cooperation is enhanced by an increasing memory effect with a wide range of related factors. To explain this phenomenon, we plot some representative snapshots of the population and scrutinize the mean fitness of cooperators and defectors along the boundary. It is found that increasing memory effect induces a positive feedback mechanism for cooperators to expand their districts. Defectors can just survive through forming narrower clusters to exploit cooperators more widely. The threshold values for cooperators and defectors vanishing under the influence of noise are also investigated.展开更多
在无人机(UAV)集群攻击地面目标时,UAV集群将分为两个编队:主攻目标的打击型UAV集群和牵制敌方的辅助型UAV集群。当辅助型UAV集群选择激进进攻或保存实力这两种动作策略时,任务场景类似于公共物品博弈,此时合作者的收益小于背叛者。基于...在无人机(UAV)集群攻击地面目标时,UAV集群将分为两个编队:主攻目标的打击型UAV集群和牵制敌方的辅助型UAV集群。当辅助型UAV集群选择激进进攻或保存实力这两种动作策略时,任务场景类似于公共物品博弈,此时合作者的收益小于背叛者。基于此,提出一种基于深度强化学习的UAV集群协同作战决策方法。首先,通过建立基于公共物品博弈的UAV集群作战模型,模拟智能化UAV集群在合作中个体与集体间的利益冲突问题;其次,利用多智能体深度确定性策略梯度(MADDPG)算法求解辅助UAV集群最合理的作战决策,从而以最小的损耗代价实现集群胜利。在不同数量UAV情况下进行训练并展开实验,实验结果表明,与IDQN(Independent Deep QNetwork)和ID3QN(Imitative Dueling Double Deep Q-Network)这两种算法的训练效果相比,所提算法的收敛性最好,且在4架辅助型UAV情况下胜率可达100%,在其他UAV数情况下也明显优于对比算法。展开更多
Hardin's "The Tragedy of the Commons" prophesies the inescapable collapse of many human enterprises.The emergence and abundance of cooperation in animal and human societies is a challenging puzzle to evo...Hardin's "The Tragedy of the Commons" prophesies the inescapable collapse of many human enterprises.The emergence and abundance of cooperation in animal and human societies is a challenging puzzle to evolutionary theory.In this work,we introduce a new decision-making criterion into a voluntary public goods game with incomplete information and choose successful strategies according to previous payoffs for a certain strategy as well as the risk-averse benefit.We find that the interest rate of the common pool and the magnitude of memory have crucial effects on the average welfare of the population.The appropriate sense of individuals' innovation also substantially influences the equilibrium strategies distribution in the long run.展开更多
The phenomena of cooperation in animal and human society are ubiquitous, but the selfish outcome that no player contributes to the public good will lead to the "tragedy of the commons". The recent research s...The phenomena of cooperation in animal and human society are ubiquitous, but the selfish outcome that no player contributes to the public good will lead to the "tragedy of the commons". The recent research shows that high punishment can improve the cooperation of the population. In this paper, we introduce a punishment mechanism into spatial voluntary public goods games with every individual only knowing his own payoff in each round. Using the self-adjusting rules, we find that the different cost for punishment can lead to different effects on the voluntary public goods games. Especially, when the cost for punishment is decreased, a higher contribution region will appear in the case of low r value. It means even for the low r value, individuals can form the contributing groups in large quantities to produce a more efficient outcome than that in moderate r value. In addition, we also find the players' memory can have effects on the average outcome of the population.展开更多
The regular small-world network, which contains the properties of small-world network and regular network, has recently received substantial attention and has been applied in researches on 2-person games. However, it ...The regular small-world network, which contains the properties of small-world network and regular network, has recently received substantial attention and has been applied in researches on 2-person games. However, it is a common phenomenon that cooperation always appears as a group behavior. In order to investigate the mechanism of group cooperation, we propose an evolutionary multi-person game model on a regular small-world network based on public goods game theory. Then, to make a comparison of frequency of cooperation among different networks, we carry out simulations on three kinds of networks with the same configuration of average degree: the square lattice, regular small-world network and random regular network. The results of simulation show that the group cooperation will emerge among these three networks when the enhancement factor r exceeds a threshold. Furthermore, time required for full cooperation on regular small-world network is slightly longer than the other networks, which indicates that the compact interactions and random interactions will promote cooperation, while the longer-range links are the obstacles in the emergence of cooperation. In addition, the cooperation would be promoted further by enhancing the random interactions on regular small-world network.展开更多
基金Project supported by the Open Foundation of Key Laboratory of Software Engineering of Yunnan Province(Grant Nos.2020SE308 and 2020SE309).
文摘This paper studies the evolutionary process of cooperative behavior in a public goods game model with heterogeneous investment strategies in square lattices.In the proposed model,players are divided into defectors,cooperators and discreet investors.Among these,defectors do not participate in investing,discreet investors make heterogeneous investments based on the investment behavior and cooperation value of their neighbors,and cooperators invest equally in each neighbor.In real life,heterogeneous investment is often accompanied by time or economic costs.The discreet investors in this paper pay a certain price to obtain their neighbors'investment behavior and cooperation value,which quantifies the time and economic costs of the heterogeneous investment process.The results of Monte Carlo simulation experiments in this study show that discreet investors can effectively resist the invasion of the defectors,form a stable cooperative group and expand the cooperative advantage in evolution.However,when discreet investors pay too high a price,they lose their strategic advantage.The results in this paper help us understand the role of heterogeneous investment in promoting and maintaining human social cooperation.
基金supported by the National Natural Science Foundation of China(71901217)the National Key R&D Program of China(2018YFC0806900).
文摘The key advantage of unmanned swarm operation is its autonomous cooperation. How to improve the proportion of cooperators is one of the key issues of autonomous collaboration in unmanned swarm operations. This work proposes a strategy dominance mechanism of autonomous collaboration in unmanned swarm within the framework of public goods game. It starts with the requirement analysis of autonomous collaboration in unmanned swarm;and an aspiration-driven multiplayer evolutionary game model is established based on the requirement. Then the average abundance function and strategy dominance condition of the model are constructed by theoretical derivation. Furthermore, the evolutionary mechanism of parameter adjustment in swarm cooperation is revealed via simulation,and the influences of the multiplication factor r, aspiration levelα, threshold m and other parameters on the strategy dominance conditions were simulated for both linear and threshold public goods games(PGGs) to determine the strategy dominance characteristics;Finally, deliberate proposals are suggested to provide a meaningful exploration in the actual control of unmanned swarm cooperation.
基金Project supported by the National Natural Science Foundation of China (Grant No. 10672081).
文摘In this paper, we study the public goods games with punishment by adopting the well-known approximate best response dynamics. It shows that the evolution of cooperation is affected by two aspects when other parameters are fixed. One is the punishment mechanism which can avoid the dilemma of lacking investment, and the other is the degree of rationality. Theoretical analysis and numerical results indicate that the existence of punishment mechanism and distribution of rationality are the keys to the enhancement of cooperation level. We also testify that they can heavily influence the payoffs of system as well. The findings in this paper may provide a deeper understanding of some social dilemmas.
基金supported by the National Natural Science Foundation of China(Grant Nos.71501149 and 71231007)the Soft Science Project of Hubei Province,China(Grant No.2017ADC122)the Fundamental Research Funds for the Central Universities,China(Grant No.WUT:2017VI070)
文摘We study the stochastic evolutionary public goods game with punishment in a finite size population. Two kinds of costly punishments are considered, i.e., first-order punishment in which only the defectors are punished, and second-order punishment in which both the defectors and the cooperators who do not punish the defective behaviors are punished. We focus on the stochastic stable equilibrium of the system. In the population, the evolutionary process of strategies is described as a finite state Markov process. The evolutionary equilibrium of the system and its stochastic stability are analyzed by the limit distribution of the Markov process. By numerical experiments, our findings are as follows.(i) The first-order costly punishment can change the evolutionary dynamics and equilibrium of the public goods game, and it can promote cooperation only when both the intensity of punishment and the return on investment parameters are large enough.(ii)Under the first-order punishment, the further imposition of the second-order punishment cannot change the evolutionary dynamics of the system dramatically, but can only change the probability of the system to select the equilibrium points in the "C+P" states, which refer to the co-existence states of cooperation and punishment. The second-order punishment has limited roles in promoting cooperation, except for some critical combinations of parameters.(iii) When the system chooses"C+P" states with probability one, the increase of the punishment probability under second-order punishment will further increase the proportion of the "P" strategy in the "C+P" states.
基金Project supported by the National Natural Science Foundation of China (Grant No. 10672081)the Center for Asia Studies of Nankai University (Grant No. 2010-5)
文摘In this work, the optional public goods games with punishment are studied. By adopting the approximate best response dynamics, a micro model is given to explain the evolutionary process. Simultaneously, the magnitude of rationality is also considered. Under the condition of bounded rationality which provides a light to interpret phenomena in human society, the model leads to two types of equilibriums. One is the equilibrium without punishers and the other is the equilibrium including only punishers and cooperators. In addition, the effects of rationality on equilibriums are briefly investigated.
基金Supported by National Nature Science Foundation under Grant No.60904063the Tianjin municipal Natural Science Foundation under Grant Nos.11JCYBJC06600,11ZCKF6X00900,11ZCKFGX00900
文摘We investigate the evolution of cooperation with evolutionary public goods games based on finite populations, where four pure strategies: cooperators, defectors, punishers and loners who are unwilling to participate are considered. By adopting approximate best response dynamics, we show that the magnitude of rationality not only quantitatively explains the experiment results in [Nature (London) 425 (2003) 390], but also it will heavily influence the evolution of cooperation. Compared with previous results of infinite populations, which result in two equilibriums, we show that there merely exists a special equilibrium cooperation. In addition, we characterize that loner's and the relevant high value of bounded rationality will sustain payoff plays an active role in the maintenance of cooperation, which will only be warranted for the low and moderate values of loner's payoff. It thus indicates the effects of rationality and loner's payoff will influence the cooperation. Finally, we highlight the important result that the introduction of voluntary participation and punishment will facilitate cooperation greatly.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61673096 and 62076057)the Social Science Project of the Ministry of Education of China(Grant No.16YJC630118)the Project of Promoting Talents in Liaoning Province,China(Grant No.XLYC1807033)。
文摘Payoff-driven strategy updating rule has always been adopted as a classic mechanism,but up to now,there have been a great many of researches on considering other forms of strategy updating rules,among which pursuing high fitness is one of the most direct and conventional motivations in the decision-making using game theory.But there are few or no researches on fitness from the perspective of others'evaluation.In view of this,we propose a new model in which the evaluation effect with fitness-driven strategy updating rule is taken into consideration,and introduce an evaluation coefficient to present the degree of others'evaluation on individual's behavior.The cooperative individuals can get positive evaluation,otherwise defective individuals get negative evaluation,and the degree of evaluation is related to the number of neighbors who have the same strategy of individual.Through numerical simulation,we find that the evaluation effect of others can enhance the network reciprocity,thus promoting the cooperation.For a strong dilemma,the higher evaluation coefficient can greatly weaken the cooperation dilemma;for a weak one,the higher evaluation coefficient can make cooperator clusters spread faster,however,there is no significant difference in the level of cooperation in the final stable state among different evaluation coefficients.The cooperation becomes more flourish as the number of fitness-driven individuals increases,when all individuals adopt fitness-driven strategy updating rule,the cooperators can quickly occupy the whole population.Besides,we demonstrate the robustness of the results on the WS small-world network,ER random network,and BA scalefree network.
基金Supported by the National Natural Science Foundation of China (NSFC) (No. 61074120)
文摘We investigate the evolution of cooperation in public goods game based on individuals' historical payoffs. In particular, the fitness of individuals are characterized by two types of payoffs, which are obtained by acting as cooperators and defectors, respectively. Both of payoffs are the linear combination of the current payoffs and the cumulative historical payoffs. The results show that cooperation is enhanced by an increasing memory effect with a wide range of related factors. To explain this phenomenon, we plot some representative snapshots of the population and scrutinize the mean fitness of cooperators and defectors along the boundary. It is found that increasing memory effect induces a positive feedback mechanism for cooperators to expand their districts. Defectors can just survive through forming narrower clusters to exploit cooperators more widely. The threshold values for cooperators and defectors vanishing under the influence of noise are also investigated.
文摘在无人机(UAV)集群攻击地面目标时,UAV集群将分为两个编队:主攻目标的打击型UAV集群和牵制敌方的辅助型UAV集群。当辅助型UAV集群选择激进进攻或保存实力这两种动作策略时,任务场景类似于公共物品博弈,此时合作者的收益小于背叛者。基于此,提出一种基于深度强化学习的UAV集群协同作战决策方法。首先,通过建立基于公共物品博弈的UAV集群作战模型,模拟智能化UAV集群在合作中个体与集体间的利益冲突问题;其次,利用多智能体深度确定性策略梯度(MADDPG)算法求解辅助UAV集群最合理的作战决策,从而以最小的损耗代价实现集群胜利。在不同数量UAV情况下进行训练并展开实验,实验结果表明,与IDQN(Independent Deep QNetwork)和ID3QN(Imitative Dueling Double Deep Q-Network)这两种算法的训练效果相比,所提算法的收敛性最好,且在4架辅助型UAV情况下胜率可达100%,在其他UAV数情况下也明显优于对比算法。
基金supported by the National High-Tech Research and Development Program of China (2009AA043703)the National Natural Science Foundation of China (91023045)+1 种基金the Center for Asia Studies of Nankai University (AS1005)the Development Fund of Science and Technology for the Higher Education in Tianjin (20100908)
文摘Hardin's "The Tragedy of the Commons" prophesies the inescapable collapse of many human enterprises.The emergence and abundance of cooperation in animal and human societies is a challenging puzzle to evolutionary theory.In this work,we introduce a new decision-making criterion into a voluntary public goods game with incomplete information and choose successful strategies according to previous payoffs for a certain strategy as well as the risk-averse benefit.We find that the interest rate of the common pool and the magnitude of memory have crucial effects on the average welfare of the population.The appropriate sense of individuals' innovation also substantially influences the equilibrium strategies distribution in the long run.
基金Supported by National High Technology Research and Development Program of China(863 program/2009AA043703)National Natural Science Foundation of China under Grant No.91023045+1 种基金the Center for Asia Studies of Nankai University under Grant No.AS1005Tianjin City High School Science&Technology Fund Planning Project under Grant No.20100908
文摘The phenomena of cooperation in animal and human society are ubiquitous, but the selfish outcome that no player contributes to the public good will lead to the "tragedy of the commons". The recent research shows that high punishment can improve the cooperation of the population. In this paper, we introduce a punishment mechanism into spatial voluntary public goods games with every individual only knowing his own payoff in each round. Using the self-adjusting rules, we find that the different cost for punishment can lead to different effects on the voluntary public goods games. Especially, when the cost for punishment is decreased, a higher contribution region will appear in the case of low r value. It means even for the low r value, individuals can form the contributing groups in large quantities to produce a more efficient outcome than that in moderate r value. In addition, we also find the players' memory can have effects on the average outcome of the population.
基金Supported by the National Natural Science Foundation of China(71601148)the National Social Science Foundation of China(14ZDA062)Humanities and Social Science Research Foundation of Ministry of Education(14JDGC012)
文摘The regular small-world network, which contains the properties of small-world network and regular network, has recently received substantial attention and has been applied in researches on 2-person games. However, it is a common phenomenon that cooperation always appears as a group behavior. In order to investigate the mechanism of group cooperation, we propose an evolutionary multi-person game model on a regular small-world network based on public goods game theory. Then, to make a comparison of frequency of cooperation among different networks, we carry out simulations on three kinds of networks with the same configuration of average degree: the square lattice, regular small-world network and random regular network. The results of simulation show that the group cooperation will emerge among these three networks when the enhancement factor r exceeds a threshold. Furthermore, time required for full cooperation on regular small-world network is slightly longer than the other networks, which indicates that the compact interactions and random interactions will promote cooperation, while the longer-range links are the obstacles in the emergence of cooperation. In addition, the cooperation would be promoted further by enhancing the random interactions on regular small-world network.