博弈收益控制研究进展被引量：1

Payoff control in game theory

导出

摘要在博弈论中,单个个体控制全部个体的收益通常被认为是不可能的.一个例外是20世纪末在重复囚徒困境中提出的均衡器策略:使用这种策略的个体可以将对手的收益设置为由收益函数所决定的某个区间内的任意值.十余年后发现的零行列式策略通过单方面设置个体收益的线性关系,将该结果一般化.在此基础上,关于博弈收益控制的研究取得了一系列成果.本文概述了博弈收益控制的研究现状;介绍了单次博弈和重复博弈中的收益控制技术;从收益控制的基本概念、能控制的收益关系、收益控制策略的形式和收益控制策略的演化特性等方面总结了博弈中收益控制的主要进展和成果;并讨论了博弈收益控制的未来发展趋势. In game theory,a single player usually cannot control the payoffs of all players in a game.An exception is the equalizer strategy proposed at the end of the last century for prisoner’s dilemma,with which a player can set their opponent’s payoff to be any designated value in a certain interval,regardless of which strategy the opponent uses.This result was further generalized with the discovery of the zero-determinant(ZD)strategies,which allow a player to unilaterally enforce a linear relationship between his own payoff and that of the opponent.The question of how payoff control can be established has attracted significant attention from computer scientists,control theorists,and evolutionary biologists,and many new results have been subsequently derived.This paper discusses the latest advances in payoff control,enforcing either a linear or nonlinear relation in one-shot or repeated games.In particular,we highlight the above question from four aspects:the concept of payoff control,forms of payoff relation that can be established,strategies that can control payoffs,and the evolutionary behavior of these strategies.We also provide an outlook on the directions for future research.

作者王龙陈芳陈星如 Long WANG;Fang CHEN;Xingru CHEN(Center for Systems and Control,Peking University,Beijing 100871,China;School of Sciences,Beijing University of Posts and Telecommunications,Beijing 100876,China)

机构地区北京大学系统与控制研究中心北京邮电大学理学院

出处《中国科学：信息科学》 CSCD 北大核心 2023年第4期623-646,共24页 Scientia Sinica(Informationis)

基金国家自然科学基金(批准号:62036002)资助项目。

关键词博弈论收益控制零行列式策略演化博弈论策略设计 game theory payoff control zero-determinant strategy evolutionary game theory strategy design

分类号 F224.32 [经济管理—国民经济]

引文网络
相关文献

参考文献4

1王龙,武斌,杜金铭,魏钰婷,周达.复杂动态网络上的传播行为分析献给清华大学郑大钟教授[J].中国科学：信息科学,2020,50(11):1714-1731. 被引量：8
2王龙,伏锋,陈小杰,王靖,武斌,楚天广,谢广明.复杂网络上的群体决策[J].智能系统学报,2008,3(2):95-108. 被引量：24
3王龙,杜金铭.多智能体协调控制的演化博弈方法[J].系统科学与数学,2016,36(3):302-318. 被引量：18
4王龙,丛睿,李昆.合作演化中的反馈机制[J].中国科学：信息科学,2014,44(12):1495-1514. 被引量：16

二级参考文献275

1程代展,陈翰馥.从群集到社会行为控制[J].科技导报,2004,22(8):4-7. 被引量：32
2王龙,伏锋,陈小杰,王靖,李卓政,谢广明,楚天广.复杂网络上的演化博弈[J].智能系统学报,2007,2(2):1-10. 被引量：33
3王龙,伏锋,陈小杰,楚天广,谢广明.演化博弈与自组织合作[J].系统科学与数学,2007,27(3):330-343. 被引量：16
4[1]STAUFFER D.Opinion dynamics and sociophysics[EB/OL].[2007-05-07].http://arxiv.org/abs/0705.0891v1.
5[2]STAUFFER D.Sociophysics simulations II:opinion dynamics[C]// AIP Conference Proceedings,Granada,Spain,2005,779:56-68.
6[3]STAUFFER D,De OLIVEIRA S,De OLIVEIRA PMC,et al.Biology,sociology,geology by computational physicists[M].Amsterdam:Elsevier,2006.
7[4]BILLARI FC,FENT T,PRSKAWETZ A,et al.Agent-based computational modeling[M].Heidelberg:Physica-Verlag,2006.
8[5]JAIN S,MUKAND S.Public opinion and the dynamics of reform[C]// Sixth Jacques Polak Annual Research Conference Hosted by the International Monetary Fund Washington.Washington,USA,2005.
9[6]VALLACHER R R,NOWAK A,MILLER M E.Social influence and group dynamics[M].New York:Wiley,2003.
10[7]BALDASSARRI D,BEARMAN P.Dynamics of political polarization[J].American Sociological Review,2007,72(5):784-811.