
TOWARDS A THEORY OF GAME-BASED NON-EQUILIBRIUM CONTROL SYSTEMS

Abstract: This paper considers optimization problems for a new kind of control system based on non-equilibrium dynamic games. To be precise, the authors consider infinitely repeated games between a human and a machine, based on the generic 2 × 2 game, in which the machine uses a fixed strategy with finite k-step memory. By introducing and analyzing state transfer graphs (STGs), it is shown that the system state becomes periodic after finitely many steps under the optimal strategy that maximizes the human's averaged payoff, which considerably eases the task of finding the optimal strategy. Moreover, the question of whether the optimizer will win or lose is investigated, and some interesting phenomena are found. For example, in the standard Prisoner's Dilemma game, the human will not lose to the machine while optimizing her own averaged payoff when k = 1; however, when k ≥ 2, she may indeed lose if she focuses on optimizing her own payoff only. The robustness of the optimal strategy and the identification problem are also considered. It appears that both the framework and the results go beyond those of classical control theory and traditional game theory.
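The periodicity result suggests that, for small k, the optimal averaged payoff can be found by searching cycles in the state transfer graph. The following Python sketch illustrates this idea for k = 1 only and is not the authors' algorithm. Several things here are assumptions made for illustration: the state is taken to be the pair of actions played at the previous step, the payoff values (3, 0, 5, 1) are one common choice for the Prisoner's Dilemma, and the names `tit_for_tat`, `build_stg`, and `best_cycle` are hypothetical.

```python
from itertools import product

# Assumed Prisoner's Dilemma payoffs: (human_action, machine_action) -> (human, machine).
PAYOFF = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}
ACTIONS = ("C", "D")
# Assumed state for k = 1: the (human, machine) actions at the previous step.
STATES = list(product(ACTIONS, ACTIONS))


def tit_for_tat(state):
    """Hypothetical 1-step-memory machine: repeat the human's last action."""
    human_prev, _ = state
    return human_prev


def build_stg(machine):
    """State transfer graph: each state maps to one successor per human action."""
    return {s: [(a, machine(s)) for a in ACTIONS] for s in STATES}


def best_cycle(stg, start):
    """Return (mean human payoff, cycle) with the largest mean among simple
    cycles reachable from `start`."""
    # Collect the states reachable from the initial state.
    reachable, stack = {start}, [start]
    while stack:
        for nxt in stg[stack.pop()]:
            if nxt not in reachable:
                reachable.add(nxt)
                stack.append(nxt)

    best = (float("-inf"), None)

    def dfs(origin, node, path):
        nonlocal best
        for nxt in stg[node]:
            if nxt == origin:
                # Closing the cycle: average the human payoff over its states.
                mean = sum(PAYOFF[s][0] for s in path) / len(path)
                if mean > best[0]:
                    best = (mean, list(path))
            elif nxt not in path and nxt > origin:
                # Enumerate each cycle once, rooted at its smallest state.
                dfs(origin, nxt, path + [nxt])

    for origin in sorted(reachable):
        dfs(origin, origin, [origin])
    return best


mean_payoff, cycle = best_cycle(build_stg(tit_for_tat), ("C", "C"))
print(mean_payoff, cycle)
```

Under these assumptions, the best cycle against the tit-for-tat machine is mutual cooperation, where the human's and the machine's payoffs are equal, so the human does not lose in this particular instance. Exhaustive cycle enumeration is only practical for small graphs; since the number of states grows with k, a larger memory length would call for a dedicated maximum-mean-cycle method.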
Authors: Yifen MU, Lei GUO
Source: Journal of Systems Science & Complexity (系统科学与复杂性学报(英文版)), indexed in SCIE, EI, CSCD; 2012, Issue 2, pp. 209-226 (18 pages)
Funding: Supported by the National Natural Science Foundation of China under Grant No. 60821091 and by the Knowledge Innovation Project of the Chinese Academy of Sciences under Grant No. KJCX3-SYW-S01.
Keywords: equilibrium control systems, strategic games, optimization problems, classical control theory, optimal strategy, repeated games, Prisoner's Dilemma, identification problem; heterogeneous players, non-equilibrium dynamical games, optimization, state transfer graph, win-loss criterion.