Playing against Hedge

Abstract: Hedge has been proposed as an adaptive scheme that guides the player's hand in a full-information multi-armed bandit game. Applications of this game exist in network path selection, load distribution, and network interdiction. We perform a worst-case analysis of the Hedge algorithm by using an adversary who consistently selects penalties so as to maximize the player's loss, assuming that the adversary's penalty budget is limited. We further explore the performance of binary penalties, and we prove that the optimum binary strategy for the adversary is to make greedy decisions.
Source: International Journal of Communications, Network and System Sciences, 2014, Issue 12, pp. 497-507 (11 pages)
Keywords: Hedge Algorithm; Adversary; Online Algorithm; Greedy Algorithm; Periodic Performance; Binary Penalties; Path Selection; Network Interdiction
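
To make the setting in the abstract concrete, the following is a minimal sketch of a Hedge player facing a greedy adversary with binary penalties. It is not taken from the paper. It assumes the standard multiplicative-weights form of Hedge, in which each arm's weight is multiplied by `beta ** penalty`. It also assumes that the adversary places exactly one unit penalty per round, on the arm the player currently favors most, until its budget runs out. The function names, the value of `beta`, and the one-penalty-per-round budget model are all illustrative assumptions.

```python
def hedge_probabilities(weights):
    """Normalize the arm weights into the player's mixed strategy."""
    total = sum(weights)
    return [w / total for w in weights]


def hedge_update(weights, penalties, beta):
    """Assumed standard Hedge update: scale each weight by beta ** penalty."""
    return [w * beta ** p for w, p in zip(weights, penalties)]


def greedy_binary_penalty(probs):
    """Hypothetical greedy adversary: put penalty 1 on the most probable arm.

    Ties are broken in favor of the lowest arm index.
    """
    target = max(range(len(probs)), key=lambda i: probs[i])
    return [1 if i == target else 0 for i in range(len(probs))]


def play(num_arms, budget, beta=0.5):
    """Run one round per unit of adversary budget (an assumed budget model).

    Returns the player's cumulative expected loss and final strategy.
    """
    weights = [1.0] * num_arms
    total_loss = 0.0
    for _ in range(budget):
        probs = hedge_probabilities(weights)
        penalties = greedy_binary_penalty(probs)
        # Expected loss this round is the probability mass on penalized arms.
        total_loss += sum(p * l for p, l in zip(probs, penalties))
        weights = hedge_update(weights, penalties, beta)
    return total_loss, hedge_probabilities(weights)


if __name__ == "__main__":
    loss, final_probs = play(num_arms=3, budget=6)
    print(loss, final_probs)
```

In this sketch the greedy adversary ends up cycling through the arms, because each penalty lowers the targeted arm's probability and makes another arm the most attractive target in the next round.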