期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
1
作者 Yifan Lin Yuhao Wang enlu zhou 《Journal of Systems Science and Systems Engineering》 SCIE EI CSCD 2023年第3期267-288,共22页
In this paper we consider the contextual multi-armed bandit problem for linear payoffs under a risk-averse criterion.At each round,contexts are revealed for each arm,and the decision maker chooses one arm to pull and ... In this paper we consider the contextual multi-armed bandit problem for linear payoffs under a risk-averse criterion.At each round,contexts are revealed for each arm,and the decision maker chooses one arm to pull and receives the corresponding reward.In particular,we consider mean-variance as the risk criterion,and the best arm is the one with the largest mean-variance reward.We apply the Thompson sampling algorithm for the disjoint model,and provide a comprehensive regret analysis for a variant of the proposed algorithm.For T rounds,K actions,and d-dimensional feature vectors,we prove a regret bound of O((1+ρ+1/ρ)d In T ln K/δ√dKT^(1+2∈)ln K/δ1/e)that holds with probability 1-δunder the mean-variance criterion with risk tolerance p,for any 0<ε<1/2,0<δ<1.The empirical performance of our proposed algorithms is demonstrated via a portfolio selection problem. 展开更多
关键词 Multi-armed bandit CONTEXT RISK-AVERSE Thompson sampling
原文传递
Experimental design and model reduction in systems biology 被引量:1
2
作者 Jenny E.Jeong Qinwei Zhuang +2 位作者 Mark K.Transtrum enlu zhou Peng Qiu 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2018年第4期287-306,共20页
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部