Universal Best Arm Identification.

IEEE Transactions on Signal Processing (2019)

Abstract
In this paper, we study the problem of universal best arm identification in multi-armed bandits, where the underlying setting can be either stochastic or adversarial and is not revealed to the forecaster a priori. We propose an algorithm, called S3-BA, that identifies the best arm without prior knowledge of the underlying setting. The key idea is to simultaneously explore the arms and learn the pr...
Keywords
Stochastic processes, Signal processing algorithms, Minimization, Switches, Convergence, Interference, Upper bound