Multi-Player Bandits: The Adversarial Case

Pragnya Alatur, Kfir Y. Levy, Andreas Krause

Journal of Machine Learning Research (2020)

Abstract

We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by an application to cognitive radio networks, we assume that players incur a loss upon colliding, and that communication between players is not possible. Existing approaches assume that the system is stationary. Yet this assumption is often violated in practice, e.g., due to signal strength fluctuations. In this work, we design the first multi-player bandit algorithm that provably works in arbitrarily changing environments, where the losses of the arms may even be chosen by an adversary. This resolves an open problem posed by Rosenski et al. (2016).

Keywords

Multi-Armed Bandits, Multi-Player Problems, Online Learning, Sequential Decision Making, Cognitive Radio Networks
