Multi-armed Bandit Mechanism with Private Histories

AAMAS, pp. 1607-1609, 2017.



The fundamental challenge in the bandit problem is the trade-off between exploration and exploitation. To minimize regret over a long horizon, an algorithm has to explore by actually choosing seemingly suboptimal arms so as to gather more information about them. This exploration incurs higher short-term regret. In recommendation of ne...
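
The trade-off described above can be illustrated with a generic epsilon-greedy sketch. This is a textbook baseline, not the mechanism proposed in the paper. The function name `epsilon_greedy`, the Bernoulli reward model, and the parameters `true_means`, `horizon`, `epsilon`, and `seed` are all assumptions made for illustration.

```python
import random


def epsilon_greedy(true_means, horizon, epsilon, seed=0):
    """Illustrative epsilon-greedy bandit (hypothetical helper, not from the paper).

    true_means: assumed Bernoulli success probability of each arm.
    Returns (pull counts, estimated means, cumulative expected regret).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms
    best_mean = max(true_means)
    regret = 0.0

    for _ in range(horizon):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random, even if it looks bad.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running average reward for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        # Expected regret of this pull relative to the best arm.
        regret += best_mean - true_means[arm]

    return counts, estimates, regret


# Pure exploitation (epsilon = 0) never tries the second arm here,
# so it pays the full per-round regret on every pull.
greedy_counts, _, greedy_regret = epsilon_greedy([0.0, 1.0], 100, 0.0)
```

With `epsilon = 0` the sketch never leaves the first arm, because no pull ever gives it a reason to revise its estimates. A positive `epsilon` accepts some short-term regret in exchange for the information needed to find the better arm.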



