Dueling Bandits: From Two-dueling to Multi-dueling
AAMAS '19: International Conference on Autonomous Agents and Multiagent Systems Auckland New Zealand May, 2020, pp. 348-356, 2020.
We study a general multi-dueling bandit problem, where an agent compares multiple options simultaneously and aims to minimize the regret due to selecting suboptimal arms. This setting generalizes the traditional two-dueling bandit problem and finds many real-world applications involving subjective feedback on multiple options. We start wi...More
Full Text (Upload PDF)
PPT (Upload PPT)