Greedy Algorithm almost Dominates in Smoothed Contextual Bandits

Cited by: 0|Bibtex|Views47
Other Links: arxiv.org

Abstract:

Online learning algorithms, widely used to power search and content optimization on the web, must balance exploration and exploitation, potentially sacrificing the experience of current users in order to gain information that will lead to better decisions in the future. While necessary in the worst case, explicit exploration has a numbe...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments