CS/CNS/EE 253: Advanced Topics in Machine Learning Topic: Dealing with Partial Feedback #1

2013.

Cited by: 0|Bibtex|Views6
Other Links: academic.microsoft.com

Abstract:

function ri(t) which is unknown. In each round t, an arm i is chosen and the reward ri(t)2 (0; 1) is gained. Only ri(t) is revealed to the algorithm at the end of round t, where i is the arm chosen in that round; it is kept ignorant of rj(t) for all other arms j6= i. The goal is to nd an algorithm specifying how to choose an arm in each r...More

Code:

Data:

Your rating :
0

 

Tags
Comments