Offline Policy Selection under Uncertainty

Other Links: arxiv.org

Abstract:

The presence of uncertainty in policy evaluation significantly complicates the process of policy ranking and selection in real-world settings. We formally consider offline policy selection as learning preferences over a set of policy prospects given a fixed experience dataset. While one can select or rank policies based on point estimat...
