Bayesian Optimization for Policy Search via Online-Offline Experimentation

Benjamin Letham
Benjamin Letham

Journal of Machine Learning Research, 2019.

Cited by: 4|Bibtex|Views94
EI
Other Links: dblp.uni-trier.de|academic.microsoft.com|arxiv.org

Abstract:

Online field experiments are the gold-standard way of evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces - such as those found in recommendation and ranking problems - is often constrained by the limited number of experiments that can be run simultaneo...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments