Bayesian Optimization for Policy Search via Online-Offline Experimentation
Journal of Machine Learning Research, 2019.
Online field experiments are the gold standard for evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces, such as those found in recommendation and ranking problems, is often constrained by the limited number of experiments that can be run simultaneo...