Policy Gradients for Contextual Recommendations

WWW '19: The Web Conference on The World Wide Web Conference WWW 2019, pp.1421-1431, (2019)

Cited by: 6|Views24
EI

Abstract:

Decision making is a challenging task in online recommender systems. The decision maker often needs to choose a contextual item at each step from a set of candidates. Contextual bandit algorithms have been successfully deployed to such applications, for the trade-off between exploration and exploitation and the state-of-art performance on...More

Code:

Data:

ZH
Your rating :
0

 

Tags
Comments