Video Recommendation with Multi-gate Mixture of Experts Soft Actor Critic
SIGIR '20: The 43rd International ACM SIGIR conference on research and development in Information Retrieval Virtual Event China July, 2020, pp. 1553-1556, 2020.
In this paper, we propose a reinforcement learning based large scale multi-objective ranking system for optimizing short-video recommendation on an industrial video sharing platform. Multiple competing ranking objective and implicit selection bias in user feedback are the main challenges in real-world platform. In order to address those c...More
Full Text (Upload PDF)
PPT (Upload PPT)