Efficient Contextual Bandits with Continuous Actions

NeurIPS 2020.

Abstract:

We create a computationally tractable algorithm for contextual bandits with continuous actions having unknown structure. Our reduction-style algorithm composes with most supervised learning representations. We prove that it works in a general sense and verify the new functionality with large-scale experiments.

Introduction
  • In contextual bandit learning [6, 1, 37, 3], an agent repeatedly observes its environment, chooses an action, and receives reward feedback, with the goal of optimizing cumulative reward (a minimal sketch of this interaction loop follows this list).
  • In operating systems, when a computer makes a connection over the network, its packet send rate may be adjusted in response to the current network status [28].
  • All of these may be optimized based on feedback and context.
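The sketch below illustrates this interaction protocol. It is a minimal, hedged example: the environment and learner interfaces (observe, act, reward, update) are illustrative assumptions, not part of the paper.

    # Sketch of the contextual bandit protocol: observe a context, choose an
    # action, receive feedback only for the chosen action, and learn from it.
    def run_contextual_bandit(env, learner, T):
        total_reward = 0.0
        for t in range(T):
            x = env.observe()            # current context (e.g., network status)
            a = learner.act(x)           # chosen action (e.g., packet send rate)
            r = env.reward(x, a)         # bandit feedback: reward of a only
            learner.update(x, a, r)      # update from partial feedback
            total_reward += r            # objective: cumulative reward
        return total_reward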
Highlights
  • In contextual bandit learning [6, 1, 37, 3], an agent repeatedly observes its environment, chooses an action, and receives reward feedback, with the goal of optimizing cumulative reward
  • We propose CATS, a new algorithm for contextual bandits with continuous actions (Algorithm 1)
  • The focus of this paper is on contextual bandit algorithms with computational efficiency guarantees; in Appendix D, we present several extensions of our results to general policy classes
  • We evaluate our approach on six large-scale regression datasets, where regression predictions are treated as continuous actions in A = [0, 1]
  • Contextual bandit learning for continuous actions with unknown structure is quite tractable via the CATS algorithm, as we have shown theoretically and empirically
  • Our study of efficient contextual bandits with continuous actions can be applied to a wide range of applications, such as precision medicine, personalized recommendations, data center optimization, operating systems, networking, etc
Methods
  • The authors evaluate the approach on six large-scale regression datasets, where regression predictions are treated as continuous actions in A = [0, 1].
  • To simulate contextual bandit learning, the authors first perform scaling and offsetting to ensure the yt's are in [0, 1].
  • Every regression example (xt, yt) is converted to a bandit example (xt, ℓt), where ℓt(a) = |a − yt| is the absolute loss induced by yt (see the simulation sketch after this list).
  • When action at is taken, the algorithm receives bandit feedback ℓt(at), as opposed to the usual label yt.
  • The authors include a synthetic dataset ds, created by a linear regression model with additive Gaussian noise.
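A minimal sketch of this simulation, under the assumptions just listed (targets scaled to [0, 1], absolute loss, bandit feedback only at the chosen action). The function names are illustrative, not from the paper.

    def make_bandit_example(x, y, y_min, y_max):
        """Scale/offset the regression target into [0, 1] and build the loss."""
        y01 = (y - y_min) / (y_max - y_min)
        loss = lambda a: abs(a - y01)        # absolute loss induced by y_t
        return x, loss

    def bandit_round(policy, x, loss):
        """One simulated round: only the loss of the chosen action is revealed."""
        a_t = policy(x)                      # continuous action in A = [0, 1]
        return a_t, loss(a_t)                # bandit feedback, not the label y_t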
Conclusion
  • The smoothing approach has several appealing properties. The authors look for a good interval of actions, which is possible even when the best single action is impossible to find (a small illustration of this smoothing follows this list).
  • The approach is principled, leading to specific, interpretable guarantees. Contextual bandit learning for continuous actions with unknown structure is quite tractable via the CATS algorithm, as the authors have shown theoretically and empirically.
  • The authors' study of efficient contextual bandits with continuous actions can be applied to a wide range of applications, such as precision medicine, personalized recommendations, data center optimization, operating systems, networking, etc
  • Many of these applications have potential for significant positive impact to society, but these methods can cause unintended harms, for example by creating filter bubble effects when deployed in recommendation engines.
  • The authors are certainly mindful of these issues, and encourage practitioners to consider these consequences when deploying interactive learning systems
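To make the "good interval of actions" idea above concrete, here is a small, hedged illustration of the smoothed objective (cf. the smoothing approach of [35]): instead of the loss at a single action a, one averages the loss over [a − h, a + h] ∩ [0, 1]. The numerical grid approximation below is ours, not the paper's.

    import numpy as np

    def smoothed_loss(loss_fn, a, h, grid=1001):
        """Average loss over [a-h, a+h] ∩ [0, 1], approximated on a uniform grid."""
        lo, hi = max(a - h, 0.0), min(a + h, 1.0)
        pts = np.linspace(lo, hi, grid)
        return float(np.mean(loss_fn(pts)))

    # Example with the absolute loss around y_t = 0.4: the smoothed loss at the
    # optimum is small but nonzero, reflecting that a whole interval is played.
    loss_fn = lambda a: np.abs(a - 0.4)
    print(smoothed_loss(loss_fn, a=0.4, h=0.1))   # ≈ 0.05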
Related work
  • Contextual bandits are quite well-understood for small, discrete action spaces, with rich theoretical results and successful deployments in practice. To handle large or infinite action spaces, most prior work either makes strong parametric assumptions such as linearity, or posits some continuity assumptions such as Lipschitzness. More background can be found in [16, 47, 38].

    Bandits with Lipschitz assumptions were introduced in [5], and optimally solved in the worst case by [31]. [32, 33, 17, 46] achieve optimal data-dependent regret bounds, while several papers relax global smoothness assumptions with various local definitions [7, 32, 33, 17, 45, 41, 25]. This literature mainly focuses on the non-contextual version, except for [46, 34, 18, 52] (which only consider a fixed policy set Π). As argued in [35], the smoothing-based approach is productive in these settings, and extends far beyond, e.g., to instances where the global optimum is at a discontinuity.
Reference
  • Naoki Abe, Alan W Biermann, and Philip M Long. Reinforcement learning with immediate rewards and linear hypotheses. Algorithmica, 37(4):263–293, 2003.
  • Alekh Agarwal, Sarah Bird, Markus Cozowicz, Luong Hoang, John Langford, Stephen Lee, Jiaji Li, Dan Melamed, Gal Oshri, Oswaldo Ribas, Siddhartha Sen, and Alex Slivkins. Making contextual decisions with low technical debt. arxiv:1606.03966, 2017.
  • Alekh Agarwal, Daniel Hsu, Satyen Kale, John Langford, Lihong Li, and Robert Schapire. Taming the monster: A fast and simple algorithm for contextual bandits. In International Conference on Machine Learning, 2014.
  • Alekh Agarwal, Haipeng Luo, Behnam Neyshabur, and Robert E Schapire. Corralling a band of bandit algorithms. In Conference on Learning Theory, 2017.
  • Rajeev Agrawal. The continuum-armed bandit problem. SIAM Journal on Control and Optimization, 1995.
  • Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, and Robert E Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 2002.
  • Peter Auer, Ronald Ortner, and Csaba Szepesvari. Improved rates for the stochastic continuumarmed bandit problem. In Conference on Learning Theory, 2007.
  • Peter L Bartlett, Varsha Dani, Thomas Hayes, Sham Kakade, Alexander Rakhlin, and Ambuj Tewari. High-probability regret bounds for bandit online linear optimization. In Conference on Learning Theory, 2008.
  • Alina Beygelzimer and John Langford. The offset tree for learning with partial labels. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 129–138, 2009.
  • Alina Beygelzimer, John Langford, and Pradeep Ravikumar. Error-correcting tournaments. In International Conference on Algorithmic Learning Theory, pages 247–262, 2009.
  • Alina Beygelzimer, John Langford, and Bianca Zadrozny. Weighted one-against-all. In Proceedings of the 20th National Conference on Artificial Intelligence (AAAI), 2005.
  • Guy Blanc, Jane Lange, and Li-Yang Tan. Top-down induction of decision trees: rigorous guarantees and inherent limitations. arXiv preprint arXiv:1911.07375, 2019.
  • Avrim Blum. Rank-r decision trees are a subclass of r-decision lists. Information Processing Letters, 42(4):183–185, 1992.
  • Avrim Blum, Adam Kalai, and John Langford. Beating the hold-out: Bounds for k-fold and progressive cross-validation. In Proceedings of the Twelfth Annual Conference on Computational Learning Theory, COLT, pages 203–208, 1999.
  • Alon Brutzkus, Amit Daniely, and Eran Malach. On the optimality of trees generated by id3. arXiv preprint arXiv:1907.05444, 2019.
  • Sebastien Bubeck, Nicolo Cesa-Bianchi, et al. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 2012.
  • Sebastien Bubeck, Remi Munos, Gilles Stoltz, and Csaba Szepesvari. X-armed bandits. Journal of Machine Learning Research, 2011.
  • Nicolo Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, and Sebastien Gerchinovitz. Algorithmic chaining and the role of partial feedback in online nonparametric learning. In Conference on Learning Theory, 2017.
  • Guanhua Chen, Donglin Zeng, and Michael R Kosorok. Personalized dose finding using outcome weighted learning. Journal of the American Statistical Association, 111(516):1509–1521, 2016.
  • Miroslav Dudik, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, and Tong Zhang. Efficient optimal learning for contextual bandits. In Uncertainty in Artificial Intelligence, 2011.
  • Miroslav Dudík, John Langford, and Lihong Li. Doubly robust policy evaluation and learning. In Proceedings of the 28th International Conference on Machine Learning, pages 1097–1104, 2011.
  • Andrzej Ehrenfeucht and David Haussler. Learning decision trees from random examples. Information and Computation, 82(3):231–246, 1989.
  • David A Freedman. On tail probabilities for martingales. The Annals of Probability, pages 100–118, 1975.
  • Parikshit Gopalan, Adam Tauman Kalai, and Adam R Klivans. Agnostically learning decision trees. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 527–536, 2008.
  • Jean-Bastien Grill, Michal Valko, and Remi Munos. Black-box optimization of noisy functions with unknown smoothness. In Advances in Neural Information Processing Systems, 2015.
  • Thomas Hancock, Tao Jiang, Ming Li, and John Tromp. Lower bounds on learning decision lists and trees. Information and Computation, 126(2):114–122, 1996.
  • Daniel G Horvitz and Donovan J Thompson. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47(260):663–685, 1952.
  • Nathan Jay, Noga Rotman, Brighten Godfrey, Michael Schapira, and Aviv Tamar. A deep reinforcement learning perspective on internet congestion control. In International Conference on Machine Learning, pages 3050–3059, 2019.
  • Nathan Kallus and Angela Zhou. Policy evaluation and optimization with continuous treatments. In International Conference on Artificial Intelligence and Statistics, pages 1243–1251, 2018.
  • TE Klein, RB Altman, Niclas Eriksson, BF Gage, SE Kimmel, MT Lee, NA Limdi, D Page, DM Roden, MJ Wagner, et al. Estimation of the warfarin dose with clinical and pharmacogenetic data. New England Journal of Medicine, 360(8):753–764, 2009.
  • Robert Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In Advances in Neural Information Processing Systems, 2004.
  • Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-armed bandits in metric spaces. In Symposium on Theory of Computing, 2008.
  • Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Bandits and experts in metric spaces. Journal of the ACM, 2019. To appear. Merged and revised version of conference papers in ACM STOC 2008 and ACM-SIAM SODA 2010. Also available at http://arxiv.org/abs/1312.1277.
  • Andreas Krause and Cheng S. Ong. Contextual gaussian process bandit optimization. In J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 24, pages 2447–2455. Curran Associates, Inc., 2011.
  • Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins, and Chicheng Zhang. Contextual bandits with continuous actions: smoothing, zooming, and adapting. In Conference on Learning Theory, 2019.
  • Eyal Kushilevitz and Yishay Mansour. Learning decision trees using the fourier spectrum. SIAM Journal on Computing, 22(6):1331–1348, 1993.
  • John Langford and Tong Zhang. The epoch-greedy algorithm for contextual multi-armed bandits. In Advances in Neural Information Processing Systems, 2007.
  • Tor Lattimore and Csaba Szepesvari. Bandit algorithms. preprint, 2018.
  • Nevena Lazic, Craig Boutilier, Tyler Lu, Eehern Wong, Binz Roy, MK Ryu, and Greg Imwalle. Data center cooling using model-predictive control. In Advances in Neural Information Processing Systems, pages 3814–3823, 2018.
  • Lihong Li, Wei Chu, John Langford, and Robert E Schapire. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th international conference on World wide web, pages 661–670. ACM, 2010.
  • Stanislav Minsker. Estimation of extreme values and associated level sets of a regression function via selective sampling. In Conference on Learning Theory, 2013.
  • Francesco Orabona and David Pal. Coin betting and parameter-free online learning. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, pages 577–585, 2016.
  • Alexander Rakhlin and Karthik Sridharan. Bistro: An efficient relaxation-based method for contextual bandits. In ICML, pages 1977–1985, 2016.
  • Ronald L Rivest. Learning decision lists. Machine learning, 2(3):229–246, 1987.
  • Aleksandrs Slivkins. Multi-armed bandits on implicit metric spaces. In Advances in Neural Information Processing Systems, 2011.
  • Aleksandrs Slivkins. Contextual bandits with similarity information. The Journal of Machine Learning Research, 2014.
  • Aleksandrs Slivkins. Introduction to multi-armed bandits. Foundations and Trends in Machine Learning, 12(1-2):1–286, 2019.
  • Adith Swaminathan and Thorsten Joachims. Counterfactual risk minimization: Learning from logged bandit feedback. In International Conference on Machine Learning, pages 814–823, 2015.
  • Vasilis Syrgkanis, Akshay Krishnamurthy, and Robert Schapire. Efficient algorithms for adversarial contextual learning. In International Conference on Machine Learning, pages 2159– 2168, 2016.
  • Ambuj Tewari and Susan A Murphy. From ads to interventions: Contextual bandits in mobile health. In Mobile Health, pages 495–517.
  • Vladimir Vapnik. The nature of statistical learning theory. Springer science & business media, 1995.
  • Tianyu Wang, Weicheng Ye, Dawei Geng, and Cynthia Rudin. Towards practical lipschitz stochastic bandits. arXiv preprint arXiv:1901.09277, 2019.
  • 2. Instead of finding a policy π that approximately minimizes Vt(πh) for a fixed h, the algorithm first finds an approximate minimizer of Vt(πh) for every h ∈ H (namely πt,h), and selects πt from the set {πt,h : h ∈ H} using a structural risk minimization [51] procedure (line 9). Specifically, the choice of ht+1 ensures that the expected loss of πt+1,ht+1 is competitive with those of the πh's, for all π in Π and all h in H.
  • 2. The above uniform-over-h smoothed regret rate in terms of h and T, i.e. O(T^(2/3)/h^(1/3)), is unimprovable in general, and is therefore Pareto optimal. This can be seen from the following result from [35, Theorem 11]: there exists a continuous-action CB problem with action space [0, 1] and constants c, T0 > 0 such that for any algorithm and any T ≥ T0, there exist two bandwidths h1
  • 2. Given a finite set of policies Π, with probability 1 − δ, for all π in Π, the deviation Vt(πh) − V(πh) is bounded by a term scaling with ln |Π| + ln t, where Vt(πh) is the smoothed inverse-propensity (IPS) estimate built from terms of the form ℓ(at) · 1(|a − at| ≤ h) / (vol([a − h, a + h] ∩ [0, 1]) · Pt(at)). (A small numerical sketch of this estimator appears at the end of this list.)
  • Here vol(·) denotes the Lebesgue measure. Therefore, if, say, at is in [0, h], the induced IPS cost function ct can take many possible positive values for a in the region [0, h], depending on the value of vol([a − h, a + h] ∩ [0, 1]). It turns out that enforcing the piecewise-constant structure of the cost vector (as is done by restricting the CSMC vectors to only consider entries a in AK ∩ [h, 1 − h]) is vital to achieving O(log K) per-example training time.
  • 2. If αd.id < v.id < βd.id, then for all a ∈ range(Tv), c(a) = c∗.
  • 3. If v.cost is available, it must equal c(Tv(x)); in addition, Return cost(v, αd, βd) returns c(Tv(x)) correctly.
  • 1. If v ≠ αd and v ≠ βd, then from the first two items we have just shown, we can decide the value of cv(Tv(x)) directly by comparison with the id's of α and β, which is consistent with the implementation of Return cost; also note that in this case, v.cost gets assigned to Return cost(v, αd, βd), which is also cv(Tv(x)).
  • 2. Otherwise, v = αd or v = βd. In this case, Return cost returns the stored cost of v, i.e. v.cost. It suffices to show that αd.cost (resp. βd.cost) is indeed c(Tαd(x)) (resp. c(Tβd(x))), which we show by induction. Base case: when d = D, αD.cost = α.cost (resp. βD.cost = β.cost) is directly calculated in line 2 of Algorithm 10, and is indeed c(label(α)) = cv(α) (resp. c(label(β)) = cv(β)).
  • 2. Generate the ε-greedy action distribution, take an action, and create (xt, ct) implicitly by representing ct as (amin, amax, c∗): these steps take O(1) time, as they are based on manipulations of a piecewise-constant density with only 3 pieces.
  • 3. Online train tree(T, (xt, ct)): this takes O(D) = O(log K) time, because at each of the D levels, there are at most 2 nodes to be updated, and for every such node, Return cost takes O(1) time to retrieve the costs of both subtrees.
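As referenced above, the following is a small, hedged sketch of the smoothed ε-greedy action density (piecewise constant with at most 3 pieces) and the corresponding smoothed inverse-propensity (IPS) loss estimate described in the fragments above. The function names and this Python rendering are ours; the exact constants and notation in the paper's estimator may differ.

    import numpy as np

    def interval_vol(center, h):
        """Length of [center-h, center+h] ∩ [0, 1] (Lebesgue measure)."""
        return min(center + h, 1.0) - max(center - h, 0.0)

    def smoothed_density(a, a_hat, h, eps):
        """ε-greedy smoothed density P_t(a): Uniform[0, 1] with prob. ε, else
        Uniform on [a_hat-h, a_hat+h] ∩ [0, 1]; piecewise constant, ≤ 3 pieces."""
        p = eps * 1.0                                  # exploration component
        if abs(a - a_hat) <= h:                        # exploitation component
            p += (1.0 - eps) / interval_vol(a_hat, h)
        return p

    def sample_action(a_hat, h, eps, rng):
        """Draw an action from the ε-greedy smoothed distribution."""
        if rng.random() < eps:
            return rng.uniform(0.0, 1.0)
        return rng.uniform(max(a_hat - h, 0.0), min(a_hat + h, 1.0))

    def ips_loss_estimate(a_pi, a_t, loss_at, h, p_t):
        """Smoothed IPS estimate: loss(a_t) · 1(|a_pi - a_t| ≤ h)
        / (vol([a_pi-h, a_pi+h] ∩ [0, 1]) · P_t(a_t))."""
        if abs(a_pi - a_t) > h:
            return 0.0
        return loss_at / (interval_vol(a_pi, h) * p_t)

    # Usage: log one round, then estimate the loss of a candidate action a_pi.
    rng = np.random.default_rng(0)
    a_hat, h, eps = 0.7, 0.1, 0.05
    a_t = sample_action(a_hat, h, eps, rng)
    p_t = smoothed_density(a_t, a_hat, h, eps)
    print(ips_loss_estimate(a_pi=0.65, a_t=a_t, loss_at=0.3, h=h, p_t=p_t))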