Fitted Q-Learning for Relational Domains

Das Srijita,Natarajan Sriraam,Roy Kaushik,Parr Ronald,Kersting Kristian

CoRR（2020）

引用 10|浏览51

暂无评分

摘要

We consider the problem of Approximate Dynamic Programming in relational domains. Inspired by the success of fitted Q-learning methods in propositional settings, we develop the first relational fitted Q-learning algorithms by representing the value function and Bellman residuals. When we fit the Q-functions, we show how the two steps of Bellman operator; application and projection steps can be performed using a gradient-boosting technique. Our proposed framework performs reasonably well on standard domains without using domain models and using fewer training trajectories.

查看译文

关键词

q-learning

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要