Counting to Explore and Generalize in Text-based Games.

Xingdi Yuan,Marc-Alexandre Côté,Alessandro Sordoni,Romain Laroche,Remi Tachet des Combes,Matthew J. Hausknecht,Adam Trischler

arXiv: Computation and Language（2018）

引用 60|浏览201

暂无评分

摘要

We propose a recurrent RL agent with an episodic exploration mechanism that helps discovering good policies in text-based game environments. We show promising results on a set of generated text-based games of varying difficulty where the goal is to collect a coin located at the end of a chain of rooms. In contrast to previous text-based RL approaches, we observe that our agent learns policies that generalize to unseen games of greater difficulty.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要