Feature Reinforcement Learning Using Looping Suffix Trees.

EWRL (2012)

Abstract
There has recently been much interest in history-based methods using suffix trees to solve POMDPs. However, these suffix trees cannot efficiently represent environments that have long-term dependencies. We extend the recently introduced CTΦMDP algorithm to the space of looping suffix trees, which have previously only been used in solving deterministic POMDPs. The resulting algorithm replicates results from CTΦMDP for environments with short-term dependencies, while it outperforms LSTM-based methods on TMaze, a deep memory environment.
Key words
Feature Extraction, Approximate Matching