The Concept of Criticality in Reinforcement Learning
arXiv: Learning, Volume abs/1810.07254, 2019, Pages 251-258.
Reinforcement learning methods carry a well known bias-variance trade-off in n-step algorithms for optimal control. Unfortunately, this has rarely been addressed in current research. This trade-off principle holds independent of the choice of the algorithm, such as n-step SARSA, n-step Expected SARSA or n-step Tree backup. A small n resul...More
Full Text (Upload PDF)
PPT (Upload PPT)