Independence-aware Advantage Estimation
2019.
Abstract:
Most of existing advantage function estimation methods in reinforcement learning suffer from the problem of high variance, which scales unfavorably with the time horizon. To address this challenge, we propose to identify the independence property between current action and future states in environments, which can be further leveraged to e...More
Code:
Data:
Tags
Comments