Independence-aware Advantage Estimation

Pushi Zhang
Guoqing Liu
Jiang Bian
Minglie Huang
Tie-Yan Liu

2019.

Abstract:

Most existing advantage function estimation methods in reinforcement learning suffer from high variance, which scales unfavorably with the time horizon. To address this challenge, we propose to identify the independence property between the current action and future states in environments, which can be further leveraged to e...
