D4rl: Datasets for deep data driven reinforcement learning
arXiv preprint arXiv:2004.07219, 2020.
The offline reinforcement learning (RL) problem, also referred to as batch RL, refers to the setting where a policy must be learned from a dataset of previously collected data, without additional online data collection. In supervised learning, large datasets and complex deep neural networks have fueled impressive progress, but in contrast...More
Full Text (Upload PDF)
PPT (Upload PPT)