DREAM: Deep Regret minimization with Advantage baselines and Model-free learning

Eric Steinberger
Eric Steinberger
Cited by: 0|Bibtex|Views14
Other Links: arxiv.org

Abstract:

We introduce DREAM, a deep reinforcement learning algorithm that finds optimal strategies in imperfect-information games with multiple agents. Formally, DREAM converges to a Nash Equilibrium in two-player zero-sum games and to an extensive-form coarse correlated equilibrium in all other games. Our primary innovation is an effective algo...More

Code:

Data:

Your rating :
0

 

Tags
Comments