DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Abstract:
We introduce DREAM, a deep reinforcement learning algorithm that finds optimal strategies in imperfect-information games with multiple agents. Formally, DREAM converges to a Nash Equilibrium in two-player zero-sum games and to an extensive-form coarse correlated equilibrium in all other games. Our primary innovation is an effective algo...More
Code:
Data:
Tags
Comments