Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

Arnab Kumar Mondal, Siba Smarak Panigrahi,Sai Rajeswar,Kaleem Siddiqi,Siamak Ravanbakhsh

arXiv (Cornell University)（2023）

引用 0|浏览11

暂无评分

摘要

The accurate modeling of dynamics in interactive environments is critical for successful long-range prediction. Such a capability could advance Reinforcement Learning (RL) and Planning algorithms, but achieving it is challenging. Inaccuracies in model estimates can compound, resulting in increased errors over long horizons. We approach this problem from the lens of Koopman theory, where the nonlinear dynamics of the environment can be linearized in a high-dimensional latent space. This allows us to efficiently parallelize the sequential problem of long-range prediction using convolution, while accounting for the agent's action at every time step. Our approach also enables stability analysis and better control over gradients through time. Taken together, these advantages result in significant improvement over the existing approaches, both in the efficiency and the accuracy of modeling dynamics over extended horizons. We also report promising experimental results in dynamics modeling for the scenarios of both model-based planning and model-free RL.

查看译文

关键词

interactive environments,efficient dynamics modeling,koopman

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要