Variational Inference Mpc Using Tsallis Divergence

Ziyi Wang,Oswin So,Jason Gibson,Bogdan Vlahov,Manan S. Gandhi,Guan-Horng Liu,Evangelos A. Theodorou

ROBOTICS: SCIENCE AND SYSTEM XVII（2021）

引用 8|浏览13

暂无评分

摘要

In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using the non-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function, a novel Tsallis Variational Inference-Model Predictive Control algorithm is derived, which includes prior works such as Variational Inference-Model Predictive Control, Model Predictive Path Integral Control, Cross Entropy Method, and Stein Variational Inference Model Predictive Control as special cases. The proposed algorithm allows for effective control of the cast/reward transform and is characterized by superior performance in terms of mean and variance reduction of the associated cost. The aforementioned features are supported by a theoretical and numerical analysis on the level of risk sensitivity of the proposed algorithm as well as simulation experiments on 5 different robotic systems with 3 different policy parameterizations.

查看译文

关键词

mpc,divergence,inference

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要