Reinforcement learning with formal performance metrics for quadcopter attitude control under non-nominal contexts

Nicola Bernini, Mikhail Bessa, Remi Delmas, Arthur Gold, Eric Goubault, Romain Pennec, Sylvie Putot, Francois Sillion

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2024)


Abstract

We explore the reinforcement learning approach to controller design through an extensive case study of a quadcopter attitude controller. We provide all the details needed to reproduce our approach, starting with a model of the dynamics of a Crazyflie 2.0 under various nominal and non-nominal conditions, including partial motor failures and wind gusts. We develop a robust form of signal temporal logic to quantitatively evaluate the vehicle's behavior and measure the performance of controllers. The paper thoroughly describes the choices of training algorithm, neural network architecture, hyperparameters, and observation space in view of the different performance metrics we have introduced. We discuss the robustness of the obtained controllers, both to partial loss of power for one rotor and to wind gusts, and finish by drawing conclusions on practical controller design by reinforcement learning.


Keywords

Reinforcement learning, Control, Quadcopter dynamics, Performance metrics, Temporal logics
