Regret-Optimal Controller for the Full-Information Problem

2021 AMERICAN CONTROL CONFERENCE (ACC)(2021)

引用 10|浏览0
暂无评分
摘要
We consider the infinite-horizon, discrete-time full-information control problem. Motivated by learning theory, as a criterion for controller design we focus on regret, defined as the difference between the linear quadratic regulator (LQR) cost of a causal controller (that has only access to past and current disturbances) and the LQR cost of a clairvoyant one (that has also access to future disturbances). In the full-information setting, there is a unique optimal non-causal controller that in terms of LQR cost dominates all other controllers, and we focus on the regret compared to this particular controller. Since the regret itself is a function of the disturbances, we consider the worst-case regret over all possible bounded energy disturbances, and propose to find a causal controller that minimizes this worst-case regret. The resulting controller has the interpretation of guaranteeing the smallest possible regret compared to the best non-causal controller that has can see the future, no matter what the disturbances are. We show that the regret-optimal control problem can be reduced to a Nehari extension problem, i.e., to approximate an anticausal operator with a causal one in the operator norm. In the state-space setting we obtain explicit formulas for the optimal regret and for the regret-optimal controller. The regret-optimal controller is the sum of the classical H-2 control law and an n-th order controller (where n is the state dimension of the plant) obtained from the Nehari problem. The controller construction simply requires the solution to the standard LQR Riccati equation, in addition to two Lyapunov equations. Simulations over a range of plants demonstrates that the regret-optimal controller interpolates nicely between the H-2 and the H-infinity optimal controllers, and generally has H-2 and H-infinity costs that are simultaneously close to their optimal values. The regret-optimal controller thus presents itself as a viable option for control system design.
更多
查看译文
关键词
full-information control problem,controller design,causal controller,LQR cost,worst-case regret,regret-optimal controller,control system design,optimal noncausal controller,H2 control law,H∞ optimal controllers,Lyapunov equations,Nehari extension problem
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要