A Simple Online Algorithm for Competing with Dynamic Comparators

UAI(2020)

引用 14|浏览30
暂无评分
摘要
Online learning in dynamic environments has recently drawn considerable attention, where dynamic regret is usually employed to compare decisions of online algorithms to dynamic comparators. In previous works, dynamic regret bounds are typically established in terms of regularity of comparators or that of online functions . Recently, Jadbabaie et al.[2015] propose an algorithm that can take advantage of both regularities and enjoy an $\tilde {O}(\sqrt {1+ D_T}+\min\{\sqrt {(1+ D_T) C_T},(1+ D_T)^{1/3} V_T^{1/3} T^{1/3}\}) $ dynamic regret, where is an additional quantity to measure the niceness of environments. The regret bound adapts to the smaller regularity of problem environments and is tighter than all existing dynamic regret guarantees. Nevertheless, their algorithm involves non-convex programming at each iteration, and thus requires burdensome computations. In this paper, we design a simple algorithm based on the online ensemble, which provably enjoys the same (even slightly stronger) guarantee as the state-of-the-art rate, yet is much more efficient because our algorithm does not involve any non-convex problem solving. Empirical studies also verify the efficacy and efficiency.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要