Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility Problems

arxiv(2024)

引用 0|浏览0
暂无评分
摘要
In this paper we provide oracle complexity lower bounds for finding a point in a given set using a memory-constrained algorithm that has access to a separation oracle. We assume that the set is contained within the unit d-dimensional ball and contains a ball of known radius ϵ>0. This setup is commonly referred to as the feasibility problem. We show that to solve feasibility problems with accuracy ϵ≥ e^-d^o(1), any deterministic algorithm either uses d^1+δ bits of memory or must make at least 1/(d^0.01δϵ^21-δ/1+1.01 δ-o(1)) oracle queries, for any δ∈[0,1]. Additionally, we show that randomized algorithms either use d^1+δ memory or make at least 1/(d^2δϵ^2(1-4δ)-o(1)) queries for any δ∈[0,1/4]. Because gradient descent only uses linear memory 𝒪(dln 1/ϵ) but makes Ω(1/ϵ^2) queries, our results imply that it is Pareto-optimal in the oracle complexity/memory tradeoff. Further, our results show that the oracle complexity for deterministic algorithms is always polynomial in 1/ϵ if the algorithm has less than quadratic memory in d. This reveals a sharp phase transition since with quadratic 𝒪(d^2 ln1/ϵ) memory, cutting plane methods only require 𝒪(dln 1/ϵ) queries.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要