
In Introductory Lectures on Convex Optimization by Yurii Nesterov, Section 1.2.3 shows that gradient descent is guaranteed to converge if the step size is either fixed or chosen by the Goldstein-Armijo rule.
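
For reference, here is roughly what I understand by that rule: a minimal backtracking sketch of the Armijo sufficient-decrease condition. This is my own illustration, close to but not exactly Nesterov's two-sided Goldstein-Armijo rule, and the quadratic test function and the constants `alpha0`, `beta`, `c` are arbitrary choices of mine.

```python
import numpy as np

def backtracking_armijo_step(f, grad_f, x, alpha0=1.0, beta=0.5, c=1e-4):
    """One gradient step with backtracking: shrink alpha until the Armijo
    sufficient-decrease condition f(x - alpha*g) <= f(x) - c*alpha*||g||^2 holds."""
    g = grad_f(x)
    alpha = alpha0
    while f(x - alpha * g) > f(x) - c * alpha * np.dot(g, g):
        alpha *= beta
    return x - alpha * g

# Toy usage on a smooth quadratic (chosen only for illustration).
A = np.diag([1.0, 10.0])
f = lambda x: 0.5 * x @ A @ x
grad_f = lambda x: A @ x

x = np.array([5.0, -3.0])
for _ in range(100):
    x = backtracking_armijo_step(f, grad_f, x)
print("gradient norm:", np.linalg.norm(grad_f(x)))
```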

Do you know if there exist conditions on a variable step size (upper/lower bounds?) which guarantee the convergence of gradient descent? (The conditions need not guarantee convergence to a local minimum, just to a stationary point.) Thank you!
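
To make the setting concrete, here is a minimal sketch of the kind of variable step size I have in mind. The quadratic test function, the constant $L$, and the interval $[\varepsilon/L, (2-\varepsilon)/L]$ are only illustrative assumptions on my part; whether bounds like these (or a limit on how fast the step size may decay) are sufficient in general is exactly what I am asking.

```python
import numpy as np

# Illustrative L-smooth objective: f(x) = 0.5 * x^T A x, so L = largest eigenvalue of A.
A = np.diag([1.0, 10.0])
L = np.max(np.linalg.eigvalsh(A))

def grad(x):
    return A @ x

rng = np.random.default_rng(0)
x = np.array([5.0, -3.0])
eps = 0.05

for k in range(500):
    # The step size varies from iteration to iteration but stays inside
    # [eps/L, (2 - eps)/L], i.e. bounded away from 0 and from 2/L.
    # (Drawn at random here only to model a step size I do not control
    # beyond the interval bounds.)
    alpha = rng.uniform(eps / L, (2 - eps) / L)
    x = x - alpha * grad(x)

# In this toy run the gradient norm shrinks to essentially zero.
print("gradient norm:", np.linalg.norm(grad(x)))
```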

  • Can you be more specific? I am sure there is plenty of literature out there already. Do you mean to choose the LR arbitrarily at each step but within a certain interval? Commented Oct 14, 2024 at 21:31
  • Not exactly - I'm working on a project for which I already have an upper bound on the learning rate, and I want to ensure that gradient descent will still converge even if my upper bound gets small. Do you know if there are conditions on a variable step size, for example a maximum rate at which it decays, for which GD can still converge to a stationary point? Commented Oct 15, 2024 at 5:40
  • I think the usual condition will be that the step sizes are not summable. Check, for example, Bertsekas' Nonlinear Programming, Proposition 1.2.3. Commented Oct 15, 2024 at 14:04
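
Following up on the last comment, here is a minimal sketch of the diminishing, non-summable step-size rule it points to. The specific schedule $\alpha_k = a/(k+1)$, the test function, and the constants are my own illustration and are not taken from Bertsekas' statement of Proposition 1.2.3.

```python
import numpy as np

# Same illustrative quadratic as above: f(x) = 0.5 * x^T A x.
A = np.diag([1.0, 10.0])

def grad(x):
    return A @ x

x = np.array([5.0, -3.0])
a = 0.5
print("initial gradient norm:", np.linalg.norm(grad(x)))

for k in range(20000):
    # alpha_k -> 0 while sum_k alpha_k = infinity (harmonic series):
    # the classical diminishing step-size condition.
    alpha = a / (k + 1)
    x = x - alpha * grad(x)

# The gradient norm decreases toward zero, though far more slowly than
# with step sizes bounded away from zero (and the first few iterations
# may overshoot, since a/(k+1) can initially exceed 2/L).
print("final gradient norm:", np.linalg.norm(grad(x)))
```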
