Parameter-Free Version of Adaptive Gradient Methods for Strongly-Convex Functions
Tuning adaptive gradient methods on λ-strongly convex functions requires knowledge of the strong-convexity parameter λ and a suitable learning rate η, neither of which is typically known in advance. In this paper, we adapt a universal algorithm along the lines of MetaGrad to remove this dependence on λ and η. The main idea is to run multiple experts concurrently, each with its own parameter setting, and to combine their predictions through a master algorithm. The master enjoys an O(d log T) regret bound.
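To make the expert/master idea concrete, below is a minimal Python sketch of a MetaGrad-style aggregation under some assumptions: each expert runs plain projected gradient descent with a learning rate drawn from a geometric grid, the master plays the exponentially weighted average of the expert iterates, and expert weights are updated with a MetaGrad-style quadratic surrogate loss. The names (`metagrad_style_master`, `grad_fn`, `domain_radius`), the grid, the surrogate, and the projection onto a Euclidean ball are all illustrative choices, not the paper's exact algorithm.

```python
import numpy as np

def metagrad_style_master(grad_fn, x0, T, n_experts=5, domain_radius=1.0):
    """Sketch of a MetaGrad-style master aggregating gradient-descent experts.

    grad_fn: maps a point x to a (sub)gradient of the loss at x.
    x0: starting point, a 1-D numpy array.
    All parameter names and defaults here are assumptions for illustration.
    """
    etas = np.array([2.0 ** -i for i in range(n_experts)])  # geometric grid of learning rates
    experts = np.tile(x0, (n_experts, 1)).astype(float)     # one iterate per expert
    log_w = np.zeros(n_experts)                             # master's log-weights over experts
    x_master = x0.astype(float).copy()

    for _ in range(T):
        # Master prediction: exponentially weighted average of expert iterates.
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
        x_master = w @ experts

        # One gradient query per round, taken at the master's point and
        # shared by all experts (the standard MetaGrad device).
        g = grad_fn(x_master)

        # MetaGrad-style surrogate loss for expert i with learning rate eta_i:
        #   ell_i = eta_i * <g, x_i - x_master> + (eta_i * <g, x_i - x_master>)^2
        diffs = experts @ g - x_master @ g
        surrogate = etas * diffs + (etas * diffs) ** 2
        log_w -= surrogate  # exponential-weights update on the surrogate

        # Each expert takes its own gradient step, projected back onto the
        # Euclidean ball of radius `domain_radius` (a stand-in domain).
        experts -= np.outer(etas, g)
        norms = np.linalg.norm(experts, axis=1)
        mask = norms > domain_radius
        experts[mask] *= (domain_radius / norms[mask])[:, None]

    return x_master

# Example: minimize f(x) = ||x - x_star||^2 (a 2-strongly convex function).
x_star = np.array([0.3, -0.4])
x_hat = metagrad_style_master(lambda x: 2.0 * (x - x_star), np.zeros(2), T=500)
```

Note that only O(log T) experts are needed to cover the relevant range of learning rates, and all experts share a single gradient evaluated at the master's prediction, so the per-round cost stays close to that of a single adaptive gradient method.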