Alon Cohen

research

∙ 08/28/2023

Rate-Optimal Policy Optimization for Linear Markov Decision Processes

We study regret minimization in online episodic linear Markov Decision P...

0 Uri Sherman, et al. ∙

research

∙ 08/24/2023

APART: Diverse Skill Discovery using All Pairs with Ascending Reward and DropouT

We study diverse skill discovery in reward-free environments, aiming to ...

0 Hadar Schreiber Galler, et al. ∙

research

∙ 03/02/2023

Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation

We present the OMG-CMDP! algorithm for regret minimization in adversaria...

0 Orin Levy, et al. ∙

research

∙ 11/27/2022

Counterfactual Optimism: Rate Optimal Regret for Stochastic Contextual MDPs

We present the UC^3RL algorithm for regret minimization in Stochastic Co...

0 Orin Levy, et al. ∙

research

∙ 06/03/2022

Rate-Optimal Online Convex Optimization in Adaptive Linear Control

We consider the problem of controlling an unknown linear dynamical syste...

8 Asaf Cassel, et al. ∙

research

∙ 03/02/2022

Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics

We consider the problem of controlling an unknown linear dynamical syste...

3 Asaf Cassel, et al. ∙

research

∙ 06/22/2021

Asynchronous Stochastic Optimization Robust to Arbitrary Delays

We consider stochastic optimization with delayed gradients where, at eac...

10 Alon Cohen, et al. ∙

research

∙ 03/24/2021

Minimax Regret for Stochastic Shortest Path

We study the Stochastic Shortest Path (SSP) problem in which an agent ha...

15 Alon Cohen, et al. ∙

research

∙ 01/31/2021

Online Markov Decision Processes with Aggregate Bandit Feedback

We study a novel variant of online finite-horizon Markov Decision Proces...

10 Alon Cohen, et al. ∙

research

∙ 02/23/2020

Near-optimal Regret Bounds for Stochastic Shortest Path

Stochastic shortest path (SSP) is a well-known problem in planning and c...

9 Alon Cohen, et al. ∙

research

∙ 02/19/2020

Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently

We consider the problem of learning in Linear Quadratic Control systems ...

4 Asaf Cassel, et al. ∙

research

∙ 11/05/2019

Apprenticeship Learning via Frank-Wolfe

We consider the applications of the Frank-Wolfe (FW) algorithm for Appre...

0 Tom Zahavy, et al. ∙

research

∙ 05/23/2019

Average reward reinforcement learning with unknown mixing times

We derive and analyze learning algorithms for policy evaluation, apprent...

0 Tom Zahavy, et al. ∙

research

∙ 02/17/2019

Learning Linear-Quadratic Regulators Efficiently with only √(T) Regret

We present the first computationally-efficient algorithm with O(√(T)) r...

0 Alon Cohen, et al. ∙

research

∙ 02/13/2019

Learning and Generalization for Matching Problems

We study a classic algorithmic problem through the lens of statistical l...

0 Alon Cohen, et al. ∙

research

∙ 11/16/2018

Incentivizing the Dynamic Workforce: Learning Contracts in the Gig-Economy

In principal-agent models, a principal offers a contract to an agent to ...

0 Alon Cohen, et al. ∙

research

∙ 06/19/2018

Online Linear Quadratic Control

We study the problem of controlling linear time-invariant systems with k...

0 Alon Cohen, et al. ∙

research

∙ 05/07/2018

Planning and Learning with Stochastic Action Sets

In many practical uses of reinforcement learning (RL) the set of actions...

0 Craig Boutilier, et al. ∙

research

∙ 05/23/2016

Online Learning with Feedback Graphs Without the Graphs

We study an online learning framework introduced by Mannor and Shamir (2...

0 Alon Cohen, et al. ∙

Alon Cohen

Featured Co-authors

Sign in with Google

Consider DeepAI Pro