Balázs Szörényi

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Shie Mannor
157 publications
Chen-Yu Wei
33 publications
Dimitris Fotakis
29 publications
Manolis Zampetakis
29 publications
Chicheng Zhang
27 publications
Alexander Golovnev
21 publications
Gal Dalal
19 publications
Alina Beygelzimer
12 publications
Wojciech Kotłowski
12 publications
Gugan Thoppe
11 publications
Robert Busa-Fekete
10 publications

research

∙ 11/20/2019

A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound

Policy evaluation in reinforcement learning is often conducted using two...

0 Gal Dalal, et al. ∙

research

∙ 06/03/2019

Optimal Learning of Mallows Block Model

The Mallows model, introduced in the seminal paper of Mallows 1957, is o...

0 Robert Busa-Fekete, et al. ∙

research

∙ 05/29/2019

Learning to Crawl

Web crawling is the problem of keeping a cache of webpages fresh, i.e., ...

0 Utkarsh Upadhyay, et al. ∙

research

∙ 02/06/2019

Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case

We study the problem of efficient online multiclass linear classificatio...

0 Alina Beygelzimer, et al. ∙

research

∙ 01/16/2019

The information-theoretic value of unlabeled data in semi-supervised learning

We quantify the separation between the numbers of labeled examples requi...

0 Alexander Golovnev, et al. ∙

research

∙ 04/04/2017

Finite Sample Analyses for TD(0) with Function Approximation

TD(0) is one of the most commonly used algorithms in reinforcement learn...

0 Gal Dalal, et al. ∙

research

∙ 04/26/2016

Distributed Clustering of Linear Bandits in Peer to Peer Networks

We provide two distributed confidence ball algorithms for solving linear...

0 Nathan Korda, et al. ∙

Success!

An error occurred

Balázs Szörényi

Featured Co-authors

A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound

Optimal Learning of Mallows Block Model

Learning to Crawl

Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case

The information-theoretic value of unlabeled data in semi-supervised learning

Finite Sample Analyses for TD(0) with Function Approximation

Distributed Clustering of Linear Bandits in Peer to Peer Networks

Sign in with Google

Consider DeepAI Pro