Jalaj Bhandari | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Fan Liu
50 publications
Daniel Russo
32 publications
Yuchen He
20 publications
Ruiyang Xu
9 publications
Zheqing Zhu
7 publications
Dmytro Korenkevych
5 publications
Alex Nikulkov
2 publications
Raghav Singal
2 publications

research

∙ 05/23/2023

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Auction-based recommender systems are prevalent in online advertising pl...

0 Ruiyang Xu, et al. ∙

research

∙ 07/21/2020

A Note on the Linear Convergence of Policy Gradient Methods

We revisit the finite time analysis of policy gradient methods in the si...

0 Jalaj Bhandari, et al. ∙

research

∙ 06/05/2019

Global Optimality Guarantees For Policy Gradient Methods

Policy gradients methods are perhaps the most widely used class of reinf...

0 Jalaj Bhandari, et al. ∙

research

∙ 06/06/2018

A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation

Temporal difference learning (TD) is a simple iterative algorithm used t...

0 Jalaj Bhandari, et al. ∙

Success!

An error occurred