Remi Tachet | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Alessandro Sordoni
37 publications
Carlo Ratti
33 publications
Romain Laroche
29 publications
Shangtong Zhang
24 publications
Dmitry V. Dylov
23 publications
Yadollah Yaghoobzadeh
22 publications
Hannes Schulz
13 publications
Geoff Gordon
11 publications
Nouha Dziri
11 publications
Paolo Santi
11 publications
Michael Rosenblum
8 publications

research

∙ 02/15/2022

Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms

In Reinforcement Learning, the optimal action at a given state is depend...

0 Romain Laroche, et al. ∙

research

∙ 02/14/2022

On the Chattering of SARSA with Linear Function Approximation

SARSA, a classical on-policy control algorithm for reinforcement learnin...

0 Shangtong Zhang, et al. ∙

research

∙ 11/04/2021

Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch

In this paper, we establish the global optimality and convergence rate o...

0 Shangtong Zhang, et al. ∙

research

∙ 09/29/2021

Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates

The policy gradient theorem states that the policy should only be update...

0 Romain Laroche, et al. ∙

research

∙ 06/25/2021

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Recent contrastive representation learning methods rely on estimating mu...

0 Alessandro Sordoni, et al. ∙

research

∙ 02/22/2020

Reinforcement Learning Framework for Deep Brain Stimulation Study

Malfunctioning neurons in the brain sometimes operate synchronously, rep...

0 Dmitrii Krylov, et al. ∙

research

∙ 11/10/2019

Robust Natural Language Inference Models with Example Forgetting

We investigate whether example forgetting, a recently introduced measure...

0 Yadollah Yaghoobzadeh, et al. ∙

research

∙ 10/13/2017

Estimating savings in parking demand using shared vehicles for home-work commuting

The increasing availability and adoption of shared vehicles as an altern...

0 Dániel Kondor, et al. ∙