Most offline reinforcement learning (RL) algorithms return a target poli...
Offline reinforcement learning (RL) struggles in environments with rich ...
Goal-conditioned reinforcement learning (RL) is a promising direction fo...
We propose a unifying view to analyze the representation quality of
self...
Most theoretically motivated work in the offline reinforcement learning
...
Several recent works have proposed a class of algorithms for the offline...
A central object of study in Reinforcement Learning (RL) is the Markovia...
Humans have the capability, aided by the expressive compositionality of ...
In Reinforcement Learning, the optimal action at a given state is depend...
SARSA, a classical on-policy control algorithm for reinforcement learnin...
In this paper, we establish the global optimality and convergence rate o...
In Batched Multi-Armed Bandits (BMAB), the policy is not allowed to be
u...
The policy gradient theorem states that the policy should only be update...
By the age of two, children tend to assume that new word categories are ...
We study the problem of Safe Policy Improvement (SPI) under constraints ...
We investigate the discounting mismatch in actor-critic algorithm
implem...
Malfunctioning neurons in the brain sometimes operate synchronously,
rep...
Playing text-based games requires skill in processing natural language a...
We are interested in learning how to update Knowledge Graphs (KG) from t...
Previous work has shown the unreliability of existing algorithms in the ...
Batch Reinforcement Learning (Batch RL) consists in training a policy us...
Can we learn a control policy able to adapt its behaviour in real time s...
We consider the decentralized exploration problem: a set of players
coll...
We propose a recurrent RL agent with an episodic exploration mechanism t...
A common goal in Reinforcement Learning is to derive a good strategy giv...
This position paper formalises an abstract model for complex negotiation...
We consider tackling a single-agent RL problem by distributing it to n
l...
This paper formalises the problem of online algorithm selection in the
c...
In this paper, we propose a framework for solving a single-agent task by...