We propose a new class of deep reinforcement learning (RL) algorithms th...
Offline reinforcement learning (RL), which aims to learn an optimal poli...
Most recommender systems are myopic, that is, they optimize based on the ...
Online advertising has typically been more personalized than offline adv...
Listwise ranking losses have been widely studied in recommender systems....
Industrial recommender systems are frequently tasked with approximating ...
Recent work in deep reinforcement learning (RL) has produced algorithms ...
Deep reinforcement learning (RL) algorithms have made great strides in r...
Neural networks augmented with external memory have the ability to learn...
We adapt the ideas underlying the success of Deep Q-Learning to the cont...