Inspired by the recent success of sequence modeling in RL and the use of...
Recently, spoken dialogue systems have been widely deployed in a variety...
Ads allocation, that allocates ads and organic items to limited slots in...
With the recent prevalence of reinforcement learning (RL), there have be...
A mixed list of ads and organic items is usually displayed in feed and h...
E-commerce platforms usually display a mixed list of ads and organic ite...
Recently, the graph neural network (GNN) has shown great power in matrix...
Recently, various auxiliary tasks have been proposed to accelerate
repre...
Modern machine learning models (such as deep neural networks and boostin...
In mobile crowdsourcing (MCS), the platform selects participants to comp...
Exploring the transition dynamics is essential to the success of
reinfor...
We observe that several existing policy gradient methods (such as vanill...