Most existing works on federated bandits take it for granted that all cl...
The past decade has witnessed the flourishing of a new profession as med...
Off-policy learning, referring to the procedure of policy optimization w...
Meta-reinforcement learning (meta-RL) aims to quickly solve new tasks by...
Content creators compete for exposure on recommendation platforms, and s...
Accuracy and diversity have long been considered to be two conflicting g...
To demonstrate the value of machine learning based smart health technolo...
In recent years, machine learning has achieved impressive results across...
Personalized text generation has broad industrial applications, such as
...
Graph contrastive learning (GCL), as an emerging self-supervised learnin...
Conversational recommender systems (CRS) dynamically obtain the user
pre...
Bandit algorithms have become a reference solution for interactive
recom...
Advertisements (ads) are an innate part of search engine business models...
Deep neural networks (DNNs) demonstrate significant advantages in improv...
We tackle the communication efficiency challenge of learning kernelized
...
Conversational recommender systems (CRS) explicitly solicit users'
prefe...
Explanations in a recommender system assist users in making informed
dec...
In real-world recommendation problems, especially those with a formidabl...
Contextual bandit algorithms have been recently studied under the federa...
Existing Domain Adaptation (DA) algorithms train target models and then ...
Thanks to the power of representation learning, neural contextual bandit...
Most real-world optimization problems have multiple objectives. A system...
Existing online learning to rank (OL2R) solutions are limited to linear
...
Online learning to rank (OL2R) has attracted great research interests in...
Graph Convolutional Networks (GCNs) have fueled a surge of interest due ...
As recommendation is essentially a comparative (or ranking) process, a g...
The exploitation of graph structures is the key to effectively learning
...
Graph embedding techniques have been increasingly employed in real-world...
We study adversarial attacks on linear stochastic bandits, a sequential
...
We propose a new problem setting to study the sequential interactions be...
Linear contextual bandit is a popular online learning problem. It has be...
Crowdsourcing provides an efficient label collection schema for supervis...
Collaborative bandit learning, i.e., bandit algorithms that utilize
coll...
We study the problem of incentivizing exploration for myopic users in li...
Online Learning to Rank (OL2R) eliminates the need of explicit relevance...
Combinatorial optimization problem (COP) over graphs is a fundamental
ch...
Textual explanations have proved to help improve user satisfaction on
ma...
Crowdsourcing provides a practical way to obtain large amounts of labele...
Non-stationary bandits and online clustering of bandits lift the restric...
User-provided multi-aspect evaluations manifest users' detailed feedback...
Implicit feedback, such as user clicks, is a major source of supervision...
Predicting users' preferences based on their sequential behaviors in his...
User representation learning is vital to capture diverse user preference...
Reinforcement learning is effective in optimizing policies for recommend...
Reinforcement learning is effective in optimizing policies for recommend...
Multi-aspect user preferences are attracting wider attention in recommen...
Residential homes constitute roughly one-fourth of the total energy usag...
In this paper, we focus on unsupervised domain adaptation for Machine Re...
Online Learning to Rank (OL2R) algorithms learn from implicit user feedb...
We study the problem of online influence maximization in social networks...