Dynamic treatment regimes (DTRs) are sequences of decision rules that re...
While interest in the application of machine learning to improve healthc...
Reinforcement learning (RL) is acquiring a key role in the space of adap...
Multi-armed bandit algorithms like Thompson Sampling can be used to cond...
Using bandit algorithms to conduct adaptive randomised experiments can m...
Multi-armed bandit algorithms have been argued for decades to be useful for...
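
For readers unfamiliar with Thompson Sampling, mentioned in the snippets above, the following is a minimal sketch of the Beta-Bernoulli variant for binary outcomes. It is an illustration under stated assumptions and is not taken from any of the listed papers: the function name `thompson_sampling`, the uniform Beta(1, 1) priors, and the simulated success rates are all hypothetical choices.

```python
import random


def thompson_sampling(true_rates, n_rounds, seed=0):
    """Simulate Beta-Bernoulli Thompson Sampling on arms with binary rewards.

    true_rates: hypothetical success probability of each arm (simulation only;
                a real experiment would observe outcomes instead).
    Returns per-arm lists of observed successes and failures.
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    successes = [0] * n_arms
    failures = [0] * n_arms

    for _ in range(n_rounds):
        # Draw one sample per arm from its Beta posterior (Beta(1, 1) prior assumed).
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Assign the next participant to the arm with the largest sampled value.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulate a binary outcome and update that arm's posterior counts.
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return successes, failures


if __name__ == "__main__":
    s, f = thompson_sampling([0.2, 0.5, 0.8], n_rounds=2000)
    pulls = [si + fi for si, fi in zip(s, f)]
    print("pulls per arm:", pulls)
```

Because each arm is chosen with the posterior probability that it is the best one, allocation drifts towards better-performing arms as evidence accumulates, which is the property that makes the approach attractive for adaptive randomised experiments.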