We present speculative sampling, an algorithm for accelerating transform...
Can continuous diffusion models bring the same performance breakthrough ...
We introduce DeepNash, an autonomous agent capable of learning to play t...
Effective decision making involves flexibly relating past experiences an...
We investigate the optimal model size and number of tokens for training ...
The performance of a language model has been shown to be effectively mod...
We enhance auto-regressive language models by conditioning on document c...
We propose a novel policy update that combines regularized policy
optimi...
Beam search is the go-to method for decoding auto-regressive machine
tra...
Constructing agents with planning capabilities has long been one of the ...
The game of chess is the most widely-studied domain in the history of
ar...
Learning to navigate in complex environments with dynamic elements is an...
We introduce a two-layer wavelet scattering network, for object
classifi...