The unprecedented photorealistic results achieved by recent text-to-imag...
We argue that there are many notions of 'similarity' and that models, li...
Many common methods for learning a world model for pixel-based environme...
Unsupervised visual representation learning offers the opportunity to
le...
Training self-driving systems to be robust to the long-tail of driving
s...
Multi-modal reasoning systems rely on a pre-trained object detector to
e...
We present a new method that views object detection as a direct set
pred...
Effective coordination is crucial to solve multi-agent collaborative (MA...
We formulate the problem of defogging as state estimation and future sta...
Accurate and robust cell nuclei classification is the cornerstone for a ...