ML models are increasingly being pushed to mobile devices, for low-laten...
Existing defense methods against adversarial attacks can be categorized ...
Machine learning models often fail on out-of-distribution (OOD) samples....
Pretrained large-scale vision-language models like CLIP have exhibited s...
Despite their excellent performance, state-of-the-art computer vision mo...
Many visual recognition models are evaluated only on their classificatio...
Deep networks for computer vision are not reliable when they encounter
a...
Recent years have seen advances on principles and guidance relating to
a...
Many machine learning methods operate by inverting a neural network at
i...
3D reconstruction is a fundamental problem in computer vision, and the t...
Visual representations underlie object recognition tasks, but they often...
Full-precision deep learning models are typically too large or costly to...
Deep networks achieve state-of-the-art performance on computer vision ta...
Automatic speech recognition systems have created exciting possibilities...
Vision Transformer (ViT) is emerging as the state-of-the-art architectur...
We find that images contain intrinsic structure that enables the reversa...
We introduce a framework for learning robust visual representations that...
Although deep networks achieve strong accuracy on a range of computer vi...
Like all software systems, the execution of deep learning models is dict...
Semantic segmentation is one of the most impactful applications of machi...
Deep networks are well-known to be fragile to adversarial attacks. Using...
We consider the problem of inferring the values of an arbitrary set of
v...