Robustness and compactness are two essential components of deep learning...
We propose a novel framework and a solution to tackle the continual lear...
We design a new family of hybrid CNN-ViT neural networks, named FasterVi...
Cascaded computation, whereby predictions are recurrently refined over
s...
We propose RANA, a relightable and articulated neural avatar for the
pho...
Structural pruning can simplify network architecture and improve inferen...
We propose global context vision transformer (GC ViT), a novel architect...
Acquisition and creation of digital human avatars is an important proble...
In this work we demonstrate the vulnerability of vision transformers (Vi...
Federated learning (FL) allows the collaborative training of AI models
w...
We introduce AdaViT, a method that adaptively adjusts the inference cost...
Pruning enables appealing reductions in network memory footprint and tim...
Structural pruning can simplify network architecture and improve inferen...
Transformers yield state-of-the-art results across many tasks. However, ...
Understanding the behavior and vulnerability of pre-trained deep neural
...
Hand pose estimation is difficult due to different environmental conditi...
We present KAMA, a 3D Keypoint Aware Mesh Articulation approach that all...
Training deep neural networks requires gradient estimation from data bat...
We introduce DexYCB, a new dataset for capturing hand grasping of object...
In this work, we study how well different type of approaches generalise ...
Estimating 3D hand pose from 2D images is a difficult, inverse problem d...
One major challenge for monocular 3D human pose estimation in-the-wild i...
As Convolutional Neural Networks (CNNs) are increasingly being employed ...
We introduce DeepInversion, a new method for synthesizing images from th...
Structural pruning of neural network parameters reduces computation, ene...
Inter-personal anatomical differences limit the accuracy of
person-indep...
Parts provide a good intermediate representation of objects that is robu...
In many cases, especially with medical images, it is prohibitively
chall...
Deep residual networks (ResNets) made a recent breakthrough in deep lear...
Estimating the 3D pose of a hand is an essential part of human-computer
...
In this paper, we strive to answer two questions: What is the current st...
In this paper, we address the challenging problem of effi- cient tempora...
We present two techniques to improve landmark localization from partiall...
Estimating surface reflectance (BRDF) is one key component for complete ...
We propose a new formulation for pruning convolutional kernels in neural...