Most works on transformers trained with the Masked Language Modeling (ML...
Benchmarking the tradeoff between neural network accuracy and training t...
Multiplying matrices is among the most fundamental and compute-intensive...
Robustness to certain distribution shifts is a key requirement in many M...
Test-time augmentation (TTA)—the aggregation of predictions across trans...
Neural network pruning—the task of reducing the size of a network by rem...
In this paper, we apply a multiple instance learning paradigm to signal-...
Thanks to the rapid proliferation of connected devices, sensor-generated...